L.A.S.L.A. and Collatinus: a convergence in lexica
Document type :
Article dans une revue scientifique
Title :
L.A.S.L.A. and Collatinus: a convergence in lexica
Author(s) :
Verkerk, Philippe [Auteur]
Laboratoire de Physique des Lasers, Atomes et Molécules - UMR 8523 [PhLAM]
Ouvrard, Yves [Auteur]
Fantoli, Margherita [Auteur]
Longrée, Dominique [Auteur]
Laboratoire de Physique des Lasers, Atomes et Molécules - UMR 8523 [PhLAM]
Ouvrard, Yves [Auteur]
Fantoli, Margherita [Auteur]
Longrée, Dominique [Auteur]
Journal title :
Studi e saggi linguistici
Publisher :
ETS
Publication date :
2019
ISSN :
0085-6827
HAL domain(s) :
Sciences de l'Homme et Société/Linguistique
Sciences de l'Homme et Société/Etudes classiques
Sciences de l'Homme et Société/Etudes classiques
English abstract : [en]
L.A.S.L.A. has begun in 1961 a project of lemmatisation and morphosyntactic tagging of Latin texts. This project is still running with new texts lemmatised each year. The resulting files have been recently opened to the ...
Show more >L.A.S.L.A. has begun in 1961 a project of lemmatisation and morphosyntactic tagging of Latin texts. This project is still running with new texts lemmatised each year. The resulting files have been recently opened to the interested scholars and they now count approximatively 2.500.000 words, the lemmatisation of which has been checked by a philologist. In the early 2.000's, Collatinus has been developed by Yves Ouvrard for teaching. Its goal was to generate a complete lexical aid, with a short translation and the morphological analyses of the forms, for any text that can be given to the students. Although these two projects look very different, they met a few years ago in the conception of a new tool to speed up the lemmatisation process of Latin texts at L.A.S.L.A.. This tool is based on a concurrent lemmatisation of each word by looking for the form in those already analysed in the L.A.S.L.A. files and by Collatinus. This lemmatisation is followed by a disambiguation process with a second-order hidden Markov model and the result is presented in a text-editor to be corrected by the philologist.Show less >
Show more >L.A.S.L.A. has begun in 1961 a project of lemmatisation and morphosyntactic tagging of Latin texts. This project is still running with new texts lemmatised each year. The resulting files have been recently opened to the interested scholars and they now count approximatively 2.500.000 words, the lemmatisation of which has been checked by a philologist. In the early 2.000's, Collatinus has been developed by Yves Ouvrard for teaching. Its goal was to generate a complete lexical aid, with a short translation and the morphological analyses of the forms, for any text that can be given to the students. Although these two projects look very different, they met a few years ago in the conception of a new tool to speed up the lemmatisation process of Latin texts at L.A.S.L.A.. This tool is based on a concurrent lemmatisation of each word by looking for the form in those already analysed in the L.A.S.L.A. files and by Collatinus. This lemmatisation is followed by a disambiguation process with a second-order hidden Markov model and the result is presented in a text-editor to be corrected by the philologist.Show less >
Language :
Anglais
Peer reviewed article :
Oui
Audience :
Internationale
Popular science :
Non
Source :
Files
- https://hal.archives-ouvertes.fr/hal-02399878v1/document
- Open access
- Access the document
- https://hal.archives-ouvertes.fr/hal-02399878v1/document
- Open access
- Access the document
- https://hal.archives-ouvertes.fr/hal-02399878v1/document
- Open access
- Access the document
- document
- Open access
- Access the document
- Article_LASLA_Collatinus_final.pdf
- Open access
- Access the document