L.A.S.L.A. and Collatinus: a convergence in lexica
Document type:
Book report and critical review
Title:
L.A.S.L.A. and Collatinus: a convergence in lexica
Author(s):
Verkerk, Philippe [Author]
Laboratoire de Physique des Lasers, Atomes et Molécules - UMR 8523 [PhLAM]
Ouvrard, Yves [Author]
Fantoli, Margherita [Author]
Laboratoire d'Analyse Statistique des Langues Anciennes [LASLA]
Longrée, Dominique [Author]
Journal title:
Studi e saggi linguistici
Publisher:
ETS
Publication date:
2019
ISSN :
0085-6827
HAL discipline(s):
Humanities and Social Sciences/Linguistics
Humanities and Social Sciences/Classical studies
Abstract in English: [en]
L.A.S.L.A. began a project of lemmatisation and morphosyntactic tagging of Latin texts in 1961. The project is still running, with new texts lemmatised each year. The resulting files have recently been opened to interested scholars and now contain approximately 2,500,000 words, the lemmatisation of which has been checked by a philologist. In the early 2000s, Yves Ouvrard developed Collatinus for teaching. Its goal was to generate a complete lexical aid, with a short translation and the morphological analyses of the forms, for any text that can be given to students. Although these two projects look very different, they met a few years ago in the design of a new tool to speed up the lemmatisation of Latin texts at L.A.S.L.A. The tool lemmatises each word in two concurrent ways: by looking the form up among those already analysed in the L.A.S.L.A. files, and by analysing it with Collatinus. This lemmatisation is followed by a disambiguation step using a second-order hidden Markov model, and the result is presented in a text editor for correction by the philologist.
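The two-step process described in the abstract (pooling candidate analyses from two sources, then disambiguating with a second-order hidden Markov model) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the dictionaries `CORPUS_FORMS` and `ANALYSER_FORMS` are toy stand-ins for the L.A.S.L.A. files and for a morphological analyser, not the real data or the Collatinus API, and the tag set and probabilities are invented. It is not the authors' implementation.

```python
import math

# Hypothetical toy data: form -> set of (lemma, tag) analyses.
# CORPUS_FORMS stands in for forms already analysed in a tagged corpus,
# ANALYSER_FORMS for the output of a morphological analyser.
CORPUS_FORMS = {
    "amor": {("amor", "NOUN")},
    "vincit": {("vinco", "VERB")},
}
ANALYSER_FORMS = {
    "amor": {("amor", "NOUN"), ("amo", "VERB")},
    "omnia": {("omnis", "ADJ")},
}

# Invented second-order transition probabilities P(tag | two previous tags).
TRIGRAM = {
    ("<s>", "<s>", "NOUN"): 0.6,
    ("<s>", "<s>", "VERB"): 0.2,
    ("<s>", "NOUN", "VERB"): 0.5,
    ("<s>", "VERB", "VERB"): 0.1,
    ("NOUN", "VERB", "ADJ"): 0.4,
    ("VERB", "VERB", "ADJ"): 0.3,
}
# Invented emission probabilities P(form | tag).
EMISSION = {("NOUN", "amor"): 0.01, ("VERB", "amor"): 0.001}
FLOOR = 1e-4  # probability assigned to any unseen event
START = ("<s>", "<s>")  # padding analysis used before the first word


def candidates(form):
    """Pool the analyses proposed by both sources for one form."""
    found = CORPUS_FORMS.get(form, set()) | ANALYSER_FORMS.get(form, set())
    return sorted(found) or [(form, "UNK")]


def step_score(tag2, tag1, tag, form):
    """Log-probability of one step: transition times emission."""
    transition = TRIGRAM.get((tag2, tag1, tag), FLOOR)
    emission = EMISSION.get((tag, form), FLOOR)
    return math.log(transition) + math.log(emission)


def disambiguate(forms):
    """Viterbi search whose states are pairs of consecutive analyses."""
    # Maps (previous analysis, current analysis) -> (best score, best path).
    best = {(START, START): (0.0, [])}
    for form in forms:
        new_best = {}
        for (a2, a1), (score, path) in best.items():
            for a in candidates(form):
                s = score + step_score(a2[1], a1[1], a[1], form)
                key = (a1, a)
                if key not in new_best or s > new_best[key][0]:
                    new_best[key] = (s, path + [a])
        best = new_best
    return max(best.values(), key=lambda item: item[0])[1]
```

With this toy data, `disambiguate(["amor", "vincit", "omnia"])` selects the noun reading `("amor", "NOUN")` over the verb reading `("amo", "VERB")`. In the workflow the abstract describes, such a proposal would then be shown to the philologist for correction rather than accepted automatically.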
Language:
English
Popular science:
No
Source:
Files
- https://hal.archives-ouvertes.fr/hal-02399878v1/document
- Open access
- Access the document
- Article_LASLA_Collatinus_final.pdf
- Open access
- Access the document