Adaptation of Cross-Lingual Transfer Methods ...
Type de document :
Autre communication scientifique (congrès sans actes - poster - séminaire...): Communication dans un congrès avec actes
Titre :
Adaptation of Cross-Lingual Transfer Methods for the Building of Medical Terminology in Ukrainian
Auteur(s) :
Hamon, Thierry [Auteur]
Université Paris 13 [UP13]
Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur [LIMSI]
Grabar, Natalia [Auteur]
Savoirs, Textes, Langage (STL) - UMR 8163 [STL]
Université Paris 13 [UP13]
Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur [LIMSI]
Grabar, Natalia [Auteur]
Savoirs, Textes, Langage (STL) - UMR 8163 [STL]
Titre de la manifestation scientifique :
CICLING 2016
Ville :
Konya
Pays :
Turquie
Date de début de la manifestation scientifique :
2016-04-03
Mot(s)-clé(s) en anglais :
Cross-Lingual Transfer
Parallel Corpora
Terminology
Ukrainian
Parallel Corpora
Terminology
Ukrainian
Discipline(s) HAL :
Informatique [cs]/Traitement du texte et du document
Sciences de l'Homme et Société/Linguistique
Sciences de l'Homme et Société/Linguistique
Résumé en anglais : [en]
An increasing availability of parallel bilingual corpora and of automatic methods and tools makes it possible to build linguistic and terminological resources for low-resourced languages. We propose to exploit corpora ...
Lire la suite >An increasing availability of parallel bilingual corpora and of automatic methods and tools makes it possible to build linguistic and terminological resources for low-resourced languages. We propose to exploit corpora available in several languages for building bilingual and trilingual terminologies. Typically, terminology information extracted in better resourced languages is associated with the corresponding units in lower-resourced languages thanks to the multilingual transfer. The method is applied on corpora involving Ukrainian language. According to the experiments, precision of term extraction varies between 0.454 and 0.966, while the quality of the interlingual relations varies between 0.309 and 0.965. The resource built contains 4,588 medical terms in Ukrainian and their 34,267 relations with French and English terms.Lire moins >
Lire la suite >An increasing availability of parallel bilingual corpora and of automatic methods and tools makes it possible to build linguistic and terminological resources for low-resourced languages. We propose to exploit corpora available in several languages for building bilingual and trilingual terminologies. Typically, terminology information extracted in better resourced languages is associated with the corresponding units in lower-resourced languages thanks to the multilingual transfer. The method is applied on corpora involving Ukrainian language. According to the experiments, precision of term extraction varies between 0.454 and 0.966, while the quality of the interlingual relations varies between 0.309 and 0.965. The resource built contains 4,588 medical terms in Ukrainian and their 34,267 relations with French and English terms.Lire moins >
Langue :
Anglais
Comité de lecture :
Oui
Audience :
Internationale
Vulgarisation :
Non
Collections :
Source :
Fichiers
- https://hal.archives-ouvertes.fr/hal-01426807/document
- Accès libre
- Accéder au document
- https://hal.archives-ouvertes.fr/hal-01426807/document
- Accès libre
- Accéder au document
- https://hal.archives-ouvertes.fr/hal-01426807/document
- Accès libre
- Accéder au document
- document
- Accès libre
- Accéder au document
- hamon-CICLING2016.pdf
- Accès libre
- Accéder au document