CLEAR-Simple Corpus for Medical French
Type de document :
Autre communication scientifique (congrès sans actes - poster - séminaire...): Communication dans un congrès avec actes
Titre :
CLEAR-Simple Corpus for Medical French
Auteur(s) :
Titre de la manifestation scientifique :
ATA
Ville :
Tilburg
Pays :
Pays-Bas
Date de début de la manifestation scientifique :
2018-11-08
Discipline(s) HAL :
Sciences de l'Homme et Société/Sciences de l'information et de la communication
Informatique [cs]/Traitement du texte et du document
Informatique [cs]/Traitement du texte et du document
Résumé en anglais : [en]
Availability of corpora with technical and simplified contents is crucial for the development and test of methods for text simplification. We describe this kind of corpus for the French medical language. The corpus contains ...
Lire la suite >Availability of corpora with technical and simplified contents is crucial for the development and test of methods for text simplification. We describe this kind of corpus for the French medical language. The corpus contains texts from three sources: encyclopedia, drug leaflets and scientific summaries. Each source proposes comparable information in specialized and plain languages. A subset of this corpus has been processed manually in order to find and align parallel sentences. This subset currently contains 663 pairs with parallel sentences. Alignment has been done by two annota-tors and shows 0.76 inter-annotator agreement .Lire moins >
Lire la suite >Availability of corpora with technical and simplified contents is crucial for the development and test of methods for text simplification. We describe this kind of corpus for the French medical language. The corpus contains texts from three sources: encyclopedia, drug leaflets and scientific summaries. Each source proposes comparable information in specialized and plain languages. A subset of this corpus has been processed manually in order to find and align parallel sentences. This subset currently contains 663 pairs with parallel sentences. Alignment has been done by two annota-tors and shows 0.76 inter-annotator agreement .Lire moins >
Langue :
Anglais
Comité de lecture :
Oui
Audience :
Internationale
Vulgarisation :
Non
Collections :
Source :
Fichiers
- https://halshs.archives-ouvertes.fr/halshs-01968355/document
- Accès libre
- Accéder au document
- https://halshs.archives-ouvertes.fr/halshs-01968355/document
- Accès libre
- Accéder au document
- document
- Accès libre
- Accéder au document
- grabar-ATA2018c.pdf
- Accès libre
- Accéder au document