Document type:
Other scientific communication (conference without proceedings, poster, seminar, etc.): Conference paper without proceedings
Permanent URL:
Title:
Alector: A Parallel Corpus of Simplified French Texts with Alignments of Misreadings by Poor and Dyslexic Readers
Author(s):
Gala, Núria [Author]
Laboratoire Parole et Langage [LPL]
Tack, Anaïs [Author]
Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur [LIMSI]
Javourey-Drevet, Ludivine [Author]
Apprentissage, Didactique, Evaluation, Formation [ADEF]
François, Thomas [Author]
Université Catholique de Louvain = Catholic University of Louvain [UCL]
Ziegler, Johannes C. [Author]
Laboratoire de psychologie cognitive [LPC]
Title of the scientific event:
Language Resources and Evaluation for Language Technologies (LREC)
City:
Marseille
Country:
France
Start date of the scientific event:
2020-05-11
English keyword(s):
Dyslexia
Poor-readers
Misreading
Linguistic complexity
Readability
Text simplification
Parallel corpora
HAL discipline(s):
Humanities and Social Sciences/Linguistics
English abstract: [en]
In this paper, we present a new parallel corpus addressed to researchers, teachers, and speech therapists interested in text simplification as a means of alleviating difficulties in children learning to read. The corpus is composed of excerpts drawn from 79 authentic literary (tales, stories) and scientific (documentary) texts commonly used in French schools for children aged 7 to 9. The excerpts were manually simplified at the lexical, morpho-syntactic, and discourse levels in order to propose a parallel corpus for reading tests and for the development of automatic text simplification tools. A sample of 21 poor-reading and dyslexic children with an average reading delay of 2.5 years read a portion of the corpus. The transcripts of reading errors were integrated into the corpus with the goal of identifying lexical difficulty in the target population. By means of statistical testing, we provide evidence that the manual simplifications significantly reduced reading errors, highlighting that the words targeted for simplification were not only well-chosen but also substituted with substantially easier alternatives. The entire corpus is available for consultation through a web interface and available on demand for research purposes.
Language:
English
Peer-reviewed:
Yes
Audience:
International
Popular science:
No
Institution(s):
Université de Lille
Collections:
Deposit date:
2022-05-04T15:04:48Z
2022-06-07T13:26:31Z
2022-07-05T09:31:02Z
Files
- document
- Open access
- Access the document