Issues underlying a common Sign Language ...
Type de document :
Communication dans un congrès avec actes
Titre :
Issues underlying a common Sign Language Corpora annotation scheme
Auteur(s) :
Titre de la manifestation scientifique :
LREC 2010
Ville :
Valetta
Pays :
Malte
Date de début de la manifestation scientifique :
2010-05-19
Titre de l’ouvrage :
4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language Technologies, LREC 2010
Date de publication :
2010-05-19
Mot(s)-clé(s) en anglais :
Annotation Scheme
Sign Language Corpora
Sign Language Linguistics
Sign Language Corpora
Sign Language Linguistics
Discipline(s) HAL :
Sciences de l'Homme et Société/Linguistique
Résumé en anglais : [en]
Corpus-based Sign Language linguistics has emerged as a new linguistic domain, and as a consequence large-scale and controlled video data repositories are under construction for different Sign Languages. Nevertheless, as ...
Lire la suite >Corpus-based Sign Language linguistics has emerged as a new linguistic domain, and as a consequence large-scale and controlled video data repositories are under construction for different Sign Languages. Nevertheless, as pointed by (Johnston, 2008) no unified annotation scheme is yet available, which compromises any chance of comparing or reusing corpora across research teams. Another related issue is the comparability of descriptions and formalizations between SL linguistics and mainstream linguistics. In this paper, we address the issue of the definition of a common annotation scheme for Sign Language corpora annotation, distribution, exchange and comparison. In section 2. we discuss the challenge of building inter-operable corpora for corpus-based linguistics. We also examine existing annotation schemes or strategies proposed for SL linguistics. In section 3. we propose a small set of annotation tiers, based on Frame-Semantics, as a common annotation scheme. We also propose to add text-level as well as utterance-level metadata to this common annotation scheme, in order to broaden the range of future uses of SL corpora.Lire moins >
Lire la suite >Corpus-based Sign Language linguistics has emerged as a new linguistic domain, and as a consequence large-scale and controlled video data repositories are under construction for different Sign Languages. Nevertheless, as pointed by (Johnston, 2008) no unified annotation scheme is yet available, which compromises any chance of comparing or reusing corpora across research teams. Another related issue is the comparability of descriptions and formalizations between SL linguistics and mainstream linguistics. In this paper, we address the issue of the definition of a common annotation scheme for Sign Language corpora annotation, distribution, exchange and comparison. In section 2. we discuss the challenge of building inter-operable corpora for corpus-based linguistics. We also examine existing annotation schemes or strategies proposed for SL linguistics. In section 3. we propose a small set of annotation tiers, based on Frame-Semantics, as a common annotation scheme. We also propose to add text-level as well as utterance-level metadata to this common annotation scheme, in order to broaden the range of future uses of SL corpora.Lire moins >
Langue :
Anglais
Comité de lecture :
Oui
Audience :
Internationale
Vulgarisation :
Non
Collections :
Source :
Fichiers
- https://halshs.archives-ouvertes.fr/halshs-01077785/document
- Accès libre
- Accéder au document
- https://halshs.archives-ouvertes.fr/halshs-01077785/document
- Accès libre
- Accéder au document
- document
- Accès libre
- Accéder au document
- Balvet_SL-Workshop_Issues%20%282%29.pdf
- Accès libre
- Accéder au document