Facing the facts of fake: a distributional ...
Document type :
Compte-rendu et recension critique d'ouvrage
Title :
Facing the facts of fake: a distributional semantics and corpus annotation approach
Author(s) :
Cappelle, Bert [Auteur]
Savoirs, Textes, Langage (STL) - UMR 8163 [STL]
Denis, Pascal [Auteur]
Machine Learning in Information Networks [MAGNET]
Keller, Mikaela [Auteur]
Machine Learning in Information Networks [MAGNET]
Savoirs, Textes, Langage (STL) - UMR 8163 [STL]
Denis, Pascal [Auteur]
Machine Learning in Information Networks [MAGNET]
Keller, Mikaela [Auteur]
Machine Learning in Information Networks [MAGNET]
Journal title :
Yearbook of the German Cognitive Linguistics Association
Publisher :
De Gruyter
Publication date :
2018-11-28
ISSN :
2197-2796
English keyword(s) :
distributional semantics
bigram
modification
fake
privative adjective
context-sensitivity
word and phrase embedding
bigram
modification
fake
privative adjective
context-sensitivity
word and phrase embedding
HAL domain(s) :
Sciences de l'Homme et Société/Sciences de l'information et de la communication
English abstract : [en]
Fake is often considered the textbook example of a so-called 'privative' adjective, one which, in other words, allows the proposition that '(a) fake x is not (an) x'. This study tests the hypothesis that the contexts of ...
Show more >Fake is often considered the textbook example of a so-called 'privative' adjective, one which, in other words, allows the proposition that '(a) fake x is not (an) x'. This study tests the hypothesis that the contexts of an adjective-noun combination are more different from the contexts of the noun when the adjective is such a 'privative' one than when it is an ordinary (subsective) one. We here use 'embeddings', that is, dense vector representations based on word co-occurrences in a large corpus, which in our study is the entire English Wikipedia as it was in 2013. Comparing the cosine distance between the adjective-noun bigram and single noun embeddings across two sets of adjectives, privative and ordinary ones, we fail to find a noticeable difference. However, we contest that fake is an across-the-board privative adjective, since a fake article, for instance, is most definitely still an article. We extend a recent proposal involving the noun's qualia roles (how an entity is made, what it consists of, what it is used for, etc.) and propose several interpretational types of fake-noun combinations, some but not all of which are privative. These interpretations, which we assign manually to the 100 most frequent fake-noun combinations in the Wikipedia corpus, depend to a large extent on the meaning of the noun, as combinations with similar interpretations tend to involve nouns that are linked in a distributions-based network. When we restrict our focus to the privative uses of fake only, we do detect a slightly enlarged difference between fake + noun bigram and noun distributions compared to the previously obtained average difference between adjective + noun bigram and noun distributions. This result contrasts with negative or even opposite findings reported in the literature.Show less >
Show more >Fake is often considered the textbook example of a so-called 'privative' adjective, one which, in other words, allows the proposition that '(a) fake x is not (an) x'. This study tests the hypothesis that the contexts of an adjective-noun combination are more different from the contexts of the noun when the adjective is such a 'privative' one than when it is an ordinary (subsective) one. We here use 'embeddings', that is, dense vector representations based on word co-occurrences in a large corpus, which in our study is the entire English Wikipedia as it was in 2013. Comparing the cosine distance between the adjective-noun bigram and single noun embeddings across two sets of adjectives, privative and ordinary ones, we fail to find a noticeable difference. However, we contest that fake is an across-the-board privative adjective, since a fake article, for instance, is most definitely still an article. We extend a recent proposal involving the noun's qualia roles (how an entity is made, what it consists of, what it is used for, etc.) and propose several interpretational types of fake-noun combinations, some but not all of which are privative. These interpretations, which we assign manually to the 100 most frequent fake-noun combinations in the Wikipedia corpus, depend to a large extent on the meaning of the noun, as combinations with similar interpretations tend to involve nouns that are linked in a distributions-based network. When we restrict our focus to the privative uses of fake only, we do detect a slightly enlarged difference between fake + noun bigram and noun distributions compared to the previously obtained average difference between adjective + noun bigram and noun distributions. This result contrasts with negative or even opposite findings reported in the literature.Show less >
Language :
Anglais
Popular science :
Non
Collections :
Source :
Files
- https://hal.archives-ouvertes.fr/hal-01959609/document
- Open access
- Access the document
- https://hal.archives-ouvertes.fr/hal-01959609/document
- Open access
- Access the document
- https://hal.archives-ouvertes.fr/hal-01959609/document
- Open access
- Access the document
- document
- Open access
- Access the document
- Facing%20the%20facts%20of%20fake%20__%20Authors%27%20version.pdf
- Open access
- Access the document