Visualization of medical concepts represented ...
Type de document :
Article dans une revue scientifique: Article original
PMID :
URL permanente :
Titre :
Visualization of medical concepts represented using word embeddings: a scoping review.
Auteur(s) :
Oubenali, Naima [Auteur]
Laboratoire Traitement du Signal et de l'Image [LTSI]
Messaoud, Sabrina [Auteur]
Filiot, Alexandre [Auteur]
Centre Hospitalier Régional Universitaire [CHU Lille] [CHRU Lille]
Lamer, Antoine [Auteur]
METRICS : Evaluation des technologies de santé et des pratiques médicales - ULR 2694
Laboratoire Traitement du Signal et de l'Image [LTSI]
Messaoud, Sabrina [Auteur]
Filiot, Alexandre [Auteur]
Centre Hospitalier Régional Universitaire [CHU Lille] [CHRU Lille]
Lamer, Antoine [Auteur]

METRICS : Evaluation des technologies de santé et des pratiques médicales - ULR 2694
Titre de la revue :
BMC Medical Informatics and Decision Making
Nom court de la revue :
BMC Med Inform Decis Mak
Numéro :
22
Pagination :
83
Date de publication :
2022-04-03
ISSN :
1472-6947
Mot(s)-clé(s) en anglais :
Word embeddings
Visualization
Data mining
Medical
Natural language processing
Deep learning
Visualization
Data mining
Medical
Natural language processing
Deep learning
Discipline(s) HAL :
Sciences du Vivant [q-bio]
Résumé en anglais : [en]
Background
Analyzing the unstructured textual data contained in electronic health records (EHRs) has always been a challenging task. Word embedding methods have become an essential foundation for neural network-based ...
Lire la suite >Background Analyzing the unstructured textual data contained in electronic health records (EHRs) has always been a challenging task. Word embedding methods have become an essential foundation for neural network-based approaches in natural language processing (NLP), to learn dense and low-dimensional word representations from large unlabeled corpora that capture the implicit semantics of words. Models like Word2Vec, GloVe or FastText have been broadly applied and reviewed in the bioinformatics and healthcare fields, most often to embed clinical notes or activity and diagnostic codes. Visualization of the learned embeddings has been used in a subset of these works, whether for exploratory or evaluation purposes. However, visualization practices tend to be heterogeneous, and lack overall guidelines. Objective This scoping review aims to describe the methods and strategies used to visualize medical concepts represented using word embedding methods. We aim to understand the objectives of the visualizations and their limits. Methods This scoping review summarizes different methods used to visualize word embeddings in healthcare. We followed the methodology proposed by Arksey and O’Malley (Int J Soc Res Methodol 8:19–32, 2005) and by Levac et al. (Implement Sci 5:69, 2010) to better analyze the data and provide a synthesis of the literature on the matter. Results We first obtained 471 unique articles from a search conducted in PubMed, MedRxiv and arXiv databases. 30 of these were effectively reviewed, based on our inclusion and exclusion criteria. 23 articles were excluded in the full review stage, resulting in the analysis of 7 papers that fully correspond to our inclusion criteria. Included papers pursued a variety of objectives and used distinct methods to evaluate their embeddings and to visualize them. Visualization also served heterogeneous purposes, being alternatively used as a way to explore the embeddings, to evaluate them or to merely illustrate properties otherwise formally assessed. Conclusions Visualization helps to explore embedding results (further dimensionality reduction, synthetic representation). However, it does not exhaust the information conveyed by the embeddings nor constitute a self-sustaining evaluation method of their pertinence.Lire moins >
Lire la suite >Background Analyzing the unstructured textual data contained in electronic health records (EHRs) has always been a challenging task. Word embedding methods have become an essential foundation for neural network-based approaches in natural language processing (NLP), to learn dense and low-dimensional word representations from large unlabeled corpora that capture the implicit semantics of words. Models like Word2Vec, GloVe or FastText have been broadly applied and reviewed in the bioinformatics and healthcare fields, most often to embed clinical notes or activity and diagnostic codes. Visualization of the learned embeddings has been used in a subset of these works, whether for exploratory or evaluation purposes. However, visualization practices tend to be heterogeneous, and lack overall guidelines. Objective This scoping review aims to describe the methods and strategies used to visualize medical concepts represented using word embedding methods. We aim to understand the objectives of the visualizations and their limits. Methods This scoping review summarizes different methods used to visualize word embeddings in healthcare. We followed the methodology proposed by Arksey and O’Malley (Int J Soc Res Methodol 8:19–32, 2005) and by Levac et al. (Implement Sci 5:69, 2010) to better analyze the data and provide a synthesis of the literature on the matter. Results We first obtained 471 unique articles from a search conducted in PubMed, MedRxiv and arXiv databases. 30 of these were effectively reviewed, based on our inclusion and exclusion criteria. 23 articles were excluded in the full review stage, resulting in the analysis of 7 papers that fully correspond to our inclusion criteria. Included papers pursued a variety of objectives and used distinct methods to evaluate their embeddings and to visualize them. Visualization also served heterogeneous purposes, being alternatively used as a way to explore the embeddings, to evaluate them or to merely illustrate properties otherwise formally assessed. Conclusions Visualization helps to explore embedding results (further dimensionality reduction, synthetic representation). However, it does not exhaust the information conveyed by the embeddings nor constitute a self-sustaining evaluation method of their pertinence.Lire moins >
Langue :
Anglais
Audience :
Internationale
Vulgarisation :
Non
Établissement(s) :
Université de Lille
CHU Lille
CHU Lille
Date de dépôt :
2023-11-15T04:24:58Z
2024-04-26T13:05:50Z
2024-04-26T13:05:50Z
Fichiers
- s12911-022-01822-9.pdf
- Non spécifié
- Accès libre
- Accéder au document