Improvement of the quality of medical ...
Type de document :
Article dans une revue scientifique: Article original
PMID :
URL permanente :
Titre :
Improvement of the quality of medical databases: data-mining-based prediction of diagnostic codes from previous patient codes
Auteur(s) :
Djennaoui, Mehdi [Auteur]
Ficheur, Gregoire [Auteur]
Beuscart, Regis [Auteur]
Chazard, Emmanuel [Auteur]
Evaluation des technologies de santé et des pratiques médicales - ULR 2694 [METRICS]
Ficheur, Gregoire [Auteur]

Beuscart, Regis [Auteur]
Chazard, Emmanuel [Auteur]

Evaluation des technologies de santé et des pratiques médicales - ULR 2694 [METRICS]
Titre de la revue :
Studies in health technology and informatics
Nom court de la revue :
Stud Health Technol Inform
Numéro :
210
Pagination :
419-23
Date de publication :
2015-01-01
ISSN :
0926-9630
Mot(s)-clé(s) en anglais :
Data mining
Nationwide Database
Electronic Health Records
Decision Support Techniques
Nationwide Database
Electronic Health Records
Decision Support Techniques
Discipline(s) HAL :
Sciences du Vivant [q-bio]
Résumé en anglais : [en]
BACKGROUND: Diagnoses and medical procedures collected under the French system of information are recorded in a nationwide database, the "PMSI national database", which is accessible for exploitation. Quality of the data ...
Lire la suite >BACKGROUND: Diagnoses and medical procedures collected under the French system of information are recorded in a nationwide database, the "PMSI national database", which is accessible for exploitation. Quality of the data in this database is directly related to the quality of coding, which can be of poor quality. Among the proposed methods for the exploitation of health databases, data mining techniques are particularly interesting. Our objective is to build sequential rules for missing diagnoses prediction by data mining of the PMSI national database. METHODS: Our working sample was constructed from the national database for years 2007 to 2010. The information retained for rules construction were medical diagnoses and medical procedures. The rules were selected using a statistical filter, and selected rules were validated by case review based on medical letters, which enabled to estimate the improvement of diagnoses recoding. RESULTS: The work sample was made of 59,170 inpatient stays. The predicted ICD codes were E11 (non-insulin-dependent diabetes mellitus), I48 (atrial fibrillation and flutter) and I50 (heart failure).We validated three sequential rules with a substantial improvement of positive predictive value: {E11,I10,DZQM006}=>{E11} {E11,I10,I48}=>{E11} {I48,I69}=>{I48} CONCLUSIONS: We were able to extract by data mining three simple, reliable and effective sequential rules, with a substantial improvement in diagnoses recoding. The results of our study indicate the opportunity to improve the data quality of the national database by data mining methods.Lire moins >
Lire la suite >BACKGROUND: Diagnoses and medical procedures collected under the French system of information are recorded in a nationwide database, the "PMSI national database", which is accessible for exploitation. Quality of the data in this database is directly related to the quality of coding, which can be of poor quality. Among the proposed methods for the exploitation of health databases, data mining techniques are particularly interesting. Our objective is to build sequential rules for missing diagnoses prediction by data mining of the PMSI national database. METHODS: Our working sample was constructed from the national database for years 2007 to 2010. The information retained for rules construction were medical diagnoses and medical procedures. The rules were selected using a statistical filter, and selected rules were validated by case review based on medical letters, which enabled to estimate the improvement of diagnoses recoding. RESULTS: The work sample was made of 59,170 inpatient stays. The predicted ICD codes were E11 (non-insulin-dependent diabetes mellitus), I48 (atrial fibrillation and flutter) and I50 (heart failure).We validated three sequential rules with a substantial improvement of positive predictive value: {E11,I10,DZQM006}=>{E11} {E11,I10,I48}=>{E11} {I48,I69}=>{I48} CONCLUSIONS: We were able to extract by data mining three simple, reliable and effective sequential rules, with a substantial improvement in diagnoses recoding. The results of our study indicate the opportunity to improve the data quality of the national database by data mining methods.Lire moins >
Langue :
Anglais
Audience :
Internationale
Vulgarisation :
Non
Établissement(s) :
CHU Lille
Université de Lille
Université de Lille
Date de dépôt :
2019-12-09T18:20:02Z