Donika Balaj, Jakob M Burgstaller, Audrey Wallnöfer, Katja Weiss, Oliver Senn, Thomas Rosemann, Thomas Grischott, Stefan Markun
Swiss Medical Weekly 155: 3360 (published 2025-02-13). DOI: 10.57187/s.3360, https://doi.org/10.57187/s.3360
Leveraging free-text diagnoses to identify patients with diabetes mellitus, obesity or dyslipidaemia - a cross-sectional study in a large Swiss primary care database.
Background: Electronic medical records (EMRs) in general practice provide various methods for identifying patients with specific diagnoses. While several studies have focused on case identification via structured EMR components, diagnoses in general practice are frequently documented as unstructured free-text entries, making their use for research challenging. Furthermore, diagnoses may remain undocumented even when evidence of the underlying disease exists within structured EMR data.
Objective: This study aimed to quantify the extent to which free-text diagnoses contribute to identifying additional cases of diabetes mellitus, obesity and dyslipidaemia (target diseases) and assess the cases missed when relying exclusively on free-text entries.
Methods: This cross-sectional study utilised EMR data from all consultations up to 2019 for 6,000 patients across 10 general practices in Switzerland. Diagnoses entered in the free-text diagnosis field were manually coded for target diseases. Cases were defined as patients with a corresponding coded free-text diagnosis or meeting predefined criteria in structured EMR components (medication data or clinical and laboratory parameters). For each target disease, prevalence was calculated along with the proportion of cases identified exclusively via free-text diagnoses and the proportion missed when using free-text diagnoses alone.
Results: The prevalence estimates for diabetes mellitus, obesity and dyslipidaemia were 8.8%, 16.2% and 38.9%, respectively. Few cases relied exclusively on free-text diagnoses for identification, but a substantial proportion of cases were missed when relying solely on free-text diagnoses, particularly for obesity (19.5% exclusively identified; 50.7% missed) and dyslipidaemia (8.7% exclusively identified; 53.3% missed).
Conclusion: Free-text diagnoses were of limited utility for case identification of diabetes mellitus, obesity or dyslipidaemia, suggesting that manual coding of free-text diagnoses may not always be justified. Relying solely on free-text diagnoses for case identification is not recommended, as substantial proportions of cases may remain undetected, leading to biased prevalence estimates.
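The case definition and the three measures described in the Methods can be illustrated with a small set-based sketch. This is not the authors' analysis code. The function name, the field names and the toy patient IDs are hypothetical, and it assumes that both proportions use all identified cases as the denominator, which the abstract implies but does not state explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CaseSummary:
    prevalence: float           # cases / all patients
    exclusive_free_text: float  # share of cases found only via free-text diagnoses
    missed_by_free_text: float  # share of cases lacking a coded free-text diagnosis


def summarise(all_patients: set[int], free_text: set[int], structured: set[int]) -> CaseSummary:
    """Combine both sources into one case set and derive the three measures.

    A patient counts as a case if flagged by a coded free-text diagnosis OR by
    structured criteria (medication, clinical or laboratory data).
    """
    cases = (free_text | structured) & all_patients
    if not cases:
        # No cases (or no patients): nothing to divide by.
        return CaseSummary(0.0, 0.0, 0.0)
    only_free_text = (free_text - structured) & all_patients
    missed = (structured - free_text) & all_patients
    return CaseSummary(
        prevalence=len(cases) / len(all_patients),
        exclusive_free_text=len(only_free_text) / len(cases),
        missed_by_free_text=len(missed) / len(cases),
    )


# Made-up toy data: ten patients, not taken from the study.
patients = set(range(1, 11))
free_text_cases = {1, 2, 3}      # hypothetical: coded free-text diagnosis present
structured_cases = {2, 3, 4, 5}  # hypothetical: structured criteria met

result = summarise(patients, free_text_cases, structured_cases)
print(f"prevalence={result.prevalence:.1%}, "
      f"exclusive={result.exclusive_free_text:.1%}, "
      f"missed={result.missed_by_free_text:.1%}")
```

In this toy example five of the ten patients are cases, one of them is found only through free text, and two would be missed by a free-text-only search. The same pattern, a small exclusive share and a larger missed share, is what the Results report for obesity and dyslipidaemia.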
About the journal:
The Swiss Medical Weekly accepts for consideration original and review articles from all fields of medicine. The quality of SMW publications is guaranteed by a consistent policy of rigorous single-blind peer review. All editorial decisions are made by research-active academics.