Leveraging free-text diagnoses to identify patients with diabetes mellitus, obesity or dyslipidaemia - a cross-sectional study in a large Swiss primary care database.

IF 2.1 4区 医学 Q2 MEDICINE, GENERAL & INTERNAL
Donika Balaj, Jakob M Burgstaller, Audrey Wallnöfer, Katja Weiss, Oliver Senn, Thomas Rosemann, Thomas Grischott, Stefan Markun
{"title":"Leveraging free-text diagnoses to identify patients with diabetes mellitus, obesity or dyslipidaemia - a cross-sectional study in a large Swiss primary care database.","authors":"Donika Balaj, Jakob M Burgstaller, Audrey Wallnöfer, Katja Weiss, Oliver Senn, Thomas Rosemann, Thomas Grischott, Stefan Markun","doi":"10.57187/s.3360","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Electronic medical records (EMRs) in general practice provide various methods for identifying patients with specific diagnoses. While several studies have focused on case identification via structured EMR components, diagnoses in general practice are frequently documented as unstructured free-text entries, making their use for research challenging. Furthermore, diagnoses may remain undocumented even when evidence of the underlying disease exists within structured EMR data.</p><p><strong>Objective: </strong>This study aimed to quantify the extent to which free-text diagnoses contribute to identifying additional cases of diabetes mellitus, obesity and dyslipidaemia (target diseases) and assess the cases missed when relying exclusively on free-text entries.</p><p><strong>Methods: </strong>This cross-sectional study utilised EMR data from all consultations up to 2019 for 6,000 patients across 10 general practices in Switzerland. Diagnoses documented in a free-text entry field for diagnoses were manually coded for target diseases. Cases were defined as patients with a corresponding coded free-text diagnosis or meeting predefined criteria in structured EMR components (medication data or clinical and laboratory parameters). For each target disease, prevalence was calculated along with the proportion of cases identified exclusively via free-text diagnoses and the proportion missed when using free-text diagnoses alone.</p><p><strong>Results: </strong>The prevalence estimates for diabetes mellitus, obesity and dyslipidaemia were 8.8%, 16.2% and 38.9%, respectively. Few cases relied exclusively on free-text diagnoses for identification, but a substantial proportion of cases were missed when relying solely on free-text diagnoses, particularly for obesity (19.5% exclusively identified; 50.7% missed) and dyslipidaemia (8.7% exclusively identified; 53.3% missed).</p><p><strong>Conclusion: </strong>Free-text diagnoses were of limited utility for case identification of diabetes mellitus, obesity or dyslipidaemia, suggesting that manual coding of free-text diagnoses may not always be justified. Relying solely on free-text diagnoses for case identification is not recommended, as substantial proportions of cases may remain undetected, leading to biased prevalence estimates.</p>","PeriodicalId":22111,"journal":{"name":"Swiss medical weekly","volume":"155 ","pages":"3360"},"PeriodicalIF":2.1000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swiss medical weekly","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.57187/s.3360","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Electronic medical records (EMRs) in general practice provide various methods for identifying patients with specific diagnoses. While several studies have focused on case identification via structured EMR components, diagnoses in general practice are frequently documented as unstructured free-text entries, making their use for research challenging. Furthermore, diagnoses may remain undocumented even when evidence of the underlying disease exists within structured EMR data.

Objective: This study aimed to quantify the extent to which free-text diagnoses contribute to identifying additional cases of diabetes mellitus, obesity and dyslipidaemia (target diseases) and assess the cases missed when relying exclusively on free-text entries.

Methods: This cross-sectional study utilised EMR data from all consultations up to 2019 for 6,000 patients across 10 general practices in Switzerland. Diagnoses documented in a free-text entry field for diagnoses were manually coded for target diseases. Cases were defined as patients with a corresponding coded free-text diagnosis or meeting predefined criteria in structured EMR components (medication data or clinical and laboratory parameters). For each target disease, prevalence was calculated along with the proportion of cases identified exclusively via free-text diagnoses and the proportion missed when using free-text diagnoses alone.

Results: The prevalence estimates for diabetes mellitus, obesity and dyslipidaemia were 8.8%, 16.2% and 38.9%, respectively. Few cases relied exclusively on free-text diagnoses for identification, but a substantial proportion of cases were missed when relying solely on free-text diagnoses, particularly for obesity (19.5% exclusively identified; 50.7% missed) and dyslipidaemia (8.7% exclusively identified; 53.3% missed).

Conclusion: Free-text diagnoses were of limited utility for case identification of diabetes mellitus, obesity or dyslipidaemia, suggesting that manual coding of free-text diagnoses may not always be justified. Relying solely on free-text diagnoses for case identification is not recommended, as substantial proportions of cases may remain undetected, leading to biased prevalence estimates.

利用自由文本诊断识别糖尿病、肥胖症或血脂异常患者--一项在瑞士大型初级保健数据库中进行的横断面研究。
背景:电子病历(EMRs)在全科实践中提供了多种方法来识别具有特定诊断的患者。虽然有几项研究侧重于通过结构化电子病历组件进行病例识别,但在一般实践中,诊断经常被记录为非结构化的自由文本条目,这使得它们在研究中的使用具有挑战性。此外,即使在结构化电子病历数据中存在潜在疾病的证据,诊断也可能没有记录。目的:本研究旨在量化自由文本诊断在多大程度上有助于识别糖尿病、肥胖和血脂异常(目标疾病)的额外病例,并评估完全依赖自由文本条目时遗漏的病例。方法:这项横断面研究利用了截至2019年瑞士10个全科诊所6000名患者的所有咨询的电子病历数据。在诊断的自由文本输入字段中记录的诊断是针对目标疾病手动编码的。病例定义为具有相应的编码自由文本诊断或满足结构化电子病历组件(药物数据或临床和实验室参数)中预定义标准的患者。对于每种目标疾病,患病率与仅通过自由文本诊断确定的病例比例以及单独使用自由文本诊断时遗漏的比例一起计算。结果:糖尿病患病率为8.8%,肥胖患病率为16.2%,血脂异常患病率为38.9%。很少有病例完全依赖于自由文本诊断进行识别,但当完全依赖于自由文本诊断时,有相当大比例的病例被遗漏,特别是肥胖(19.5%完全被识别;50.7%未检出)和血脂异常(8.7%完全确诊;错过了53.3%)。结论:自由文本诊断在糖尿病、肥胖或血脂异常的病例识别中应用有限,提示手工编码自由文本诊断可能并不总是合理的。不建议仅仅依靠自由文本诊断来确定病例,因为相当大比例的病例可能仍未被发现,导致患病率估计有偏差。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Swiss medical weekly
Swiss medical weekly 医学-医学:内科
CiteScore
5.00
自引率
0.00%
发文量
0
审稿时长
3-8 weeks
期刊介绍: The Swiss Medical Weekly accepts for consideration original and review articles from all fields of medicine. The quality of SMW publications is guaranteed by a consistent policy of rigorous single-blind peer review. All editorial decisions are made by research-active academics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信