通过综合血液分析检测缺铁的机器学习。

IF 6.3 2区 医学 Q1 MEDICAL LABORATORY TECHNOLOGY
Yu-Hsin Chang,Chia-Yu Chen,Chiung-Tzu Hsiao,Yu-Chang Chang,Hsin-Yu Lai,Hsiu-Hsien Lin,Ya-Lun Wu,Chien-Chih Chen,Lin-Chen Hsu,Tzu-Ting Chen,Hong-Mo Shih,Po-Ren Hsueh,Der-Yang Cho
{"title":"通过综合血液分析检测缺铁的机器学习。","authors":"Yu-Hsin Chang,Chia-Yu Chen,Chiung-Tzu Hsiao,Yu-Chang Chang,Hsin-Yu Lai,Hsiu-Hsien Lin,Ya-Lun Wu,Chien-Chih Chen,Lin-Chen Hsu,Tzu-Ting Chen,Hong-Mo Shih,Po-Ren Hsueh,Der-Yang Cho","doi":"10.1093/clinchem/hvaf074","DOIUrl":null,"url":null,"abstract":"BACKGROUND\r\nIron deficiency (ID) is a prevalent global health issue with a major impact on well-being. Early detection of ID is crucial but challenging due to its nonspecific symptoms and the limitations of traditional diagnostic tests, which are impractical for large-scale screening. This study proposes a machine learning (ML) approach using complete blood count (CBC) data and cell population data (CPD) for detecting ID in the general population.\r\n\r\nMETHODS\r\nWe retrospectively collected patient data from 3 hospitals to develop and validate 5 ML models using CBC, CPD, and demographic information. After identifying the best-performing model, we evaluated the impact of various feature sets and also assessed model performance across different subgroups to ensure robustness in diverse populations. The model was also deployed and integrated into clinical workflows.\r\n\r\nRESULTS\r\nWe retrospectively enrolled 9608 adult patients across emergency, inpatient, and outpatient departments from 3 hospitals, and prevalence of ID ranged from 17.4% to 19.6%. The ML model achieved an area under the receiver operating characteristic curve (AUROC) exceeding 0.94 and a precision-recall curve values (AUPRC) exceeding 0.83 during validation. After integration into the clinical system, the model maintained stable real-world performance, with an AUROC of 0.948 and an AUPRC of 0.854. Subgroup analysis showed lower performance in male and nonanemic populations.\r\n\r\nCONCLUSIONS\r\nOur study highlights the effectiveness of a ML model integrating CPD with CBC parameters for screening ID in the general population. Leveraging routine blood data without requiring biochemical tests, the model enables efficient and consistent ID screening across cohorts.","PeriodicalId":10690,"journal":{"name":"Clinical chemistry","volume":"672 1","pages":""},"PeriodicalIF":6.3000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning for Detecting Iron Deficiency through Comprehensive Blood Analysis.\",\"authors\":\"Yu-Hsin Chang,Chia-Yu Chen,Chiung-Tzu Hsiao,Yu-Chang Chang,Hsin-Yu Lai,Hsiu-Hsien Lin,Ya-Lun Wu,Chien-Chih Chen,Lin-Chen Hsu,Tzu-Ting Chen,Hong-Mo Shih,Po-Ren Hsueh,Der-Yang Cho\",\"doi\":\"10.1093/clinchem/hvaf074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"BACKGROUND\\r\\nIron deficiency (ID) is a prevalent global health issue with a major impact on well-being. Early detection of ID is crucial but challenging due to its nonspecific symptoms and the limitations of traditional diagnostic tests, which are impractical for large-scale screening. This study proposes a machine learning (ML) approach using complete blood count (CBC) data and cell population data (CPD) for detecting ID in the general population.\\r\\n\\r\\nMETHODS\\r\\nWe retrospectively collected patient data from 3 hospitals to develop and validate 5 ML models using CBC, CPD, and demographic information. After identifying the best-performing model, we evaluated the impact of various feature sets and also assessed model performance across different subgroups to ensure robustness in diverse populations. The model was also deployed and integrated into clinical workflows.\\r\\n\\r\\nRESULTS\\r\\nWe retrospectively enrolled 9608 adult patients across emergency, inpatient, and outpatient departments from 3 hospitals, and prevalence of ID ranged from 17.4% to 19.6%. The ML model achieved an area under the receiver operating characteristic curve (AUROC) exceeding 0.94 and a precision-recall curve values (AUPRC) exceeding 0.83 during validation. After integration into the clinical system, the model maintained stable real-world performance, with an AUROC of 0.948 and an AUPRC of 0.854. Subgroup analysis showed lower performance in male and nonanemic populations.\\r\\n\\r\\nCONCLUSIONS\\r\\nOur study highlights the effectiveness of a ML model integrating CPD with CBC parameters for screening ID in the general population. Leveraging routine blood data without requiring biochemical tests, the model enables efficient and consistent ID screening across cohorts.\",\"PeriodicalId\":10690,\"journal\":{\"name\":\"Clinical chemistry\",\"volume\":\"672 1\",\"pages\":\"\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical chemistry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1093/clinchem/hvaf074\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICAL LABORATORY TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical chemistry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/clinchem/hvaf074","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:缺铁是一个普遍存在的全球健康问题,对人类福祉有重大影响。早期检测是至关重要的,但由于其非特异性症状和传统诊断测试的局限性,这对于大规模筛查是不切实际的,因此具有挑战性。本研究提出了一种机器学习(ML)方法,使用全血细胞计数(CBC)数据和细胞群数据(CPD)来检测普通人群中的ID。方法回顾性收集3家医院的患者数据,利用CBC、CPD和人口统计信息开发并验证5ml模型。在确定了表现最好的模型后,我们评估了各种特征集的影响,并评估了模型在不同子组中的表现,以确保在不同人群中的稳健性。该模型也被部署并集成到临床工作流程中。结果我们回顾性纳入了来自3家医院急诊、住院和门诊的9608名成年患者,ID的患病率从17.4%到19.6%不等。在验证过程中,ML模型的受试者工作特征曲线下面积(AUROC)超过0.94,精确查全率曲线值(AUPRC)超过0.83。模型融入临床系统后,在真实世界中保持稳定的性能,AUROC为0.948,AUPRC为0.854。亚组分析显示,男性和非贫血人群的表现较差。结论我们的研究强调了结合CPD和CBC参数的ML模型在普通人群中筛查ID的有效性。利用常规血液数据而无需生化测试,该模型可以在队列中进行高效和一致的身份筛查。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine Learning for Detecting Iron Deficiency through Comprehensive Blood Analysis.
BACKGROUND Iron deficiency (ID) is a prevalent global health issue with a major impact on well-being. Early detection of ID is crucial but challenging due to its nonspecific symptoms and the limitations of traditional diagnostic tests, which are impractical for large-scale screening. This study proposes a machine learning (ML) approach using complete blood count (CBC) data and cell population data (CPD) for detecting ID in the general population. METHODS We retrospectively collected patient data from 3 hospitals to develop and validate 5 ML models using CBC, CPD, and demographic information. After identifying the best-performing model, we evaluated the impact of various feature sets and also assessed model performance across different subgroups to ensure robustness in diverse populations. The model was also deployed and integrated into clinical workflows. RESULTS We retrospectively enrolled 9608 adult patients across emergency, inpatient, and outpatient departments from 3 hospitals, and prevalence of ID ranged from 17.4% to 19.6%. The ML model achieved an area under the receiver operating characteristic curve (AUROC) exceeding 0.94 and a precision-recall curve values (AUPRC) exceeding 0.83 during validation. After integration into the clinical system, the model maintained stable real-world performance, with an AUROC of 0.948 and an AUPRC of 0.854. Subgroup analysis showed lower performance in male and nonanemic populations. CONCLUSIONS Our study highlights the effectiveness of a ML model integrating CPD with CBC parameters for screening ID in the general population. Leveraging routine blood data without requiring biochemical tests, the model enables efficient and consistent ID screening across cohorts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Clinical chemistry
Clinical chemistry 医学-医学实验技术
CiteScore
11.30
自引率
4.30%
发文量
212
审稿时长
1.7 months
期刊介绍: Clinical Chemistry is a peer-reviewed scientific journal that is the premier publication for the science and practice of clinical laboratory medicine. It was established in 1955 and is associated with the Association for Diagnostics & Laboratory Medicine (ADLM). The journal focuses on laboratory diagnosis and management of patients, and has expanded to include other clinical laboratory disciplines such as genomics, hematology, microbiology, and toxicology. It also publishes articles relevant to clinical specialties including cardiology, endocrinology, gastroenterology, genetics, immunology, infectious diseases, maternal-fetal medicine, neurology, nutrition, oncology, and pediatrics. In addition to original research, editorials, and reviews, Clinical Chemistry features recurring sections such as clinical case studies, perspectives, podcasts, and Q&A articles. It has the highest impact factor among journals of clinical chemistry, laboratory medicine, pathology, analytical chemistry, transfusion medicine, and clinical microbiology. The journal is indexed in databases such as MEDLINE and Web of Science.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信