{"title":"通过综合血液分析检测缺铁的机器学习。","authors":"Yu-Hsin Chang,Chia-Yu Chen,Chiung-Tzu Hsiao,Yu-Chang Chang,Hsin-Yu Lai,Hsiu-Hsien Lin,Ya-Lun Wu,Chien-Chih Chen,Lin-Chen Hsu,Tzu-Ting Chen,Hong-Mo Shih,Po-Ren Hsueh,Der-Yang Cho","doi":"10.1093/clinchem/hvaf074","DOIUrl":null,"url":null,"abstract":"BACKGROUND\r\nIron deficiency (ID) is a prevalent global health issue with a major impact on well-being. Early detection of ID is crucial but challenging due to its nonspecific symptoms and the limitations of traditional diagnostic tests, which are impractical for large-scale screening. This study proposes a machine learning (ML) approach using complete blood count (CBC) data and cell population data (CPD) for detecting ID in the general population.\r\n\r\nMETHODS\r\nWe retrospectively collected patient data from 3 hospitals to develop and validate 5 ML models using CBC, CPD, and demographic information. After identifying the best-performing model, we evaluated the impact of various feature sets and also assessed model performance across different subgroups to ensure robustness in diverse populations. The model was also deployed and integrated into clinical workflows.\r\n\r\nRESULTS\r\nWe retrospectively enrolled 9608 adult patients across emergency, inpatient, and outpatient departments from 3 hospitals, and prevalence of ID ranged from 17.4% to 19.6%. The ML model achieved an area under the receiver operating characteristic curve (AUROC) exceeding 0.94 and a precision-recall curve values (AUPRC) exceeding 0.83 during validation. After integration into the clinical system, the model maintained stable real-world performance, with an AUROC of 0.948 and an AUPRC of 0.854. Subgroup analysis showed lower performance in male and nonanemic populations.\r\n\r\nCONCLUSIONS\r\nOur study highlights the effectiveness of a ML model integrating CPD with CBC parameters for screening ID in the general population. Leveraging routine blood data without requiring biochemical tests, the model enables efficient and consistent ID screening across cohorts.","PeriodicalId":10690,"journal":{"name":"Clinical chemistry","volume":"672 1","pages":""},"PeriodicalIF":6.3000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning for Detecting Iron Deficiency through Comprehensive Blood Analysis.\",\"authors\":\"Yu-Hsin Chang,Chia-Yu Chen,Chiung-Tzu Hsiao,Yu-Chang Chang,Hsin-Yu Lai,Hsiu-Hsien Lin,Ya-Lun Wu,Chien-Chih Chen,Lin-Chen Hsu,Tzu-Ting Chen,Hong-Mo Shih,Po-Ren Hsueh,Der-Yang Cho\",\"doi\":\"10.1093/clinchem/hvaf074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"BACKGROUND\\r\\nIron deficiency (ID) is a prevalent global health issue with a major impact on well-being. Early detection of ID is crucial but challenging due to its nonspecific symptoms and the limitations of traditional diagnostic tests, which are impractical for large-scale screening. This study proposes a machine learning (ML) approach using complete blood count (CBC) data and cell population data (CPD) for detecting ID in the general population.\\r\\n\\r\\nMETHODS\\r\\nWe retrospectively collected patient data from 3 hospitals to develop and validate 5 ML models using CBC, CPD, and demographic information. After identifying the best-performing model, we evaluated the impact of various feature sets and also assessed model performance across different subgroups to ensure robustness in diverse populations. The model was also deployed and integrated into clinical workflows.\\r\\n\\r\\nRESULTS\\r\\nWe retrospectively enrolled 9608 adult patients across emergency, inpatient, and outpatient departments from 3 hospitals, and prevalence of ID ranged from 17.4% to 19.6%. The ML model achieved an area under the receiver operating characteristic curve (AUROC) exceeding 0.94 and a precision-recall curve values (AUPRC) exceeding 0.83 during validation. After integration into the clinical system, the model maintained stable real-world performance, with an AUROC of 0.948 and an AUPRC of 0.854. Subgroup analysis showed lower performance in male and nonanemic populations.\\r\\n\\r\\nCONCLUSIONS\\r\\nOur study highlights the effectiveness of a ML model integrating CPD with CBC parameters for screening ID in the general population. Leveraging routine blood data without requiring biochemical tests, the model enables efficient and consistent ID screening across cohorts.\",\"PeriodicalId\":10690,\"journal\":{\"name\":\"Clinical chemistry\",\"volume\":\"672 1\",\"pages\":\"\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical chemistry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1093/clinchem/hvaf074\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICAL LABORATORY TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical chemistry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/clinchem/hvaf074","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
Machine Learning for Detecting Iron Deficiency through Comprehensive Blood Analysis.
BACKGROUND
Iron deficiency (ID) is a prevalent global health issue with a major impact on well-being. Early detection of ID is crucial but challenging due to its nonspecific symptoms and the limitations of traditional diagnostic tests, which are impractical for large-scale screening. This study proposes a machine learning (ML) approach using complete blood count (CBC) data and cell population data (CPD) for detecting ID in the general population.
METHODS
We retrospectively collected patient data from 3 hospitals to develop and validate 5 ML models using CBC, CPD, and demographic information. After identifying the best-performing model, we evaluated the impact of various feature sets and also assessed model performance across different subgroups to ensure robustness in diverse populations. The model was also deployed and integrated into clinical workflows.
RESULTS
We retrospectively enrolled 9608 adult patients across emergency, inpatient, and outpatient departments from 3 hospitals, and prevalence of ID ranged from 17.4% to 19.6%. The ML model achieved an area under the receiver operating characteristic curve (AUROC) exceeding 0.94 and a precision-recall curve values (AUPRC) exceeding 0.83 during validation. After integration into the clinical system, the model maintained stable real-world performance, with an AUROC of 0.948 and an AUPRC of 0.854. Subgroup analysis showed lower performance in male and nonanemic populations.
CONCLUSIONS
Our study highlights the effectiveness of a ML model integrating CPD with CBC parameters for screening ID in the general population. Leveraging routine blood data without requiring biochemical tests, the model enables efficient and consistent ID screening across cohorts.
期刊介绍:
Clinical Chemistry is a peer-reviewed scientific journal that is the premier publication for the science and practice of clinical laboratory medicine. It was established in 1955 and is associated with the Association for Diagnostics & Laboratory Medicine (ADLM).
The journal focuses on laboratory diagnosis and management of patients, and has expanded to include other clinical laboratory disciplines such as genomics, hematology, microbiology, and toxicology. It also publishes articles relevant to clinical specialties including cardiology, endocrinology, gastroenterology, genetics, immunology, infectious diseases, maternal-fetal medicine, neurology, nutrition, oncology, and pediatrics.
In addition to original research, editorials, and reviews, Clinical Chemistry features recurring sections such as clinical case studies, perspectives, podcasts, and Q&A articles. It has the highest impact factor among journals of clinical chemistry, laboratory medicine, pathology, analytical chemistry, transfusion medicine, and clinical microbiology.
The journal is indexed in databases such as MEDLINE and Web of Science.