基于红细胞参数的机器学习识别小细胞性低色素贫血。

Jing Lv, Jinmi Li, Xiaodong Ren, Qing Huang, Shaoli Deng
{"title":"基于红细胞参数的机器学习识别小细胞性低色素贫血。","authors":"Jing Lv, Jinmi Li, Xiaodong Ren, Qing Huang, Shaoli Deng","doi":"10.1111/ijlh.14524","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Thalassemia trait (TT) and iron deficiency anemia (IDA) are two common types of microcytic hypochromic anemia (MHA), but current diagnostic methods have limitations. This research sought to employ machine learning (ML) algorithms to identify MHA using erythrocyte parameters to distinguish between TT and IDA.</p><p><strong>Methods: </strong>One hundred and ninety-three subjects with MHA (98 TT and 95 IDA) were retrospectively analyzed. The cohort was randomized to training set (60%), validation set (20%) and test set (20%). Erythrocyte parameters were collected on an automated hematology analyzer (DxH800, Beckman Coulter), and five ML algorithms were selected to build discriminant models, including Random Forest, XGBoost, logistic regression, AdaBoost and LightGBM. In the assessment of discriminant performance of different models, indicators including sensitivity, specificity, accuracy, AUC, NPV, PPV, cutoff, F1 score and Kappa coefficients were utilized.</p><p><strong>Results: </strong>Among the five ML algorithms aforementioned, the Random Forest and logistic regression models presented excellent discriminant performance, outperforming other models in the testing set, with the AUC value, sensitivity, specificity, and ACC of 0.977, 0.928, 0.953, and 0.940 for Random Forest, and 0.978, 0.879, 0.979, 0.928 for logistic regression. Eight vital peripheral erythrocyte parameters were finally selected, including RBC, RDW, MCV, MCHC, RDWSD, HGB, MAF, and LHD.</p><p><strong>Conclusion: </strong>We successfully developed a discriminant model using ML algorithms based on erythrocyte parameters to identify MHA rapidly from TT or IDA, which may assist patients in taking preventive measures.</p>","PeriodicalId":94050,"journal":{"name":"International journal of laboratory hematology","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning for Discriminating Microcytic Hypochromic Anemia Based on Erythrocyte Parameters.\",\"authors\":\"Jing Lv, Jinmi Li, Xiaodong Ren, Qing Huang, Shaoli Deng\",\"doi\":\"10.1111/ijlh.14524\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Thalassemia trait (TT) and iron deficiency anemia (IDA) are two common types of microcytic hypochromic anemia (MHA), but current diagnostic methods have limitations. This research sought to employ machine learning (ML) algorithms to identify MHA using erythrocyte parameters to distinguish between TT and IDA.</p><p><strong>Methods: </strong>One hundred and ninety-three subjects with MHA (98 TT and 95 IDA) were retrospectively analyzed. The cohort was randomized to training set (60%), validation set (20%) and test set (20%). Erythrocyte parameters were collected on an automated hematology analyzer (DxH800, Beckman Coulter), and five ML algorithms were selected to build discriminant models, including Random Forest, XGBoost, logistic regression, AdaBoost and LightGBM. In the assessment of discriminant performance of different models, indicators including sensitivity, specificity, accuracy, AUC, NPV, PPV, cutoff, F1 score and Kappa coefficients were utilized.</p><p><strong>Results: </strong>Among the five ML algorithms aforementioned, the Random Forest and logistic regression models presented excellent discriminant performance, outperforming other models in the testing set, with the AUC value, sensitivity, specificity, and ACC of 0.977, 0.928, 0.953, and 0.940 for Random Forest, and 0.978, 0.879, 0.979, 0.928 for logistic regression. Eight vital peripheral erythrocyte parameters were finally selected, including RBC, RDW, MCV, MCHC, RDWSD, HGB, MAF, and LHD.</p><p><strong>Conclusion: </strong>We successfully developed a discriminant model using ML algorithms based on erythrocyte parameters to identify MHA rapidly from TT or IDA, which may assist patients in taking preventive measures.</p>\",\"PeriodicalId\":94050,\"journal\":{\"name\":\"International journal of laboratory hematology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of laboratory hematology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1111/ijlh.14524\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of laboratory hematology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1111/ijlh.14524","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

地中海贫血(Thalassemia trait, TT)和缺铁性贫血(iron deficiency anemia, IDA)是两种常见的小细胞性低色素贫血(microcytic hypochromic anemia, MHA),但目前的诊断方法存在局限性。本研究试图采用机器学习(ML)算法来识别MHA,使用红细胞参数来区分TT和IDA。方法:对193例MHA患者(TT 98例,IDA 95例)进行回顾性分析。队列随机分为训练集(60%)、验证集(20%)和测试集(20%)。在自动血液学分析仪(DxH800, Beckman Coulter)上采集红细胞参数,选择5种ML算法建立判别模型,包括Random Forest、XGBoost、logistic回归、AdaBoost和LightGBM。在评价不同模型的判别性能时,采用敏感性、特异性、准确性、AUC、NPV、PPV、截止值、F1评分和Kappa系数等指标。结果:在上述5种ML算法中,Random Forest和logistic回归模型的判别性能优异,在测试集中优于其他模型,Random Forest的AUC值、灵敏度、特异度和ACC分别为0.977、0.928、0.953和0.940,logistic回归的AUC值分别为0.978、0.879、0.979和0.928。最终选取8个重要外周血红细胞参数,包括RBC、RDW、MCV、MCHC、RDWSD、HGB、MAF和LHD。结论:我们成功建立了基于红细胞参数的ML算法判别模型,可快速从TT或IDA中识别MHA,有助于患者采取预防措施。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine Learning for Discriminating Microcytic Hypochromic Anemia Based on Erythrocyte Parameters.

Introduction: Thalassemia trait (TT) and iron deficiency anemia (IDA) are two common types of microcytic hypochromic anemia (MHA), but current diagnostic methods have limitations. This research sought to employ machine learning (ML) algorithms to identify MHA using erythrocyte parameters to distinguish between TT and IDA.

Methods: One hundred and ninety-three subjects with MHA (98 TT and 95 IDA) were retrospectively analyzed. The cohort was randomized to training set (60%), validation set (20%) and test set (20%). Erythrocyte parameters were collected on an automated hematology analyzer (DxH800, Beckman Coulter), and five ML algorithms were selected to build discriminant models, including Random Forest, XGBoost, logistic regression, AdaBoost and LightGBM. In the assessment of discriminant performance of different models, indicators including sensitivity, specificity, accuracy, AUC, NPV, PPV, cutoff, F1 score and Kappa coefficients were utilized.

Results: Among the five ML algorithms aforementioned, the Random Forest and logistic regression models presented excellent discriminant performance, outperforming other models in the testing set, with the AUC value, sensitivity, specificity, and ACC of 0.977, 0.928, 0.953, and 0.940 for Random Forest, and 0.978, 0.879, 0.979, 0.928 for logistic regression. Eight vital peripheral erythrocyte parameters were finally selected, including RBC, RDW, MCV, MCHC, RDWSD, HGB, MAF, and LHD.

Conclusion: We successfully developed a discriminant model using ML algorithms based on erythrocyte parameters to identify MHA rapidly from TT or IDA, which may assist patients in taking preventive measures.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信