Optimizing machine learning models for predicting anemia among under-five children in Ethiopia: insights from Ethiopian demographic and health survey data.

IF 2 3区 医学 Q2 PEDIATRICS
Ali Yimer, Hassen Ahmed Yesuf, Sada Ahmed, Alemu Birara Zemariam, Endris Mussa, Nurye Sirage, Adem Yesuf, Abdulaziz Kebede Kassaw
{"title":"Optimizing machine learning models for predicting anemia among under-five children in Ethiopia: insights from Ethiopian demographic and health survey data.","authors":"Ali Yimer, Hassen Ahmed Yesuf, Sada Ahmed, Alemu Birara Zemariam, Endris Mussa, Nurye Sirage, Adem Yesuf, Abdulaziz Kebede Kassaw","doi":"10.1186/s12887-025-05659-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Healthcare practitioners require a robust predictive system to accurately diagnose diseases, especially in young children with conditions such as anemia. Delays in diagnosis and treatment can have severe consequences, potentially leading to serious complications and childhood mortality. By leveraging machine learning methods with extensive datasets, valuable and scientifically sound insights can be generated to address pressing health and healthcare-related challenges.</p><p><strong>Objectives: </strong>The primary objective of this study was to identify the most effective machine-learning algorithm for predicting anemia among under five children in Ethiopia.</p><p><strong>Methods: </strong>The data utilized in this study were sourced from the 2016 Ethiopian Demographic and Health Survey. Six machine-learning models, comprising a classic logistic regression model along with random forest, decision tree, support vector machine, Naïve Bayes, and K-nearest neighbors, were employed to predict factors influencing anemia in children under five. The predictive capacities of each machine-learning model were evaluated using receiver operating characteristic curves and various measures of model accuracy.</p><p><strong>Results: </strong>The random forest model demonstrated the highest accuracy among the algorithms tested, achieving an overall accuracy of 81.16%. The accuracy rates for the decision tree, support vector machines, Naïve Bayes, K-nearest neighbors, and classical logistic regression models were 68.40%, 59.94%, 53.06%, 69.96%, and 54.79%, respectively.</p><p><strong>Conclusion: </strong>In general, the random forest algorithm emerged as the preferred model for predicting anemia in children under five. The model exhibited a specificity of 79.26%, sensitivity of 83.07%, positive predictive value of 80.02%, negative predictive value of 82.40%, and an area under the curve of 81.80%.</p>","PeriodicalId":9144,"journal":{"name":"BMC Pediatrics","volume":"25 1","pages":"311"},"PeriodicalIF":2.0000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12013019/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pediatrics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12887-025-05659-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PEDIATRICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Healthcare practitioners require a robust predictive system to accurately diagnose diseases, especially in young children with conditions such as anemia. Delays in diagnosis and treatment can have severe consequences, potentially leading to serious complications and childhood mortality. By leveraging machine learning methods with extensive datasets, valuable and scientifically sound insights can be generated to address pressing health and healthcare-related challenges.

Objectives: The primary objective of this study was to identify the most effective machine-learning algorithm for predicting anemia among under five children in Ethiopia.

Methods: The data utilized in this study were sourced from the 2016 Ethiopian Demographic and Health Survey. Six machine-learning models, comprising a classic logistic regression model along with random forest, decision tree, support vector machine, Naïve Bayes, and K-nearest neighbors, were employed to predict factors influencing anemia in children under five. The predictive capacities of each machine-learning model were evaluated using receiver operating characteristic curves and various measures of model accuracy.

Results: The random forest model demonstrated the highest accuracy among the algorithms tested, achieving an overall accuracy of 81.16%. The accuracy rates for the decision tree, support vector machines, Naïve Bayes, K-nearest neighbors, and classical logistic regression models were 68.40%, 59.94%, 53.06%, 69.96%, and 54.79%, respectively.

Conclusion: In general, the random forest algorithm emerged as the preferred model for predicting anemia in children under five. The model exhibited a specificity of 79.26%, sensitivity of 83.07%, positive predictive value of 80.02%, negative predictive value of 82.40%, and an area under the curve of 81.80%.

优化预测埃塞俄比亚五岁以下儿童贫血的机器学习模型:来自埃塞俄比亚人口和健康调查数据的见解。
背景:医疗从业者需要一个强大的预测系统来准确诊断疾病,特别是在患有贫血等疾病的幼儿中。诊断和治疗的延误可能产生严重后果,可能导致严重并发症和儿童死亡。通过利用具有广泛数据集的机器学习方法,可以生成有价值且科学合理的见解,以解决紧迫的健康和医疗保健相关挑战。目的:本研究的主要目的是确定预测埃塞俄比亚五岁以下儿童贫血的最有效的机器学习算法。方法:本研究使用的数据来自2016年埃塞俄比亚人口与健康调查。采用六种机器学习模型,包括经典逻辑回归模型、随机森林、决策树、支持向量机、Naïve贝叶斯和k近邻,预测五岁以下儿童贫血的影响因素。每个机器学习模型的预测能力使用接收者工作特征曲线和各种模型精度测量来评估。结果:随机森林模型在被测算法中准确率最高,总体准确率为81.16%。决策树模型、支持向量机模型、Naïve贝叶斯模型、k近邻模型和经典逻辑回归模型的准确率分别为68.40%、59.94%、53.06%、69.96%和54.79%。结论:总的来说,随机森林算法是预测5岁以下儿童贫血的首选模型。该模型的特异性为79.26%,敏感性为83.07%,阳性预测值为80.02%,阴性预测值为82.40%,曲线下面积为81.80%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Pediatrics
BMC Pediatrics PEDIATRICS-
CiteScore
3.70
自引率
4.20%
发文量
683
审稿时长
3-8 weeks
期刊介绍: BMC Pediatrics is an open access journal publishing peer-reviewed research articles in all aspects of health care in neonates, children and adolescents, as well as related molecular genetics, pathophysiology, and epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信