Exploring Machine Learning Algorithms to Predict Diarrhea Disease and Identify its Determinants among Under-Five Years Children in East Africa.

IF 3.8 4区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Tirualem Zeleke Yehuala, Nebiyu Mekonnen Derseh, Makda Fekadie Tewelgne, Sisay Maru Wubante
{"title":"Exploring Machine Learning Algorithms to Predict Diarrhea Disease and Identify its Determinants among Under-Five Years Children in East Africa.","authors":"Tirualem Zeleke Yehuala, Nebiyu Mekonnen Derseh, Makda Fekadie Tewelgne, Sisay Maru Wubante","doi":"10.1007/s44197-024-00259-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The second most common cause of death for children under five is diarrhea. Early Predicting diarrhea disease and identify its determinants (factors) using an advanced machine learning model is the most effective way to save the lives of children. Hence, this study aimed to predict diarrheal diseases, identify their determinants, and generate some rules using machine learning models.</p><p><strong>Methods: </strong>The study used secondary data from the 12 east African countries for DHS dataset analysis using Python. Machine learning techniques such as Random Forest, Decision Tree (DT), K-Nearest Neighbor, Logistic Regression (LR), wrapper feature selection and SHAP values are used for identify determinants.</p><p><strong>Result: </strong>The final experimentation results indicated the random forest model performed the best to predict diarrhea disease with an accuracy of 86.5%, precision of 89%, F-measure of 86%, AUC curve of 92%, and recall of 82%. Important predictors' identified age, countries, wealth status, mother's educational status, mother's age, source of drinking water, number of under-five children immunization status, media exposure, timing of breast feeding, mother's working status, types of toilet, and twin status were associated with a higher predicted probability of diarrhea disease.</p><p><strong>Conclusion: </strong>According to this study, child caregivers are fully aware of sanitation and feeding their children, and moms are educated, which can reduce child mortality by diarrhea in children in east Africa. This leads to a recommendation for policy direction to reduce infant mortality in East Africa.</p>","PeriodicalId":15796,"journal":{"name":"Journal of Epidemiology and Global Health","volume":null,"pages":null},"PeriodicalIF":3.8000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11442874/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Epidemiology and Global Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s44197-024-00259-9","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/29 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The second most common cause of death for children under five is diarrhea. Early Predicting diarrhea disease and identify its determinants (factors) using an advanced machine learning model is the most effective way to save the lives of children. Hence, this study aimed to predict diarrheal diseases, identify their determinants, and generate some rules using machine learning models.

Methods: The study used secondary data from the 12 east African countries for DHS dataset analysis using Python. Machine learning techniques such as Random Forest, Decision Tree (DT), K-Nearest Neighbor, Logistic Regression (LR), wrapper feature selection and SHAP values are used for identify determinants.

Result: The final experimentation results indicated the random forest model performed the best to predict diarrhea disease with an accuracy of 86.5%, precision of 89%, F-measure of 86%, AUC curve of 92%, and recall of 82%. Important predictors' identified age, countries, wealth status, mother's educational status, mother's age, source of drinking water, number of under-five children immunization status, media exposure, timing of breast feeding, mother's working status, types of toilet, and twin status were associated with a higher predicted probability of diarrhea disease.

Conclusion: According to this study, child caregivers are fully aware of sanitation and feeding their children, and moms are educated, which can reduce child mortality by diarrhea in children in east Africa. This leads to a recommendation for policy direction to reduce infant mortality in East Africa.

探索机器学习算法,预测东非五岁以下儿童腹泻疾病并确定其决定因素。
背景:腹泻是导致五岁以下儿童死亡的第二大原因。利用先进的机器学习模型及早预测腹泻疾病并确定其决定因素是挽救儿童生命的最有效方法。因此,本研究旨在预测腹泻疾病,确定其决定因素,并利用机器学习模型生成一些规则:本研究使用 Python 分析了来自 12 个东非国家的人口与健康调查数据集的二手数据。使用随机森林、决策树(DT)、K-近邻、逻辑回归(LR)、包装特征选择和 SHAP 值等机器学习技术来识别决定因素:最终实验结果表明,随机森林模型在预测腹泻疾病方面表现最佳,准确率为 86.5%,精确率为 89%,F-measure 为 86%,AUC 曲线为 92%,召回率为 82%。年龄、国家、财富状况、母亲教育状况、母亲年龄、饮用水来源、5 岁以下儿童免疫接种情况、媒体接触、母乳喂养时间、母亲工作状况、厕所类型和双胞胎状况等重要预测因素与腹泻疾病的预测概率相关:根据这项研究,儿童看护者充分认识到了卫生和喂养孩子的重要性,母亲也受到了教育,这些都可以降低东非儿童因腹泻而导致的死亡率。这为降低东非婴儿死亡率的政策方向提出了建议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.70
自引率
1.40%
发文量
57
审稿时长
19 weeks
期刊介绍: The Journal of Epidemiology and Global Health is an esteemed international publication, offering a platform for peer-reviewed articles that drive advancements in global epidemiology and international health. Our mission is to shape global health policy by showcasing cutting-edge scholarship and innovative strategies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信