Advancing predictive analytics in child malnutrition: Machine, ensemble and deep learning models with balanced class distribution for early detection of stunting and wasting

IF 1.8 Q3 ENDOCRINOLOGY & METABOLISM
Wisdom Richard Mgomezulu , Paul Thangata , Bertha Mkandawire , Nana Amoah
{"title":"Advancing predictive analytics in child malnutrition: Machine, ensemble and deep learning models with balanced class distribution for early detection of stunting and wasting","authors":"Wisdom Richard Mgomezulu ,&nbsp;Paul Thangata ,&nbsp;Bertha Mkandawire ,&nbsp;Nana Amoah","doi":"10.1016/j.hnm.2025.200340","DOIUrl":null,"url":null,"abstract":"<div><div>Child malnutrition remains a critical public health challenge in sub-Saharan Africa, with traditional surveillance methods proving inadequate for early detection and intervention. This study leverages advanced machine learning and deep learning techniques to revolutionize stunting and wasting prediction in Malawi, utilizing nationally representative World Bank's Living Standards Measurement Surveys (LSMS) data to develop robust predictive models capable of identifying at-risk children before clinical manifestations emerge. Seven classification algorithms were evaluated, including ensemble methods (Random Forest, XGBoost), Deep Neural Networks (DNN), and traditional approaches (SVM, Logistic Regression, KNN, Gradient Boosting). Class imbalance challenges were addressed through SMOTE implementation and strategic class weighting. Model performance was assessed using accuracy, precision, recall, F1-score, and AUC-ROC metrics across balanced datasets. Results demonstrate exceptional predictive capabilities, with Random Forest achieving perfect performance for wasting prediction (100 % accuracy, precision, recall, F1-score, and AUC-ROC) and near-perfect stunting classification (99.98 % accuracy). XGBoost demonstrated comparable excellence with 99.49 % accuracy for wasting and 95.52 % for stunting prediction. DNN showed strong performance (91.50 % wasting accuracy, 76.64 % stunting accuracy), while traditional methods exhibited moderate effectiveness, with logistic regression achieving the lowest performance (66.58 % wasting, 64.72 % stunting accuracy). These findings represent a paradigm shift toward proactive nutritional surveillance, enabling early identification of vulnerable populations through data-driven approaches. The superior performance of ensemble algorithms provides policymakers with powerful tools for evidence-based resource allocation and targeted interventions. Implementation of these predictive models within Malawi's health systems could significantly enhance early detection capabilities, facilitate timely nutritional interventions, and contribute substantially to achieving global nutrition targets while reducing childhood mortality rates.</div></div>","PeriodicalId":36125,"journal":{"name":"Human Nutrition and Metabolism","volume":"42 ","pages":"Article 200340"},"PeriodicalIF":1.8000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Nutrition and Metabolism","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666149725000441","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Child malnutrition remains a critical public health challenge in sub-Saharan Africa, with traditional surveillance methods proving inadequate for early detection and intervention. This study leverages advanced machine learning and deep learning techniques to revolutionize stunting and wasting prediction in Malawi, utilizing nationally representative World Bank's Living Standards Measurement Surveys (LSMS) data to develop robust predictive models capable of identifying at-risk children before clinical manifestations emerge. Seven classification algorithms were evaluated, including ensemble methods (Random Forest, XGBoost), Deep Neural Networks (DNN), and traditional approaches (SVM, Logistic Regression, KNN, Gradient Boosting). Class imbalance challenges were addressed through SMOTE implementation and strategic class weighting. Model performance was assessed using accuracy, precision, recall, F1-score, and AUC-ROC metrics across balanced datasets. Results demonstrate exceptional predictive capabilities, with Random Forest achieving perfect performance for wasting prediction (100 % accuracy, precision, recall, F1-score, and AUC-ROC) and near-perfect stunting classification (99.98 % accuracy). XGBoost demonstrated comparable excellence with 99.49 % accuracy for wasting and 95.52 % for stunting prediction. DNN showed strong performance (91.50 % wasting accuracy, 76.64 % stunting accuracy), while traditional methods exhibited moderate effectiveness, with logistic regression achieving the lowest performance (66.58 % wasting, 64.72 % stunting accuracy). These findings represent a paradigm shift toward proactive nutritional surveillance, enabling early identification of vulnerable populations through data-driven approaches. The superior performance of ensemble algorithms provides policymakers with powerful tools for evidence-based resource allocation and targeted interventions. Implementation of these predictive models within Malawi's health systems could significantly enhance early detection capabilities, facilitate timely nutritional interventions, and contribute substantially to achieving global nutrition targets while reducing childhood mortality rates.
推进儿童营养不良预测分析:具有均衡阶层分布的机器、集成和深度学习模型,用于早期发现发育迟缓和消瘦
儿童营养不良仍然是撒哈拉以南非洲的一个重大公共卫生挑战,传统的监测方法证明不足以早期发现和干预。本研究利用先进的机器学习和深度学习技术,彻底改变马拉维的发育迟缓和消瘦预测,利用具有全国代表性的世界银行生活水平测量调查(LSMS)数据,开发强大的预测模型,能够在临床表现出现之前识别出有风险的儿童。评估了7种分类算法,包括集成方法(Random Forest, XGBoost),深度神经网络(DNN)和传统方法(SVM, Logistic回归,KNN, Gradient Boosting)。通过SMOTE的实施和战略性的班级加权来解决班级失衡的挑战。使用平衡数据集的准确性、精密度、召回率、f1评分和AUC-ROC指标评估模型性能。结果显示出卓越的预测能力,随机森林在预测消瘦方面取得了完美的表现(100%的准确率、精度、召回率、f1得分和AUC-ROC),在发育不良分类方面取得了近乎完美的表现(99.98%的准确率)。XGBoost在预测消瘦和发育迟缓方面的准确率分别为99.49%和95.52%。深度神经网络表现出较强的效果(浪费准确率为91.50%,发育不良准确率为76.64%),而传统方法的效果一般,其中逻辑回归的效果最低(浪费准确率为66.58%,发育不良准确率为64.72%)。这些发现代表了向主动营养监测的范式转变,能够通过数据驱动的方法早期识别弱势群体。集成算法的卓越性能为决策者提供了强有力的工具,用于基于证据的资源分配和有针对性的干预。在马拉维卫生系统内实施这些预测模型可以大大提高早期发现能力,促进及时的营养干预,并为实现全球营养目标做出重大贡献,同时降低儿童死亡率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Human Nutrition and Metabolism
Human Nutrition and Metabolism Agricultural and Biological Sciences-Food Science
CiteScore
1.50
自引率
0.00%
发文量
30
审稿时长
188 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信