Exploring Risk Factors and Predictive Modeling of Child Malnutrition in Pakistan Using Machine Learning

IF 1.7 4区 医学 Q1 ANTHROPOLOGY
Muhammad Usman Saleem, Muhammad Usman Aslam, Abdul Ghani Khatir, Quanbao Jiang
{"title":"Exploring Risk Factors and Predictive Modeling of Child Malnutrition in Pakistan Using Machine Learning","authors":"Muhammad Usman Saleem,&nbsp;Muhammad Usman Aslam,&nbsp;Abdul Ghani Khatir,&nbsp;Quanbao Jiang","doi":"10.1002/ajhb.70124","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Objective</h3>\n \n <p>This study aims to identify risk factors and develop predictive models of child malnutrition (stunting, wasting, and underweight) in Pakistani children under five using machine learning approaches.</p>\n </section>\n \n <section>\n \n <h3> Study Design</h3>\n \n <p>This cross-sectional design utilized data from the Pakistan Demographic and Health Survey 2017–2018 (PDHS).</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Logistic regression was employed to identify significant socio-demographic and health-related risk factors. Four machine learning models—Random Forest (RF), Support Vector Machine (SVM), Naïve Bayes (NB), and AdaBoost—were applied to predict malnutrition indicators, with performance evaluated based on accuracy, sensitivity, specificity, and F-measure using a two-stage validation strategy (10-fold cross-validation and 80:20 train-test split).</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Key risk factors identified included consanguineous marriages, lower wealth status, low maternal education, and geographic disparities in Sindh and Baluchistan. Among the machine learning models, Random Forest demonstrated the highest overall accuracy and specificity across all indicators, while SVM showed higher sensitivity for wasted and underweight children.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>The study highlights the complex interplay of socio-demographic factors in child malnutrition and the potential of machine learning models to effectively predict these conditions, underscoring the need for targeted interventions focusing on maternal education, access to clean water and sanitation, and poverty alleviation, particularly in high-risk regions.</p>\n </section>\n </div>","PeriodicalId":50809,"journal":{"name":"American Journal of Human Biology","volume":"37 9","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Human Biology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ajhb.70124","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ANTHROPOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective

This study aims to identify risk factors and develop predictive models of child malnutrition (stunting, wasting, and underweight) in Pakistani children under five using machine learning approaches.

Study Design

This cross-sectional design utilized data from the Pakistan Demographic and Health Survey 2017–2018 (PDHS).

Methods

Logistic regression was employed to identify significant socio-demographic and health-related risk factors. Four machine learning models—Random Forest (RF), Support Vector Machine (SVM), Naïve Bayes (NB), and AdaBoost—were applied to predict malnutrition indicators, with performance evaluated based on accuracy, sensitivity, specificity, and F-measure using a two-stage validation strategy (10-fold cross-validation and 80:20 train-test split).

Results

Key risk factors identified included consanguineous marriages, lower wealth status, low maternal education, and geographic disparities in Sindh and Baluchistan. Among the machine learning models, Random Forest demonstrated the highest overall accuracy and specificity across all indicators, while SVM showed higher sensitivity for wasted and underweight children.

Conclusion

The study highlights the complex interplay of socio-demographic factors in child malnutrition and the potential of machine learning models to effectively predict these conditions, underscoring the need for targeted interventions focusing on maternal education, access to clean water and sanitation, and poverty alleviation, particularly in high-risk regions.

利用机器学习探索巴基斯坦儿童营养不良的风险因素和预测模型
本研究旨在利用机器学习方法识别巴基斯坦五岁以下儿童营养不良(发育迟缓、消瘦和体重不足)的风险因素并建立预测模型。本横断面设计利用了2017-2018年巴基斯坦人口与健康调查(PDHS)的数据。方法采用Logistic回归分析确定显著的社会人口统计学和健康相关危险因素。四种机器学习模型——随机森林(RF)、支持向量机(SVM)、Naïve贝叶斯(NB)和adaboost——被用于预测营养不良指标,使用两阶段验证策略(10倍交叉验证和80:20训练测试分割),基于准确性、敏感性、特异性和F-measure对性能进行评估。结果在信德省和俾路支省确定的主要危险因素包括近亲婚姻、较低的财富状况、较低的母亲教育水平和地理差异。在机器学习模型中,Random Forest在所有指标上表现出最高的总体准确性和特异性,而SVM对消瘦和体重不足的儿童表现出更高的敏感性。该研究强调了儿童营养不良中社会人口因素的复杂相互作用,以及机器学习模型有效预测这些情况的潜力,强调了有针对性的干预措施的必要性,重点是孕产妇教育、获得清洁水和卫生设施以及减贫,特别是在高风险地区。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.80
自引率
13.80%
发文量
124
审稿时长
4-8 weeks
期刊介绍: The American Journal of Human Biology is the Official Journal of the Human Biology Association. The American Journal of Human Biology is a bimonthly, peer-reviewed, internationally circulated journal that publishes reports of original research, theoretical articles and timely reviews, and brief communications in the interdisciplinary field of human biology. As the official journal of the Human Biology Association, the Journal also publishes abstracts of research presented at its annual scientific meeting and book reviews relevant to the field. The Journal seeks scholarly manuscripts that address all aspects of human biology, health, and disease, particularly those that stress comparative, developmental, ecological, or evolutionary perspectives. The transdisciplinary areas covered in the Journal include, but are not limited to, epidemiology, genetic variation, population biology and demography, physiology, anatomy, nutrition, growth and aging, physical performance, physical activity and fitness, ecology, and evolution, along with their interactions. The Journal publishes basic, applied, and methodologically oriented research from all areas, including measurement, analytical techniques and strategies, and computer applications in human biology. Like many other biologically oriented disciplines, the field of human biology has undergone considerable growth and diversification in recent years, and the expansion of the aims and scope of the Journal is a reflection of this growth and membership diversification. The Journal is committed to prompt review, and priority publication is given to manuscripts with novel or timely findings, and to manuscripts of unusual interest.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信