使用机器学习预测北方邦儿童死亡率的决定因素:来自全国家庭和健康调查的见解(2019-21)

IF 2.3 Q2 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Pinky Pandey , Sacheendra Shukla , Niraj Kumar Singh , Mukesh Kumar
{"title":"使用机器学习预测北方邦儿童死亡率的决定因素:来自全国家庭和健康调查的见解(2019-21)","authors":"Pinky Pandey ,&nbsp;Sacheendra Shukla ,&nbsp;Niraj Kumar Singh ,&nbsp;Mukesh Kumar","doi":"10.1016/j.cegh.2025.101949","DOIUrl":null,"url":null,"abstract":"<div><h3>Aim</h3><div>This study aimed to delineate spatial variations in under-five mortality across Uttar Pradesh and evaluate the efficacy of various machine learning algorithms in identifying critical determinants influencing these mortality rates.</div></div><div><h3>Methods</h3><div>The study utilized data from the National Family and Health Survey (NFHS) - V. Four machine learning algorithms—Random Forests, Logistic Regression, K-Nearest Neighbors (KNN), and Naive Bayes—were applied alongside a traditional logistic regression model. Predictive performance was evaluated using metrics such as model accuracy and receiver operating characteristic (ROC) curves. Descriptive analysis highlighted regional variations in under-five mortality rates.</div></div><div><h3>Results</h3><div>Notable regional disparities in under-five mortality were observed across Uttar Pradesh. Predictive accuracies ranged from 76 % to 79.4 %, with the logistic regression model achieving the highest accuracy (79.4 %). All ML models demonstrated comparable predictive capabilities. The most effective model identified key determinants of under-five mortality, including breastfeeding status, number of births in the preceding five years, child's gender, birth intervals, antenatal care, birth order, type of water source, and maternal body mass index.</div></div><div><h3>Conclusion</h3><div>Machine learning models provide valuable insights into the determinants of under-five mortality, with the logistic regression model demonstrating superior predictive performance. Policy measures targeting critical factors, such as promoting breastfeeding, optimizing birth intervals, and improving maternal health and antenatal care, can significantly enhance childhood survival rates in Uttar Pradesh.</div></div>","PeriodicalId":46404,"journal":{"name":"Clinical Epidemiology and Global Health","volume":"32 ","pages":"Article 101949"},"PeriodicalIF":2.3000,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)\",\"authors\":\"Pinky Pandey ,&nbsp;Sacheendra Shukla ,&nbsp;Niraj Kumar Singh ,&nbsp;Mukesh Kumar\",\"doi\":\"10.1016/j.cegh.2025.101949\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Aim</h3><div>This study aimed to delineate spatial variations in under-five mortality across Uttar Pradesh and evaluate the efficacy of various machine learning algorithms in identifying critical determinants influencing these mortality rates.</div></div><div><h3>Methods</h3><div>The study utilized data from the National Family and Health Survey (NFHS) - V. Four machine learning algorithms—Random Forests, Logistic Regression, K-Nearest Neighbors (KNN), and Naive Bayes—were applied alongside a traditional logistic regression model. Predictive performance was evaluated using metrics such as model accuracy and receiver operating characteristic (ROC) curves. Descriptive analysis highlighted regional variations in under-five mortality rates.</div></div><div><h3>Results</h3><div>Notable regional disparities in under-five mortality were observed across Uttar Pradesh. Predictive accuracies ranged from 76 % to 79.4 %, with the logistic regression model achieving the highest accuracy (79.4 %). All ML models demonstrated comparable predictive capabilities. The most effective model identified key determinants of under-five mortality, including breastfeeding status, number of births in the preceding five years, child's gender, birth intervals, antenatal care, birth order, type of water source, and maternal body mass index.</div></div><div><h3>Conclusion</h3><div>Machine learning models provide valuable insights into the determinants of under-five mortality, with the logistic regression model demonstrating superior predictive performance. Policy measures targeting critical factors, such as promoting breastfeeding, optimizing birth intervals, and improving maternal health and antenatal care, can significantly enhance childhood survival rates in Uttar Pradesh.</div></div>\",\"PeriodicalId\":46404,\"journal\":{\"name\":\"Clinical Epidemiology and Global Health\",\"volume\":\"32 \",\"pages\":\"Article 101949\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-02-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical Epidemiology and Global Health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2213398425000387\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Epidemiology and Global Health","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2213398425000387","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

摘要

目的本研究旨在描述北方邦5岁以下儿童死亡率的空间差异,并评估各种机器学习算法在确定影响这些死亡率的关键决定因素方面的功效。该研究利用了国家家庭与健康调查(NFHS)的数据。四种机器学习算法——随机森林、逻辑回归、k近邻(KNN)和朴素贝叶斯——与传统的逻辑回归模型一起应用。使用模型精度和受试者工作特征(ROC)曲线等指标评估预测性能。描述性分析强调了五岁以下儿童死亡率的区域差异。结果北方邦5岁以下儿童死亡率存在显著的地区差异。预测准确率从76%到79.4%不等,其中逻辑回归模型的准确率最高(79.4%)。所有ML模型都显示出相当的预测能力。最有效的模型确定了5岁以下儿童死亡率的关键决定因素,包括母乳喂养状况、前5年的出生人数、儿童性别、生育间隔、产前护理、出生顺序、水源类型和产妇体重指数。结论:机器学习模型为了解5岁以下儿童死亡率的决定因素提供了有价值的见解,逻辑回归模型显示出优越的预测性能。针对关键因素的政策措施,如促进母乳喂养、优化生育间隔、改善孕产妇保健和产前保健,可以显著提高北方邦的儿童存活率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)

Aim

This study aimed to delineate spatial variations in under-five mortality across Uttar Pradesh and evaluate the efficacy of various machine learning algorithms in identifying critical determinants influencing these mortality rates.

Methods

The study utilized data from the National Family and Health Survey (NFHS) - V. Four machine learning algorithms—Random Forests, Logistic Regression, K-Nearest Neighbors (KNN), and Naive Bayes—were applied alongside a traditional logistic regression model. Predictive performance was evaluated using metrics such as model accuracy and receiver operating characteristic (ROC) curves. Descriptive analysis highlighted regional variations in under-five mortality rates.

Results

Notable regional disparities in under-five mortality were observed across Uttar Pradesh. Predictive accuracies ranged from 76 % to 79.4 %, with the logistic regression model achieving the highest accuracy (79.4 %). All ML models demonstrated comparable predictive capabilities. The most effective model identified key determinants of under-five mortality, including breastfeeding status, number of births in the preceding five years, child's gender, birth intervals, antenatal care, birth order, type of water source, and maternal body mass index.

Conclusion

Machine learning models provide valuable insights into the determinants of under-five mortality, with the logistic regression model demonstrating superior predictive performance. Policy measures targeting critical factors, such as promoting breastfeeding, optimizing birth intervals, and improving maternal health and antenatal care, can significantly enhance childhood survival rates in Uttar Pradesh.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Clinical Epidemiology and Global Health
Clinical Epidemiology and Global Health PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH-
CiteScore
4.60
自引率
7.70%
发文量
218
审稿时长
66 days
期刊介绍: Clinical Epidemiology and Global Health (CEGH) is a multidisciplinary journal and it is published four times (March, June, September, December) a year. The mandate of CEGH is to promote articles on clinical epidemiology with focus on developing countries in the context of global health. We also accept articles from other countries. It publishes original research work across all disciplines of medicine and allied sciences, related to clinical epidemiology and global health. The journal publishes Original articles, Review articles, Evidence Summaries, Letters to the Editor. All articles published in CEGH are peer-reviewed and published online for immediate access and citation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信