利用机器学习来预测撒哈拉以南非洲育龄妇女的蚊帐使用情况。

IF 3 3区 医学 Q3 INFECTIOUS DISEASES
Nebebe Demis Baykemagn, Tesfahun Zemene Tafere, Getachew Teshale, Andualem Yalew Aschalew, Melak Jejaw, Kaleb Assegid Demissie, Azmeraw Tadele, Asebe Hagos, Misganaw Guadie Tiruneh, Jenberu Mekurianew Kelkay
{"title":"利用机器学习来预测撒哈拉以南非洲育龄妇女的蚊帐使用情况。","authors":"Nebebe Demis Baykemagn, Tesfahun Zemene Tafere, Getachew Teshale, Andualem Yalew Aschalew, Melak Jejaw, Kaleb Assegid Demissie, Azmeraw Tadele, Asebe Hagos, Misganaw Guadie Tiruneh, Jenberu Mekurianew Kelkay","doi":"10.1186/s12936-025-05563-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Malaria remains a major public health challenge, particularly in sub-Saharan Africa, where women of reproductive age are especially vulnerable during pregnancy and childbirth. To identify key predictors and improve predictive accuracy, machine learning algorithms such as Random Forest were applied, along with SHAP analysis, to a large multi-country DHS dataset, with class imbalance addressed using Tomek Links and Random Over-Sampling.</p><p><strong>Methods: </strong>This study employed a weighted dataset of 153,015 participants from the Demographic and Health Survey (DHS) conducted across ten sub-Saharan African countries. Data preprocessing and analysis were carried out using STATA version 17 and Python 3.10. Feature scaling was applied to standardize numerical variables, ensuring uniform weighting across predictors and improving model stability. An 80:20 data split ratio was applied, and class imbalance was addressed using Tomek Links combined with Random Over-Sampling. Eight models were selected and trained using both balanced and unbalanced datasets. The model performance was evaluated using metrics such as ROC-AUC, accuracy, recall, F1 score, and precision.</p><p><strong>Results: </strong>The Random Forest algorithm performed best in this study, with an accuracy of 83%, an F1 score of 82%, recall of 80%, precision of 84%, and an AUC of 88%. Fifty-five percent of participants used mosquito nets. The SHAP analysis showed that Age above 34, being employed, frequent social media use, higher education, institutional deliveries, and female-headed households increased bed net use, while fewer ANC visits and being divorced decreased its use.</p><p><strong>Conclusion: </strong>Age above 34, being employed, frequent social media use, higher education, institutional deliveries, and female-headed households increased bed net use, while fewer ANC visits and being divorced decreased its use. Strengthening social media use for health information, promoting women's education, encouraging institutional delivery, motivate for antenatal care services, and providing support to socially and economically vulnerable women are essential strategies to enhance mosquito net utilization.</p>","PeriodicalId":18317,"journal":{"name":"Malaria Journal","volume":"24 1","pages":"317"},"PeriodicalIF":3.0000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging machine learning to predict mosquito bed net utilization among women of reproductive age in sub-Saharan Africa.\",\"authors\":\"Nebebe Demis Baykemagn, Tesfahun Zemene Tafere, Getachew Teshale, Andualem Yalew Aschalew, Melak Jejaw, Kaleb Assegid Demissie, Azmeraw Tadele, Asebe Hagos, Misganaw Guadie Tiruneh, Jenberu Mekurianew Kelkay\",\"doi\":\"10.1186/s12936-025-05563-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Malaria remains a major public health challenge, particularly in sub-Saharan Africa, where women of reproductive age are especially vulnerable during pregnancy and childbirth. To identify key predictors and improve predictive accuracy, machine learning algorithms such as Random Forest were applied, along with SHAP analysis, to a large multi-country DHS dataset, with class imbalance addressed using Tomek Links and Random Over-Sampling.</p><p><strong>Methods: </strong>This study employed a weighted dataset of 153,015 participants from the Demographic and Health Survey (DHS) conducted across ten sub-Saharan African countries. Data preprocessing and analysis were carried out using STATA version 17 and Python 3.10. Feature scaling was applied to standardize numerical variables, ensuring uniform weighting across predictors and improving model stability. An 80:20 data split ratio was applied, and class imbalance was addressed using Tomek Links combined with Random Over-Sampling. Eight models were selected and trained using both balanced and unbalanced datasets. The model performance was evaluated using metrics such as ROC-AUC, accuracy, recall, F1 score, and precision.</p><p><strong>Results: </strong>The Random Forest algorithm performed best in this study, with an accuracy of 83%, an F1 score of 82%, recall of 80%, precision of 84%, and an AUC of 88%. Fifty-five percent of participants used mosquito nets. The SHAP analysis showed that Age above 34, being employed, frequent social media use, higher education, institutional deliveries, and female-headed households increased bed net use, while fewer ANC visits and being divorced decreased its use.</p><p><strong>Conclusion: </strong>Age above 34, being employed, frequent social media use, higher education, institutional deliveries, and female-headed households increased bed net use, while fewer ANC visits and being divorced decreased its use. Strengthening social media use for health information, promoting women's education, encouraging institutional delivery, motivate for antenatal care services, and providing support to socially and economically vulnerable women are essential strategies to enhance mosquito net utilization.</p>\",\"PeriodicalId\":18317,\"journal\":{\"name\":\"Malaria Journal\",\"volume\":\"24 1\",\"pages\":\"317\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Malaria Journal\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12936-025-05563-8\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"INFECTIOUS DISEASES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Malaria Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12936-025-05563-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"INFECTIOUS DISEASES","Score":null,"Total":0}
引用次数: 0

摘要

背景:疟疾仍然是一项重大的公共卫生挑战,特别是在撒哈拉以南非洲,那里的育龄妇女在怀孕和分娩期间特别脆弱。为了识别关键预测因子并提高预测准确性,将Random Forest等机器学习算法与SHAP分析一起应用于大型多国国土安全部数据集,并使用Tomek Links和Random oversampling解决类不平衡问题。方法:本研究采用了来自撒哈拉以南非洲10个国家的人口与健康调查(DHS)的153,015名参与者的加权数据集。使用STATA version 17和Python 3.10对数据进行预处理和分析。特征缩放用于标准化数值变量,确保预测变量的均匀加权,提高模型的稳定性。采用80:20的数据分割比率,并使用Tomek Links结合Random oversampling解决类不平衡问题。选择了8个模型,并使用平衡和非平衡数据集进行训练。使用ROC-AUC、准确性、召回率、F1分数和精度等指标评估模型性能。结果:随机森林算法在本研究中表现最好,准确率为83%,F1分数为82%,召回率为80%,精度为84%,AUC为88%。55%的参与者使用了蚊帐。SHAP分析显示,年龄在34岁以上、有工作、经常使用社交媒体、受过高等教育、在医院分娩和女性为户主的家庭增加了蚊帐的使用,而ANC访问次数较少和离婚则减少了蚊帐的使用。结论:年龄在34岁以上、有工作、频繁使用社交媒体、受过高等教育、机构分娩和女性户主家庭增加了蚊帐的使用,而ANC访问次数较少和离婚则降低了蚊帐的使用。加强社会媒体在卫生信息方面的使用,促进妇女教育,鼓励机构提供服务,鼓励产前保健服务,并向社会和经济上脆弱的妇女提供支持,这些都是提高蚊帐使用率的基本战略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Leveraging machine learning to predict mosquito bed net utilization among women of reproductive age in sub-Saharan Africa.

Background: Malaria remains a major public health challenge, particularly in sub-Saharan Africa, where women of reproductive age are especially vulnerable during pregnancy and childbirth. To identify key predictors and improve predictive accuracy, machine learning algorithms such as Random Forest were applied, along with SHAP analysis, to a large multi-country DHS dataset, with class imbalance addressed using Tomek Links and Random Over-Sampling.

Methods: This study employed a weighted dataset of 153,015 participants from the Demographic and Health Survey (DHS) conducted across ten sub-Saharan African countries. Data preprocessing and analysis were carried out using STATA version 17 and Python 3.10. Feature scaling was applied to standardize numerical variables, ensuring uniform weighting across predictors and improving model stability. An 80:20 data split ratio was applied, and class imbalance was addressed using Tomek Links combined with Random Over-Sampling. Eight models were selected and trained using both balanced and unbalanced datasets. The model performance was evaluated using metrics such as ROC-AUC, accuracy, recall, F1 score, and precision.

Results: The Random Forest algorithm performed best in this study, with an accuracy of 83%, an F1 score of 82%, recall of 80%, precision of 84%, and an AUC of 88%. Fifty-five percent of participants used mosquito nets. The SHAP analysis showed that Age above 34, being employed, frequent social media use, higher education, institutional deliveries, and female-headed households increased bed net use, while fewer ANC visits and being divorced decreased its use.

Conclusion: Age above 34, being employed, frequent social media use, higher education, institutional deliveries, and female-headed households increased bed net use, while fewer ANC visits and being divorced decreased its use. Strengthening social media use for health information, promoting women's education, encouraging institutional delivery, motivate for antenatal care services, and providing support to socially and economically vulnerable women are essential strategies to enhance mosquito net utilization.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Malaria Journal
Malaria Journal 医学-寄生虫学
CiteScore
5.10
自引率
23.30%
发文量
334
审稿时长
2-4 weeks
期刊介绍: Malaria Journal is aimed at the scientific community interested in malaria in its broadest sense. It is the only journal that publishes exclusively articles on malaria and, as such, it aims to bring together knowledge from the different specialities involved in this very broad discipline, from the bench to the bedside and to the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信