用机器学习技术预测撒哈拉以南非洲人口中风

Benjamin Segun Aribisala , Deirdre Edward , Godwin Ogbole , Onoja M. Akpa , Segun Ayilara , Fred Sarfo , Olusola Olabanjo , Adekunle Fakunle , Babafemi Oluropo Macaulay , Joseph Yaria , Joshua Akinyemi , Albert Akpalu , Kolawole Wahab , Reginald Obiako , Morenikeji Komolafe , Lukman Owolabi , Godwin Osaigbovo , Akinkunmi Paul Okekunle , Arti Singh , Philip Ibinaye , Mayowa Owolabi
{"title":"用机器学习技术预测撒哈拉以南非洲人口中风","authors":"Benjamin Segun Aribisala ,&nbsp;Deirdre Edward ,&nbsp;Godwin Ogbole ,&nbsp;Onoja M. Akpa ,&nbsp;Segun Ayilara ,&nbsp;Fred Sarfo ,&nbsp;Olusola Olabanjo ,&nbsp;Adekunle Fakunle ,&nbsp;Babafemi Oluropo Macaulay ,&nbsp;Joseph Yaria ,&nbsp;Joshua Akinyemi ,&nbsp;Albert Akpalu ,&nbsp;Kolawole Wahab ,&nbsp;Reginald Obiako ,&nbsp;Morenikeji Komolafe ,&nbsp;Lukman Owolabi ,&nbsp;Godwin Osaigbovo ,&nbsp;Akinkunmi Paul Okekunle ,&nbsp;Arti Singh ,&nbsp;Philip Ibinaye ,&nbsp;Mayowa Owolabi","doi":"10.1016/j.neuri.2025.100216","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Stroke is the second leading cause of death and the third leading cause of disability globally, including Africa, which bears its largest burden. Accurate models are needed in Africa to predict and prevent stroke occurrence. The aim of this study was to identify the best machine learning (ML) algorithm for stroke prediction.</div></div><div><h3>Methods</h3><div>We assessed medical data of 4,236 subjects comprising 2,118 stroke patients and 2,118 controls from the SIREN database. Sixteen established vascular risk factors were evaluated in this study. These are addition of salt to food at table during eating, cardiac disease, diabetes mellitus, dyslipidemia, education, family history of cardiovascular disease, hypertension, income, low green leafy vegetable consumption, obesity, physical inactivity, regular meat consumption, regular sugar consumption, smoking, stress and use of tobacco. From these, we also selected the 11 topmost risk factors using Population-Attributable Risk ranking. Eleven ML models were built and empirically investigated using the 16 and the 11 risk factors.</div></div><div><h3>Results</h3><div>Our results showed that the 16 features-based classification (maximum AUC of 82.32%) had a slightly better performance than the 11 feature-based (maximum AUC 81.17%) algorithm. The result also showed that Artificial Neural Network (ANN) had the best performance amongst eleven algorithms investigated with AUC of 82.32%, sensitivity of 71.23%, specificity of 80.00%.</div></div><div><h3>Conclusion</h3><div>Machine Learning algorithms predicted stroke occurrence employing major risk factors in Sub-Saharan Africa better than regression models. Machine Learning, especially Artificial Neural Network, is recommended to enhance Afrocentric stroke prediction models for stroke risk factor quantification and control in Africa.</div></div>","PeriodicalId":74295,"journal":{"name":"Neuroscience informatics","volume":"5 3","pages":"Article 100216"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting stroke with machine learning techniques in a sub-Saharan African population\",\"authors\":\"Benjamin Segun Aribisala ,&nbsp;Deirdre Edward ,&nbsp;Godwin Ogbole ,&nbsp;Onoja M. Akpa ,&nbsp;Segun Ayilara ,&nbsp;Fred Sarfo ,&nbsp;Olusola Olabanjo ,&nbsp;Adekunle Fakunle ,&nbsp;Babafemi Oluropo Macaulay ,&nbsp;Joseph Yaria ,&nbsp;Joshua Akinyemi ,&nbsp;Albert Akpalu ,&nbsp;Kolawole Wahab ,&nbsp;Reginald Obiako ,&nbsp;Morenikeji Komolafe ,&nbsp;Lukman Owolabi ,&nbsp;Godwin Osaigbovo ,&nbsp;Akinkunmi Paul Okekunle ,&nbsp;Arti Singh ,&nbsp;Philip Ibinaye ,&nbsp;Mayowa Owolabi\",\"doi\":\"10.1016/j.neuri.2025.100216\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Stroke is the second leading cause of death and the third leading cause of disability globally, including Africa, which bears its largest burden. Accurate models are needed in Africa to predict and prevent stroke occurrence. The aim of this study was to identify the best machine learning (ML) algorithm for stroke prediction.</div></div><div><h3>Methods</h3><div>We assessed medical data of 4,236 subjects comprising 2,118 stroke patients and 2,118 controls from the SIREN database. Sixteen established vascular risk factors were evaluated in this study. These are addition of salt to food at table during eating, cardiac disease, diabetes mellitus, dyslipidemia, education, family history of cardiovascular disease, hypertension, income, low green leafy vegetable consumption, obesity, physical inactivity, regular meat consumption, regular sugar consumption, smoking, stress and use of tobacco. From these, we also selected the 11 topmost risk factors using Population-Attributable Risk ranking. Eleven ML models were built and empirically investigated using the 16 and the 11 risk factors.</div></div><div><h3>Results</h3><div>Our results showed that the 16 features-based classification (maximum AUC of 82.32%) had a slightly better performance than the 11 feature-based (maximum AUC 81.17%) algorithm. The result also showed that Artificial Neural Network (ANN) had the best performance amongst eleven algorithms investigated with AUC of 82.32%, sensitivity of 71.23%, specificity of 80.00%.</div></div><div><h3>Conclusion</h3><div>Machine Learning algorithms predicted stroke occurrence employing major risk factors in Sub-Saharan Africa better than regression models. Machine Learning, especially Artificial Neural Network, is recommended to enhance Afrocentric stroke prediction models for stroke risk factor quantification and control in Africa.</div></div>\",\"PeriodicalId\":74295,\"journal\":{\"name\":\"Neuroscience informatics\",\"volume\":\"5 3\",\"pages\":\"Article 100216\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neuroscience informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772528625000317\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroscience informatics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772528625000317","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

中风是全球第二大致死原因和第三大致残原因,非洲也是其中之一,其负担最大。非洲需要准确的模型来预测和预防中风的发生。本研究的目的是确定用于中风预测的最佳机器学习(ML)算法。方法对来自SIREN数据库的2,118例脑卒中患者和2,118例对照组的4,236例受试者的医学资料进行评估。本研究评估了16个已确定的血管危险因素。这些因素包括:吃饭时在食物中添加盐、心脏病、糖尿病、血脂异常、教育程度、心血管疾病家族史、高血压、收入、绿叶蔬菜摄入量低、肥胖、缺乏体育锻炼、经常吃肉、经常吃糖、吸烟、压力和使用烟草。从这些因素中,我们还使用人口归因风险排名选择了11个最重要的风险因素。建立了11个ML模型,并对16个和11个危险因素进行了实证研究。结果基于16个特征的分类算法(最大AUC为82.32%)的分类性能略好于基于11个特征的分类算法(最大AUC为81.17%)。人工神经网络(Artificial Neural Network, ANN)的AUC为82.32%,灵敏度为71.23%,特异性为80.00%,在11种算法中表现最佳。结论机器学习算法预测撒哈拉以南非洲地区卒中发生的主要危险因素优于回归模型。建议使用机器学习,特别是人工神经网络来增强以非洲为中心的中风预测模型,用于非洲中风风险因素的量化和控制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting stroke with machine learning techniques in a sub-Saharan African population

Background

Stroke is the second leading cause of death and the third leading cause of disability globally, including Africa, which bears its largest burden. Accurate models are needed in Africa to predict and prevent stroke occurrence. The aim of this study was to identify the best machine learning (ML) algorithm for stroke prediction.

Methods

We assessed medical data of 4,236 subjects comprising 2,118 stroke patients and 2,118 controls from the SIREN database. Sixteen established vascular risk factors were evaluated in this study. These are addition of salt to food at table during eating, cardiac disease, diabetes mellitus, dyslipidemia, education, family history of cardiovascular disease, hypertension, income, low green leafy vegetable consumption, obesity, physical inactivity, regular meat consumption, regular sugar consumption, smoking, stress and use of tobacco. From these, we also selected the 11 topmost risk factors using Population-Attributable Risk ranking. Eleven ML models were built and empirically investigated using the 16 and the 11 risk factors.

Results

Our results showed that the 16 features-based classification (maximum AUC of 82.32%) had a slightly better performance than the 11 feature-based (maximum AUC 81.17%) algorithm. The result also showed that Artificial Neural Network (ANN) had the best performance amongst eleven algorithms investigated with AUC of 82.32%, sensitivity of 71.23%, specificity of 80.00%.

Conclusion

Machine Learning algorithms predicted stroke occurrence employing major risk factors in Sub-Saharan Africa better than regression models. Machine Learning, especially Artificial Neural Network, is recommended to enhance Afrocentric stroke prediction models for stroke risk factor quantification and control in Africa.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Neuroscience informatics
Neuroscience informatics Surgery, Radiology and Imaging, Information Systems, Neurology, Artificial Intelligence, Computer Science Applications, Signal Processing, Critical Care and Intensive Care Medicine, Health Informatics, Clinical Neurology, Pathology and Medical Technology
自引率
0.00%
发文量
0
审稿时长
57 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信