脑卒中预测数据集的集成学习方法及分析

Utkrisht Singh, A. Jena, Mohammed Taha Haque
{"title":"脑卒中预测数据集的集成学习方法及分析","authors":"Utkrisht Singh, A. Jena, Mohammed Taha Haque","doi":"10.1109/ASSIC55218.2022.10088363","DOIUrl":null,"url":null,"abstract":"A stroke is an illness that results in traumatic brain injury by tearing blood vessels. A brain stroke can also occur if blood flow and other nutrients to the brain are interrupted abruptly. It is one of the major global causes of disability and death, as per the report given by the World Health Organization (WHO). With increased convergence amongst technology and medical diagnosis, practitioners create possibilities for improved management of patients by comprehensively quarrying as well as archiving patient's records containing their medical background. As a result, it becomes critical to investigate the interdependence of these factors (risk) in patient's medical records and comprehend the relative impact of these factors for the prediction of brain stroke. This research establishes an early estimation of stroke diseases by combining the existence of hypertension, heart disease, body mass index, smoking status, prior stroke, age, and some other feature attributes. For forecasting the stroke, various statistical methods and five different ML models including some ensemble learning techniques like Support Vector Machine (SVM), Random Forest (RF), Ada-Boost Classifier (ABC), Decision Tree Classifier (DTC), and XG-Boost Classifier (XGB) were used to train the feature attributes. Furthermore, the proposed research work has accomplished an accuracy of 95.08 percent, with the XG-Boost Classifier outperforming the Machine Learning (ML) Models. As a result, XG-Boost is nearly the most preferable classifier for predicting strokes, which can be used as a reference model by physicians and also used by patients considering aid in the early detection of a potential stroke.","PeriodicalId":441406,"journal":{"name":"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Ensemble Learning Approach and Analysis for Stroke Prediction Dataset\",\"authors\":\"Utkrisht Singh, A. Jena, Mohammed Taha Haque\",\"doi\":\"10.1109/ASSIC55218.2022.10088363\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A stroke is an illness that results in traumatic brain injury by tearing blood vessels. A brain stroke can also occur if blood flow and other nutrients to the brain are interrupted abruptly. It is one of the major global causes of disability and death, as per the report given by the World Health Organization (WHO). With increased convergence amongst technology and medical diagnosis, practitioners create possibilities for improved management of patients by comprehensively quarrying as well as archiving patient's records containing their medical background. As a result, it becomes critical to investigate the interdependence of these factors (risk) in patient's medical records and comprehend the relative impact of these factors for the prediction of brain stroke. This research establishes an early estimation of stroke diseases by combining the existence of hypertension, heart disease, body mass index, smoking status, prior stroke, age, and some other feature attributes. For forecasting the stroke, various statistical methods and five different ML models including some ensemble learning techniques like Support Vector Machine (SVM), Random Forest (RF), Ada-Boost Classifier (ABC), Decision Tree Classifier (DTC), and XG-Boost Classifier (XGB) were used to train the feature attributes. Furthermore, the proposed research work has accomplished an accuracy of 95.08 percent, with the XG-Boost Classifier outperforming the Machine Learning (ML) Models. As a result, XG-Boost is nearly the most preferable classifier for predicting strokes, which can be used as a reference model by physicians and also used by patients considering aid in the early detection of a potential stroke.\",\"PeriodicalId\":441406,\"journal\":{\"name\":\"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASSIC55218.2022.10088363\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASSIC55218.2022.10088363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

中风是一种通过撕裂血管导致创伤性脑损伤的疾病。如果大脑的血液流动和其他营养物质突然中断,也会发生脑中风。根据世界卫生组织(世卫组织)的报告,它是全球致残和死亡的主要原因之一。随着技术和医疗诊断之间的日益融合,从业人员通过全面采集和存档包含其医疗背景的患者记录,为改进患者管理创造了可能性。因此,在患者的医疗记录中调查这些因素(风险)的相互依赖性,并了解这些因素对脑卒中预测的相对影响就变得至关重要。本研究结合高血压、心脏病、体重指数、吸烟状况、卒中史、年龄等特征属性,建立对卒中疾病的早期估计。为了预测中风,使用了各种统计方法和五种不同的ML模型,包括支持向量机(SVM)、随机森林(RF)、Ada-Boost分类器(ABC)、决策树分类器(DTC)和XG-Boost分类器(XGB)等集成学习技术来训练特征属性。此外,提出的研究工作已经完成了95.08%的准确率,XG-Boost分类器优于机器学习(ML)模型。因此,XG-Boost几乎是预测中风最可取的分类器,它可以被医生用作参考模型,也可以被考虑帮助早期发现潜在中风的患者使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Ensemble Learning Approach and Analysis for Stroke Prediction Dataset
A stroke is an illness that results in traumatic brain injury by tearing blood vessels. A brain stroke can also occur if blood flow and other nutrients to the brain are interrupted abruptly. It is one of the major global causes of disability and death, as per the report given by the World Health Organization (WHO). With increased convergence amongst technology and medical diagnosis, practitioners create possibilities for improved management of patients by comprehensively quarrying as well as archiving patient's records containing their medical background. As a result, it becomes critical to investigate the interdependence of these factors (risk) in patient's medical records and comprehend the relative impact of these factors for the prediction of brain stroke. This research establishes an early estimation of stroke diseases by combining the existence of hypertension, heart disease, body mass index, smoking status, prior stroke, age, and some other feature attributes. For forecasting the stroke, various statistical methods and five different ML models including some ensemble learning techniques like Support Vector Machine (SVM), Random Forest (RF), Ada-Boost Classifier (ABC), Decision Tree Classifier (DTC), and XG-Boost Classifier (XGB) were used to train the feature attributes. Furthermore, the proposed research work has accomplished an accuracy of 95.08 percent, with the XG-Boost Classifier outperforming the Machine Learning (ML) Models. As a result, XG-Boost is nearly the most preferable classifier for predicting strokes, which can be used as a reference model by physicians and also used by patients considering aid in the early detection of a potential stroke.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信