Stroke Prediction using Optimization and Exploratory Data Analysis

Ravneet Kaur, Kaustubh Hambarde, Reuben George, Arwa Hussain, Chaitanya Gomkar, S. Sonawani
{"title":"Stroke Prediction using Optimization and Exploratory Data Analysis","authors":"Ravneet Kaur, Kaustubh Hambarde, Reuben George, Arwa Hussain, Chaitanya Gomkar, S. Sonawani","doi":"10.1109/IATMSI56455.2022.10119295","DOIUrl":null,"url":null,"abstract":"A stroke is a medical condition that causes brain damage by rupturing blood vessels. It can also happen if the brain's blood supply and other nutrients are cut off. It is the leading cause of death and disability worldwide., according to the World Health Organization (WHO). It is a potentially fatal illness that primarily affects adults over the age of 65. Doctors devote a significant amount of time and effort to predicting strokes. As a result., the primary goal of the study is to use various Machine Learning approaches to predict the likelihood of stroke occurring using hyper parameter tuning to achieve greater accuracy and optimize the outcomes. After going through the dataset, we discovered that the standard algorithms we used., such as Support Vector Machine (SVC), Decision Tree Classifier, Random Forest Classifier, XGBoost, and KNeighbors, as well as some feature selection methods, could only predict 80 to 85 percent of the time, so we came up with the idea of optimization in machine learning, where we use the technology or concept of hyper parameter tuning, which helped us to gain a prediction of about 95 percent. With this, we also used an Exploratory Data Analysis (EDA) concept for visualization, which helped us to study the attribute. The above-mentioned prognosis was achieved using Hyper Parameter Tuning, which involves checking and analyzing the parameters of each algorithm in such a way that after setting to some predefined parameters, it produces the expected accuracy. To evaluate the data, we employed the EDA approach, in which we compared many associated health behaviors in different combinations with respect to stroke, and each EDA diagram concluded the relationship of these attributes to the cause of stroke. As a result, this study evaluates the performance of various machine learning algorithms that use Hyper parameters tuning with EDA.","PeriodicalId":221211,"journal":{"name":"2022 IEEE Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IATMSI56455.2022.10119295","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A stroke is a medical condition that causes brain damage by rupturing blood vessels. It can also happen if the brain's blood supply and other nutrients are cut off. It is the leading cause of death and disability worldwide., according to the World Health Organization (WHO). It is a potentially fatal illness that primarily affects adults over the age of 65. Doctors devote a significant amount of time and effort to predicting strokes. As a result., the primary goal of the study is to use various Machine Learning approaches to predict the likelihood of stroke occurring using hyper parameter tuning to achieve greater accuracy and optimize the outcomes. After going through the dataset, we discovered that the standard algorithms we used., such as Support Vector Machine (SVC), Decision Tree Classifier, Random Forest Classifier, XGBoost, and KNeighbors, as well as some feature selection methods, could only predict 80 to 85 percent of the time, so we came up with the idea of optimization in machine learning, where we use the technology or concept of hyper parameter tuning, which helped us to gain a prediction of about 95 percent. With this, we also used an Exploratory Data Analysis (EDA) concept for visualization, which helped us to study the attribute. The above-mentioned prognosis was achieved using Hyper Parameter Tuning, which involves checking and analyzing the parameters of each algorithm in such a way that after setting to some predefined parameters, it produces the expected accuracy. To evaluate the data, we employed the EDA approach, in which we compared many associated health behaviors in different combinations with respect to stroke, and each EDA diagram concluded the relationship of these attributes to the cause of stroke. As a result, this study evaluates the performance of various machine learning algorithms that use Hyper parameters tuning with EDA.
基于优化和探索性数据分析的脑卒中预测
中风是一种医学疾病,通过血管破裂导致脑损伤。如果大脑的血液供应和其他营养物质被切断,也会发生这种情况。它是全世界死亡和残疾的主要原因。根据世界卫生组织(世卫组织)的数据。这是一种潜在的致命疾病,主要影响65岁以上的成年人。医生投入了大量的时间和精力来预测中风。因此。,该研究的主要目标是使用各种机器学习方法来预测中风发生的可能性,并使用超参数调整来实现更高的准确性并优化结果。在浏览了数据集之后,我们发现我们使用的标准算法。例如支持向量机(SVC)、决策树分类器、随机森林分类器、XGBoost和KNeighbors,以及一些特征选择方法,只能预测80%到85%的时间,所以我们提出了机器学习优化的想法,我们使用超参数调优的技术或概念,这帮助我们获得了大约95%的预测。为此,我们还使用了探索性数据分析(Exploratory Data Analysis, EDA)概念进行可视化,这有助于我们研究属性。上述预测是通过超参数调优(Hyper Parameter Tuning)实现的。超参数调优包括检查和分析每个算法的参数,在设置一些预定义的参数后,产生预期的精度。为了评估这些数据,我们采用了EDA方法,在这种方法中,我们比较了许多与中风相关的不同组合的健康行为,每个EDA图都总结了这些属性与中风原因的关系。因此,本研究评估了使用EDA超参数调优的各种机器学习算法的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信