Harsh Agrawal, Janki Chandiwala, Sarvesh Agrawal, Y. Goyal
{"title":"使用探索性数据分析的机器学习预测心力衰竭","authors":"Harsh Agrawal, Janki Chandiwala, Sarvesh Agrawal, Y. Goyal","doi":"10.1109/CONIT51480.2021.9498561","DOIUrl":null,"url":null,"abstract":"According to WHO, cardiovascular diseases are the number 1 cause of death globally. It causes the death of more than 12 million people every year worldwide. The main issue that needs to be resolved is that one should be warned well before time to take precautionary measures. Thus, in this paper, we propose a radical solution based on ensemble learning combining 10 different classification algorithms namely AdaBoost, CatBoost, Decision Trees, KNN, Logistic regression, Light GBM, Gaussian Naïve Bayes, Random Forest, SVM and XGBoost. This ensemble model was able to achieve a test accuracy of 85.2% and test recall of 87.50%. We used the data collected from the Framingham Heart study which includes 15 attributes and 4200+ records. Moreover, we performed extensive Exploratory Data Analysis to understand the importance of each attribute in causing heart failure.","PeriodicalId":426131,"journal":{"name":"2021 International Conference on Intelligent Technologies (CONIT)","volume":"173 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Heart Failure Prediction using Machine Learning with Exploratory Data Analysis\",\"authors\":\"Harsh Agrawal, Janki Chandiwala, Sarvesh Agrawal, Y. Goyal\",\"doi\":\"10.1109/CONIT51480.2021.9498561\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"According to WHO, cardiovascular diseases are the number 1 cause of death globally. It causes the death of more than 12 million people every year worldwide. The main issue that needs to be resolved is that one should be warned well before time to take precautionary measures. Thus, in this paper, we propose a radical solution based on ensemble learning combining 10 different classification algorithms namely AdaBoost, CatBoost, Decision Trees, KNN, Logistic regression, Light GBM, Gaussian Naïve Bayes, Random Forest, SVM and XGBoost. This ensemble model was able to achieve a test accuracy of 85.2% and test recall of 87.50%. We used the data collected from the Framingham Heart study which includes 15 attributes and 4200+ records. Moreover, we performed extensive Exploratory Data Analysis to understand the importance of each attribute in causing heart failure.\",\"PeriodicalId\":426131,\"journal\":{\"name\":\"2021 International Conference on Intelligent Technologies (CONIT)\",\"volume\":\"173 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Intelligent Technologies (CONIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CONIT51480.2021.9498561\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Intelligent Technologies (CONIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONIT51480.2021.9498561","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Heart Failure Prediction using Machine Learning with Exploratory Data Analysis
According to WHO, cardiovascular diseases are the number 1 cause of death globally. It causes the death of more than 12 million people every year worldwide. The main issue that needs to be resolved is that one should be warned well before time to take precautionary measures. Thus, in this paper, we propose a radical solution based on ensemble learning combining 10 different classification algorithms namely AdaBoost, CatBoost, Decision Trees, KNN, Logistic regression, Light GBM, Gaussian Naïve Bayes, Random Forest, SVM and XGBoost. This ensemble model was able to achieve a test accuracy of 85.2% and test recall of 87.50%. We used the data collected from the Framingham Heart study which includes 15 attributes and 4200+ records. Moreover, we performed extensive Exploratory Data Analysis to understand the importance of each attribute in causing heart failure.