{"title":"使用梯度增强模型、随机森林和高斯朴素贝叶斯的软投票集成进行心脏病预测","authors":"Kaustav Sen, Bindu Verma","doi":"10.1109/INCET57972.2023.10170399","DOIUrl":null,"url":null,"abstract":"Heart disease is associated with a high mortality rate because it affects a significant number of people around the world. There is a pressing need for improved diagnostic methods that are both effective and accurate. Techniques from the field of machine learning have been put to extensive use on tabular data from the healthcare sector, where they have proven to be effective in prediction and analysis. To address the issue of the traditional machine learning model’s low accuracy, precision, and recall value, we propose a soft voting meta classifier composed of Catboost, Light-Gradient Boosting Machine, Gaussian Naive Bayes , Random Forest, and XGBoost. The proposed soft voting ensemble outperformed the other models used in this experiment, which was conducted on a fused UCI heart disease and Statlog dataset. The proposed soft voting ensemble model achieved 91.85% accuracy and a 0.9344 Area Under The Curve Score.","PeriodicalId":403008,"journal":{"name":"2023 4th International Conference for Emerging Technology (INCET)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Heart Disease Prediction Using a Soft Voting Ensemble of Gradient Boosting Models, RandomForest, and Gaussian Naive Bayes\",\"authors\":\"Kaustav Sen, Bindu Verma\",\"doi\":\"10.1109/INCET57972.2023.10170399\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Heart disease is associated with a high mortality rate because it affects a significant number of people around the world. There is a pressing need for improved diagnostic methods that are both effective and accurate. Techniques from the field of machine learning have been put to extensive use on tabular data from the healthcare sector, where they have proven to be effective in prediction and analysis. To address the issue of the traditional machine learning model’s low accuracy, precision, and recall value, we propose a soft voting meta classifier composed of Catboost, Light-Gradient Boosting Machine, Gaussian Naive Bayes , Random Forest, and XGBoost. The proposed soft voting ensemble outperformed the other models used in this experiment, which was conducted on a fused UCI heart disease and Statlog dataset. The proposed soft voting ensemble model achieved 91.85% accuracy and a 0.9344 Area Under The Curve Score.\",\"PeriodicalId\":403008,\"journal\":{\"name\":\"2023 4th International Conference for Emerging Technology (INCET)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 4th International Conference for Emerging Technology (INCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INCET57972.2023.10170399\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INCET57972.2023.10170399","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Heart Disease Prediction Using a Soft Voting Ensemble of Gradient Boosting Models, RandomForest, and Gaussian Naive Bayes
Heart disease is associated with a high mortality rate because it affects a significant number of people around the world. There is a pressing need for improved diagnostic methods that are both effective and accurate. Techniques from the field of machine learning have been put to extensive use on tabular data from the healthcare sector, where they have proven to be effective in prediction and analysis. To address the issue of the traditional machine learning model’s low accuracy, precision, and recall value, we propose a soft voting meta classifier composed of Catboost, Light-Gradient Boosting Machine, Gaussian Naive Bayes , Random Forest, and XGBoost. The proposed soft voting ensemble outperformed the other models used in this experiment, which was conducted on a fused UCI heart disease and Statlog dataset. The proposed soft voting ensemble model achieved 91.85% accuracy and a 0.9344 Area Under The Curve Score.