Vijai M Moorthy, Bhupal N Dharamsoth, Vijayalakshmi Muthukaruppan, Arul Elango, Kalaiarasi Ganesan
{"title":"使用数据挖掘和机器学习解决方案预测冠心病。","authors":"Vijai M Moorthy, Bhupal N Dharamsoth, Vijayalakshmi Muthukaruppan, Arul Elango, Kalaiarasi Ganesan","doi":"10.1590/0001-3765202520240811","DOIUrl":null,"url":null,"abstract":"<p><p>This research focuses on predicting cardiovascular disease using machine learning classification strategies. The study presents a unique approach by integrating multiple machine learning techniques, leveraging the strengths of Random Forest and Gradient Boosting. The authors developed a novel ensemble learning model, combining Linear Regression, Random Forest, and Gradient Boosting algorithms, optimized using Bayesian hyperparameter tuning. The model demonstrated superior performance in predicting CVD outcomes, with classification accuracy of 95.5%, 94.26%, and 98.3% for Linear Regression, Decision Tree, and Gradient Boosted methods, respectively. The true positive rate for the GB algorithm's predictions of patients was 98.3%. The study hypothesizes that the GB method predicts the Framingham dataset better than other algorithms using 4240 samples.</p>","PeriodicalId":7776,"journal":{"name":"Anais da Academia Brasileira de Ciencias","volume":"97 3","pages":"e20240811"},"PeriodicalIF":1.1000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting Coronary Heart Disease Using Data Mining and Machine Learning Solutions.\",\"authors\":\"Vijai M Moorthy, Bhupal N Dharamsoth, Vijayalakshmi Muthukaruppan, Arul Elango, Kalaiarasi Ganesan\",\"doi\":\"10.1590/0001-3765202520240811\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This research focuses on predicting cardiovascular disease using machine learning classification strategies. The study presents a unique approach by integrating multiple machine learning techniques, leveraging the strengths of Random Forest and Gradient Boosting. The authors developed a novel ensemble learning model, combining Linear Regression, Random Forest, and Gradient Boosting algorithms, optimized using Bayesian hyperparameter tuning. The model demonstrated superior performance in predicting CVD outcomes, with classification accuracy of 95.5%, 94.26%, and 98.3% for Linear Regression, Decision Tree, and Gradient Boosted methods, respectively. The true positive rate for the GB algorithm's predictions of patients was 98.3%. The study hypothesizes that the GB method predicts the Framingham dataset better than other algorithms using 4240 samples.</p>\",\"PeriodicalId\":7776,\"journal\":{\"name\":\"Anais da Academia Brasileira de Ciencias\",\"volume\":\"97 3\",\"pages\":\"e20240811\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2025-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Anais da Academia Brasileira de Ciencias\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1590/0001-3765202520240811\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Anais da Academia Brasileira de Ciencias","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1590/0001-3765202520240811","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Predicting Coronary Heart Disease Using Data Mining and Machine Learning Solutions.
This research focuses on predicting cardiovascular disease using machine learning classification strategies. The study presents a unique approach by integrating multiple machine learning techniques, leveraging the strengths of Random Forest and Gradient Boosting. The authors developed a novel ensemble learning model, combining Linear Regression, Random Forest, and Gradient Boosting algorithms, optimized using Bayesian hyperparameter tuning. The model demonstrated superior performance in predicting CVD outcomes, with classification accuracy of 95.5%, 94.26%, and 98.3% for Linear Regression, Decision Tree, and Gradient Boosted methods, respectively. The true positive rate for the GB algorithm's predictions of patients was 98.3%. The study hypothesizes that the GB method predicts the Framingham dataset better than other algorithms using 4240 samples.
期刊介绍:
The Brazilian Academy of Sciences (BAS) publishes its journal, Annals of the Brazilian Academy of Sciences (AABC, in its Brazilianportuguese acronym ), every 3 months, being the oldest journal in Brazil with conkinuous distribukion, daking back to 1929. This scienkihic journal aims to publish the advances in scienkihic research from both Brazilian and foreigner scienkists, who work in the main research centers in the whole world, always looking for excellence.
Essenkially a mulkidisciplinary journal, the AABC cover, with both reviews and original researches, the diverse areas represented in the Academy, such as Biology, Physics, Biomedical Sciences, Chemistry, Agrarian Sciences, Engineering, Mathemakics, Social, Health and Earth Sciences.