{"title":"有监督机器学习方法在心脏病预测中的比较研究","authors":"Meghavi Rana, Mohammad Zia Ur Rehman, S. Jain","doi":"10.1109/vlsidcs53788.2022.9811495","DOIUrl":null,"url":null,"abstract":"With the ever-growing medical data, it became possible to use the data for prediction of diseases using Machine Learning (ML) methods. ML methods have been widely employed in healthcare. In the study, some of the mostly used ML methods such as Support Vector Machine, Naïve Bayes classifier, Random Forest, Decision tree, and K-Nearest Neighbor were used for prediction of heart disease. Further, we aim to provide a comparative analysis of the ML algorithms applied for heart disease prediction using their accuracy metrics. Dataset for the study was taken from Kaggle in .csv format, where data mining steps such as data collection, data cleaning, data preprocessing, and exploratory data analysis have been done. The study highlights the ML methods used for classification, providing the comparative analysis between them. As a result, it was concluded that the random forest gave the highest accuracy rate with the dataset used in the study.","PeriodicalId":307414,"journal":{"name":"2022 IEEE VLSI Device Circuit and System (VLSI DCS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Comparative Study of Supervised Machine Learning Methods for Prediction of Heart Disease\",\"authors\":\"Meghavi Rana, Mohammad Zia Ur Rehman, S. Jain\",\"doi\":\"10.1109/vlsidcs53788.2022.9811495\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the ever-growing medical data, it became possible to use the data for prediction of diseases using Machine Learning (ML) methods. ML methods have been widely employed in healthcare. In the study, some of the mostly used ML methods such as Support Vector Machine, Naïve Bayes classifier, Random Forest, Decision tree, and K-Nearest Neighbor were used for prediction of heart disease. Further, we aim to provide a comparative analysis of the ML algorithms applied for heart disease prediction using their accuracy metrics. Dataset for the study was taken from Kaggle in .csv format, where data mining steps such as data collection, data cleaning, data preprocessing, and exploratory data analysis have been done. The study highlights the ML methods used for classification, providing the comparative analysis between them. As a result, it was concluded that the random forest gave the highest accuracy rate with the dataset used in the study.\",\"PeriodicalId\":307414,\"journal\":{\"name\":\"2022 IEEE VLSI Device Circuit and System (VLSI DCS)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE VLSI Device Circuit and System (VLSI DCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/vlsidcs53788.2022.9811495\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE VLSI Device Circuit and System (VLSI DCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/vlsidcs53788.2022.9811495","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparative Study of Supervised Machine Learning Methods for Prediction of Heart Disease
With the ever-growing medical data, it became possible to use the data for prediction of diseases using Machine Learning (ML) methods. ML methods have been widely employed in healthcare. In the study, some of the mostly used ML methods such as Support Vector Machine, Naïve Bayes classifier, Random Forest, Decision tree, and K-Nearest Neighbor were used for prediction of heart disease. Further, we aim to provide a comparative analysis of the ML algorithms applied for heart disease prediction using their accuracy metrics. Dataset for the study was taken from Kaggle in .csv format, where data mining steps such as data collection, data cleaning, data preprocessing, and exploratory data analysis have been done. The study highlights the ML methods used for classification, providing the comparative analysis between them. As a result, it was concluded that the random forest gave the highest accuracy rate with the dataset used in the study.