{"title":"使用机器学习技术对心脏病进行分类","authors":"Perivitta Rajendran, S. Haw, Palaichamy Naveen","doi":"10.1145/3488466.3488482","DOIUrl":null,"url":null,"abstract":"The most crucial task in the medical field is diagnosing an illness. If a disease is determined at the early stage then many lives can be saved. The purpose of this paper is to use the medical data to predict cardiovascular heart disease using both supervised and unsupervised learning techniques and to show the effects of feature correlation on the classification model with over four different algorithms namely, Logistic Regression, Naive Bayes, Random Forest and Artificial Neural Networks. For the performance assessment, it incorporates F1-score, precision, Area under curve and recall. Overall, Logistic Regression algorithm tends to perform well for both Hungary and Statlog dataset whereas for Cleveland dataset, Artificial Neural Networks performs better than Logistic Regression in terms of accuracy. In terms of area under curve score, Logistic Regression performance is higher in all the dataset compared to Naive Bayes, Random Forest and Artificial Neural Networks. The results tabulated evidently prove that the designed diagnostic system is capable of predicting the risk level of heart disease effectively when compared to other approaches.","PeriodicalId":196340,"journal":{"name":"Proceedings of the 5th International Conference on Digital Technology in Education","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Classification of Heart Disease Using Machine Learning Techniques\",\"authors\":\"Perivitta Rajendran, S. Haw, Palaichamy Naveen\",\"doi\":\"10.1145/3488466.3488482\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The most crucial task in the medical field is diagnosing an illness. If a disease is determined at the early stage then many lives can be saved. The purpose of this paper is to use the medical data to predict cardiovascular heart disease using both supervised and unsupervised learning techniques and to show the effects of feature correlation on the classification model with over four different algorithms namely, Logistic Regression, Naive Bayes, Random Forest and Artificial Neural Networks. For the performance assessment, it incorporates F1-score, precision, Area under curve and recall. Overall, Logistic Regression algorithm tends to perform well for both Hungary and Statlog dataset whereas for Cleveland dataset, Artificial Neural Networks performs better than Logistic Regression in terms of accuracy. In terms of area under curve score, Logistic Regression performance is higher in all the dataset compared to Naive Bayes, Random Forest and Artificial Neural Networks. The results tabulated evidently prove that the designed diagnostic system is capable of predicting the risk level of heart disease effectively when compared to other approaches.\",\"PeriodicalId\":196340,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Digital Technology in Education\",\"volume\":\"71 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Digital Technology in Education\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3488466.3488482\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Digital Technology in Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3488466.3488482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification of Heart Disease Using Machine Learning Techniques
The most crucial task in the medical field is diagnosing an illness. If a disease is determined at the early stage then many lives can be saved. The purpose of this paper is to use the medical data to predict cardiovascular heart disease using both supervised and unsupervised learning techniques and to show the effects of feature correlation on the classification model with over four different algorithms namely, Logistic Regression, Naive Bayes, Random Forest and Artificial Neural Networks. For the performance assessment, it incorporates F1-score, precision, Area under curve and recall. Overall, Logistic Regression algorithm tends to perform well for both Hungary and Statlog dataset whereas for Cleveland dataset, Artificial Neural Networks performs better than Logistic Regression in terms of accuracy. In terms of area under curve score, Logistic Regression performance is higher in all the dataset compared to Naive Bayes, Random Forest and Artificial Neural Networks. The results tabulated evidently prove that the designed diagnostic system is capable of predicting the risk level of heart disease effectively when compared to other approaches.