{"title":"双敏感代价随机森林在心脏病检测中的应用","authors":"Zhifeng Wang, Xiaoling Tan","doi":"10.1145/3570773.3570867","DOIUrl":null,"url":null,"abstract":"Traditional feature selection algorithms simply compute a feature cost vector to make the random process more tendentious, but do not consider the relative relationship between features, and degenerate into ordinary random forest algorithms when feature differentiation is not significant. In view of this, we propose the dual cost-sensitive random forest algorithm. The algorithm introduces two improvements. 1) Introducing sequential analysis in generating feature vectors, giving dynamic weights to different categories in classification. 2) Introducing cost sensitivity in the decision tree generation stage with the goal of minimum average error. After comparing with logistic regression, random forest, support vector machine and other algorithms, the experimental results show that the method has a lower misclassification rate in heart disease detection, which makes the result classification more reliable and more suitable for practical applications.","PeriodicalId":153475,"journal":{"name":"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of Double Sensitive Cost Random Forest in Heart Disease Detection\",\"authors\":\"Zhifeng Wang, Xiaoling Tan\",\"doi\":\"10.1145/3570773.3570867\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional feature selection algorithms simply compute a feature cost vector to make the random process more tendentious, but do not consider the relative relationship between features, and degenerate into ordinary random forest algorithms when feature differentiation is not significant. In view of this, we propose the dual cost-sensitive random forest algorithm. The algorithm introduces two improvements. 1) Introducing sequential analysis in generating feature vectors, giving dynamic weights to different categories in classification. 2) Introducing cost sensitivity in the decision tree generation stage with the goal of minimum average error. After comparing with logistic regression, random forest, support vector machine and other algorithms, the experimental results show that the method has a lower misclassification rate in heart disease detection, which makes the result classification more reliable and more suitable for practical applications.\",\"PeriodicalId\":153475,\"journal\":{\"name\":\"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3570773.3570867\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3570773.3570867","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of Double Sensitive Cost Random Forest in Heart Disease Detection
Traditional feature selection algorithms simply compute a feature cost vector to make the random process more tendentious, but do not consider the relative relationship between features, and degenerate into ordinary random forest algorithms when feature differentiation is not significant. In view of this, we propose the dual cost-sensitive random forest algorithm. The algorithm introduces two improvements. 1) Introducing sequential analysis in generating feature vectors, giving dynamic weights to different categories in classification. 2) Introducing cost sensitivity in the decision tree generation stage with the goal of minimum average error. After comparing with logistic regression, random forest, support vector machine and other algorithms, the experimental results show that the method has a lower misclassification rate in heart disease detection, which makes the result classification more reliable and more suitable for practical applications.