双敏感代价随机森林在心脏病检测中的应用

Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences Pub Date : 2022-10-13 DOI:10.1145/3570773.3570867

Zhifeng Wang, Xiaoling Tan

{"title":"双敏感代价随机森林在心脏病检测中的应用","authors":"Zhifeng Wang, Xiaoling Tan","doi":"10.1145/3570773.3570867","DOIUrl":null,"url":null,"abstract":"Traditional feature selection algorithms simply compute a feature cost vector to make the random process more tendentious, but do not consider the relative relationship between features, and degenerate into ordinary random forest algorithms when feature differentiation is not significant. In view of this, we propose the dual cost-sensitive random forest algorithm. The algorithm introduces two improvements. 1) Introducing sequential analysis in generating feature vectors, giving dynamic weights to different categories in classification. 2) Introducing cost sensitivity in the decision tree generation stage with the goal of minimum average error. After comparing with logistic regression, random forest, support vector machine and other algorithms, the experimental results show that the method has a lower misclassification rate in heart disease detection, which makes the result classification more reliable and more suitable for practical applications.","PeriodicalId":153475,"journal":{"name":"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of Double Sensitive Cost Random Forest in Heart Disease Detection\",\"authors\":\"Zhifeng Wang, Xiaoling Tan\",\"doi\":\"10.1145/3570773.3570867\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional feature selection algorithms simply compute a feature cost vector to make the random process more tendentious, but do not consider the relative relationship between features, and degenerate into ordinary random forest algorithms when feature differentiation is not significant. In view of this, we propose the dual cost-sensitive random forest algorithm. The algorithm introduces two improvements. 1) Introducing sequential analysis in generating feature vectors, giving dynamic weights to different categories in classification. 2) Introducing cost sensitivity in the decision tree generation stage with the goal of minimum average error. After comparing with logistic regression, random forest, support vector machine and other algorithms, the experimental results show that the method has a lower misclassification rate in heart disease detection, which makes the result classification more reliable and more suitable for practical applications.\",\"PeriodicalId\":153475,\"journal\":{\"name\":\"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3570773.3570867\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3570773.3570867","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

传统的特征选择算法简单地计算特征代价向量，使随机过程更具倾向性，但没有考虑特征之间的相对关系，在特征分化不显著时退化为普通的随机森林算法。鉴于此，我们提出了双代价敏感随机森林算法。该算法引入了两个改进。1)在特征向量生成中引入序列分析，在分类中对不同类别赋予动态权值。2)以平均误差最小为目标，在决策树生成阶段引入成本敏感性。通过与逻辑回归、随机森林、支持向量机等算法的对比，实验结果表明，该方法在心脏病检测中的误分类率较低，使得结果分类更加可靠，更适合实际应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Application of Double Sensitive Cost Random Forest in Heart Disease Detection

Traditional feature selection algorithms simply compute a feature cost vector to make the random process more tendentious, but do not consider the relative relationship between features, and degenerate into ordinary random forest algorithms when feature differentiation is not significant. In view of this, we propose the dual cost-sensitive random forest algorithm. The algorithm introduces two improvements. 1) Introducing sequential analysis in generating feature vectors, giving dynamic weights to different categories in classification. 2) Introducing cost sensitivity in the decision tree generation stage with the goal of minimum average error. After comparing with logistic regression, random forest, support vector machine and other algorithms, the experimental results show that the method has a lower misclassification rate in heart disease detection, which makes the result classification more reliable and more suitable for practical applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences

自引率

0.00%

发文量