双敏感代价随机森林在心脏病检测中的应用

Zhifeng Wang, Xiaoling Tan
{"title":"双敏感代价随机森林在心脏病检测中的应用","authors":"Zhifeng Wang, Xiaoling Tan","doi":"10.1145/3570773.3570867","DOIUrl":null,"url":null,"abstract":"Traditional feature selection algorithms simply compute a feature cost vector to make the random process more tendentious, but do not consider the relative relationship between features, and degenerate into ordinary random forest algorithms when feature differentiation is not significant. In view of this, we propose the dual cost-sensitive random forest algorithm. The algorithm introduces two improvements. 1) Introducing sequential analysis in generating feature vectors, giving dynamic weights to different categories in classification. 2) Introducing cost sensitivity in the decision tree generation stage with the goal of minimum average error. After comparing with logistic regression, random forest, support vector machine and other algorithms, the experimental results show that the method has a lower misclassification rate in heart disease detection, which makes the result classification more reliable and more suitable for practical applications.","PeriodicalId":153475,"journal":{"name":"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of Double Sensitive Cost Random Forest in Heart Disease Detection\",\"authors\":\"Zhifeng Wang, Xiaoling Tan\",\"doi\":\"10.1145/3570773.3570867\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional feature selection algorithms simply compute a feature cost vector to make the random process more tendentious, but do not consider the relative relationship between features, and degenerate into ordinary random forest algorithms when feature differentiation is not significant. In view of this, we propose the dual cost-sensitive random forest algorithm. The algorithm introduces two improvements. 1) Introducing sequential analysis in generating feature vectors, giving dynamic weights to different categories in classification. 2) Introducing cost sensitivity in the decision tree generation stage with the goal of minimum average error. After comparing with logistic regression, random forest, support vector machine and other algorithms, the experimental results show that the method has a lower misclassification rate in heart disease detection, which makes the result classification more reliable and more suitable for practical applications.\",\"PeriodicalId\":153475,\"journal\":{\"name\":\"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3570773.3570867\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd International Symposium on Artificial Intelligence for Medicine Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3570773.3570867","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

传统的特征选择算法简单地计算特征代价向量,使随机过程更具倾向性,但没有考虑特征之间的相对关系,在特征分化不显著时退化为普通的随机森林算法。鉴于此,我们提出了双代价敏感随机森林算法。该算法引入了两个改进。1)在特征向量生成中引入序列分析,在分类中对不同类别赋予动态权值。2)以平均误差最小为目标,在决策树生成阶段引入成本敏感性。通过与逻辑回归、随机森林、支持向量机等算法的对比,实验结果表明,该方法在心脏病检测中的误分类率较低,使得结果分类更加可靠,更适合实际应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Application of Double Sensitive Cost Random Forest in Heart Disease Detection
Traditional feature selection algorithms simply compute a feature cost vector to make the random process more tendentious, but do not consider the relative relationship between features, and degenerate into ordinary random forest algorithms when feature differentiation is not significant. In view of this, we propose the dual cost-sensitive random forest algorithm. The algorithm introduces two improvements. 1) Introducing sequential analysis in generating feature vectors, giving dynamic weights to different categories in classification. 2) Introducing cost sensitivity in the decision tree generation stage with the goal of minimum average error. After comparing with logistic regression, random forest, support vector machine and other algorithms, the experimental results show that the method has a lower misclassification rate in heart disease detection, which makes the result classification more reliable and more suitable for practical applications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信