基于特征选择和特征加权的改进随机森林在CBR系统病例检索中的应用

IF 0.6 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING
{"title":"基于特征选择和特征加权的改进随机森林在CBR系统病例检索中的应用","authors":"","doi":"10.4018/ijsi.293265","DOIUrl":null,"url":null,"abstract":": The medical diagnostic process works very similarly to the Case Based Reasoning (CBR) cycle scheme. CBR is a problem solving approach based on the reuse of past experiences called cases. To improve the performance of the retrieval phase, a Random Forest (RF) model is proposed, in this respect we used this algorithm in three different ways (three different algorithms): Classic Random Forest (CRF) algorithm, Random Forest with Feature Selection (RF_FS) algorithm where we selected the most important attributes and deleted the less important ones and Weighted Random Forest (WRF) algorithm where we weighted the most important attributes by giving them more weight. We did this by multiplying the entropy with the weight corresponding to each attribute.We tested our three algorithms CRF, RF_FS and WRF with CBR on data from 11 medical databases and compared the results they produced. We found that WRF and RF_FS give better results than CRF. The experiemental results show the performance and robustess of the proposed approach.","PeriodicalId":55938,"journal":{"name":"International Journal of Software Innovation","volume":" ","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An improved Random Forest based on Feature Selection and Feature weighting for case retrieval in CBR system Application to medical data\",\"authors\":\"\",\"doi\":\"10.4018/ijsi.293265\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": The medical diagnostic process works very similarly to the Case Based Reasoning (CBR) cycle scheme. CBR is a problem solving approach based on the reuse of past experiences called cases. To improve the performance of the retrieval phase, a Random Forest (RF) model is proposed, in this respect we used this algorithm in three different ways (three different algorithms): Classic Random Forest (CRF) algorithm, Random Forest with Feature Selection (RF_FS) algorithm where we selected the most important attributes and deleted the less important ones and Weighted Random Forest (WRF) algorithm where we weighted the most important attributes by giving them more weight. We did this by multiplying the entropy with the weight corresponding to each attribute.We tested our three algorithms CRF, RF_FS and WRF with CBR on data from 11 medical databases and compared the results they produced. We found that WRF and RF_FS give better results than CRF. The experiemental results show the performance and robustess of the proposed approach.\",\"PeriodicalId\":55938,\"journal\":{\"name\":\"International Journal of Software Innovation\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Software Innovation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/ijsi.293265\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Software Innovation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/ijsi.293265","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

:医学诊断过程的工作原理与基于案例推理(CBR)的循环方案非常相似。CBR是一种基于重用过去经验的问题解决方法,称为案例。为了提高检索阶段的性能,提出了一个随机森林(RF)模型,在这方面,我们以三种不同的方式(三种不同算法)使用了该算法:经典随机森林(CRF)算法,具有特征选择的随机森林(RF_FS)算法,其中我们选择了最重要的属性并删除了不太重要的属性,以及加权随机森林(WRF)算法,在该算法中,我们通过赋予最重要属性更多的权重来对其进行加权。我们通过将熵与每个属性对应的权重相乘来实现这一点。我们在11个医学数据库的数据上测试了我们的三种算法CRF、RF_FS和WRF和CBR,并比较了它们产生的结果。我们发现WRF和RF_FS比CRF给出更好的结果。实验结果表明了该方法的性能和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An improved Random Forest based on Feature Selection and Feature weighting for case retrieval in CBR system Application to medical data
: The medical diagnostic process works very similarly to the Case Based Reasoning (CBR) cycle scheme. CBR is a problem solving approach based on the reuse of past experiences called cases. To improve the performance of the retrieval phase, a Random Forest (RF) model is proposed, in this respect we used this algorithm in three different ways (three different algorithms): Classic Random Forest (CRF) algorithm, Random Forest with Feature Selection (RF_FS) algorithm where we selected the most important attributes and deleted the less important ones and Weighted Random Forest (WRF) algorithm where we weighted the most important attributes by giving them more weight. We did this by multiplying the entropy with the weight corresponding to each attribute.We tested our three algorithms CRF, RF_FS and WRF with CBR on data from 11 medical databases and compared the results they produced. We found that WRF and RF_FS give better results than CRF. The experiemental results show the performance and robustess of the proposed approach.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Software Innovation
International Journal of Software Innovation COMPUTER SCIENCE, SOFTWARE ENGINEERING-
CiteScore
1.40
自引率
0.00%
发文量
118
期刊介绍: The International Journal of Software Innovation (IJSI) covers state-of-the-art research and development in all aspects of evolutionary and revolutionary ideas pertaining to software systems and their development. The journal publishes original papers on both theory and practice that reflect and accommodate the fast-changing nature of daily life. Topics of interest include not only application-independent software systems, but also application-specific software systems like healthcare, education, energy, and entertainment software systems, as well as techniques and methodologies for modeling, developing, validating, maintaining, and reengineering software systems and their environments.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信