随机森林树投票函数形成的定位方法——以解决结核类型鉴别问题为例

O. Matviichuk, Oksana Biloshytska, O. Horodetska, V. Pavlov, M. Linnik, I. Nastenko
{"title":"随机森林树投票函数形成的定位方法——以解决结核类型鉴别问题为例","authors":"O. Matviichuk, Oksana Biloshytska, O. Horodetska, V. Pavlov, M. Linnik, I. Nastenko","doi":"10.1109/CSIT56902.2022.10000450","DOIUrl":null,"url":null,"abstract":"The paper proposes a new voting technology for random forest trees – the Positional Approach to the Voting Function Formation (PAVFF). In contrast to existing forms of organizing the voting of random forest trees, the paper proposes to change the subjects of voting and to use as such individual finite elements of the tree, with weights determined in accordance with the competences of new voting units. Each forest tree in the voting function is represented by its individual branch (voting unit) with the corresponding competence level assigned at the stage of tree finite element verification. Furthermore, we propose different mechanisms for organizing the received units in the voting process. The effectiveness of the new mechanism is shown by the example of the differentiation problem of drug-sensitive and drug-resistant forms of tuberculosis. The task feature space is formed by the ROI (regions of interest) textural characteristics on the patient’s lungs CT scan. The initial feature space was composed of the elements of a few textural characteristic matrices. From over half a million input features, a few optimal ensembles were selected to form random forest trees. We used intra- and inter-class variance selection techniques for this purpose, with the final selection made by a genetic algorithm using a combined correlation criterion. After verification of voting units (finite elements of trees) 3 variants of voting by competence were formed: by the most competent unit, weighted average participation of all participants, and group voting with coefficient revaluation by the Group Method of Data Handling. The results were compared with similar voting by random forest trees. A 5% improvement in classification quality is shown.","PeriodicalId":282561,"journal":{"name":"2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT)","volume":"153 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Positional Approach to the Voting Function Formation of Random Forest Trees as an Example of Solving the Differentiating Tuberculosis Forms Problem\",\"authors\":\"O. Matviichuk, Oksana Biloshytska, O. Horodetska, V. Pavlov, M. Linnik, I. Nastenko\",\"doi\":\"10.1109/CSIT56902.2022.10000450\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper proposes a new voting technology for random forest trees – the Positional Approach to the Voting Function Formation (PAVFF). In contrast to existing forms of organizing the voting of random forest trees, the paper proposes to change the subjects of voting and to use as such individual finite elements of the tree, with weights determined in accordance with the competences of new voting units. Each forest tree in the voting function is represented by its individual branch (voting unit) with the corresponding competence level assigned at the stage of tree finite element verification. Furthermore, we propose different mechanisms for organizing the received units in the voting process. The effectiveness of the new mechanism is shown by the example of the differentiation problem of drug-sensitive and drug-resistant forms of tuberculosis. The task feature space is formed by the ROI (regions of interest) textural characteristics on the patient’s lungs CT scan. The initial feature space was composed of the elements of a few textural characteristic matrices. From over half a million input features, a few optimal ensembles were selected to form random forest trees. We used intra- and inter-class variance selection techniques for this purpose, with the final selection made by a genetic algorithm using a combined correlation criterion. After verification of voting units (finite elements of trees) 3 variants of voting by competence were formed: by the most competent unit, weighted average participation of all participants, and group voting with coefficient revaluation by the Group Method of Data Handling. The results were compared with similar voting by random forest trees. A 5% improvement in classification quality is shown.\",\"PeriodicalId\":282561,\"journal\":{\"name\":\"2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT)\",\"volume\":\"153 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSIT56902.2022.10000450\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSIT56902.2022.10000450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

本文提出了一种新的随机森林树投票技术——投票函数形成的位置法(PAVFF)。与现有的随机森林树投票组织形式相比,本文提出改变投票主体,使用随机森林树的单个有限元作为投票主体,并根据新投票单元的能力确定权重。投票函数中的每棵森林树由其单独的分支(投票单元)表示,并在树有限元验证阶段分配相应的能力等级。此外,我们提出了在投票过程中组织接收单位的不同机制。药物敏感型和耐药型结核病的鉴别问题的例子表明了新机制的有效性。任务特征空间由患者肺部CT扫描的感兴趣区域(ROI)纹理特征组成。初始特征空间由几个纹理特征矩阵的元素组成。从超过50万个输入特征中,选择一些最优的集合形成随机森林树。为此,我们使用了类内和类间方差选择技术,并通过使用组合相关准则的遗传算法进行最终选择。在对投票单元(树的有限元)进行验证后,形成了3种能力投票的变体:最具能力的单位投票、所有参与者的加权平均参与投票、采用数据处理的分组方法进行系数重估的分组投票。结果与随机森林树木的类似投票进行了比较。结果表明,分类质量提高了5%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Positional Approach to the Voting Function Formation of Random Forest Trees as an Example of Solving the Differentiating Tuberculosis Forms Problem
The paper proposes a new voting technology for random forest trees – the Positional Approach to the Voting Function Formation (PAVFF). In contrast to existing forms of organizing the voting of random forest trees, the paper proposes to change the subjects of voting and to use as such individual finite elements of the tree, with weights determined in accordance with the competences of new voting units. Each forest tree in the voting function is represented by its individual branch (voting unit) with the corresponding competence level assigned at the stage of tree finite element verification. Furthermore, we propose different mechanisms for organizing the received units in the voting process. The effectiveness of the new mechanism is shown by the example of the differentiation problem of drug-sensitive and drug-resistant forms of tuberculosis. The task feature space is formed by the ROI (regions of interest) textural characteristics on the patient’s lungs CT scan. The initial feature space was composed of the elements of a few textural characteristic matrices. From over half a million input features, a few optimal ensembles were selected to form random forest trees. We used intra- and inter-class variance selection techniques for this purpose, with the final selection made by a genetic algorithm using a combined correlation criterion. After verification of voting units (finite elements of trees) 3 variants of voting by competence were formed: by the most competent unit, weighted average participation of all participants, and group voting with coefficient revaluation by the Group Method of Data Handling. The results were compared with similar voting by random forest trees. A 5% improvement in classification quality is shown.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信