Positional Approach to the Voting Function Formation of Random Forest Trees as an Example of Solving the Differentiating Tuberculosis Forms Problem

2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT) Pub Date : 2022-11-10 DOI:10.1109/CSIT56902.2022.10000450

O. Matviichuk, Oksana Biloshytska, O. Horodetska, V. Pavlov, M. Linnik, I. Nastenko

{"title":"Positional Approach to the Voting Function Formation of Random Forest Trees as an Example of Solving the Differentiating Tuberculosis Forms Problem","authors":"O. Matviichuk, Oksana Biloshytska, O. Horodetska, V. Pavlov, M. Linnik, I. Nastenko","doi":"10.1109/CSIT56902.2022.10000450","DOIUrl":null,"url":null,"abstract":"The paper proposes a new voting technology for random forest trees – the Positional Approach to the Voting Function Formation (PAVFF). In contrast to existing forms of organizing the voting of random forest trees, the paper proposes to change the subjects of voting and to use as such individual finite elements of the tree, with weights determined in accordance with the competences of new voting units. Each forest tree in the voting function is represented by its individual branch (voting unit) with the corresponding competence level assigned at the stage of tree finite element verification. Furthermore, we propose different mechanisms for organizing the received units in the voting process. The effectiveness of the new mechanism is shown by the example of the differentiation problem of drug-sensitive and drug-resistant forms of tuberculosis. The task feature space is formed by the ROI (regions of interest) textural characteristics on the patient’s lungs CT scan. The initial feature space was composed of the elements of a few textural characteristic matrices. From over half a million input features, a few optimal ensembles were selected to form random forest trees. We used intra- and inter-class variance selection techniques for this purpose, with the final selection made by a genetic algorithm using a combined correlation criterion. After verification of voting units (finite elements of trees) 3 variants of voting by competence were formed: by the most competent unit, weighted average participation of all participants, and group voting with coefficient revaluation by the Group Method of Data Handling. The results were compared with similar voting by random forest trees. A 5% improvement in classification quality is shown.","PeriodicalId":282561,"journal":{"name":"2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT)","volume":"153 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSIT56902.2022.10000450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

The paper proposes a new voting technology for random forest trees – the Positional Approach to the Voting Function Formation (PAVFF). In contrast to existing forms of organizing the voting of random forest trees, the paper proposes to change the subjects of voting and to use as such individual finite elements of the tree, with weights determined in accordance with the competences of new voting units. Each forest tree in the voting function is represented by its individual branch (voting unit) with the corresponding competence level assigned at the stage of tree finite element verification. Furthermore, we propose different mechanisms for organizing the received units in the voting process. The effectiveness of the new mechanism is shown by the example of the differentiation problem of drug-sensitive and drug-resistant forms of tuberculosis. The task feature space is formed by the ROI (regions of interest) textural characteristics on the patient’s lungs CT scan. The initial feature space was composed of the elements of a few textural characteristic matrices. From over half a million input features, a few optimal ensembles were selected to form random forest trees. We used intra- and inter-class variance selection techniques for this purpose, with the final selection made by a genetic algorithm using a combined correlation criterion. After verification of voting units (finite elements of trees) 3 variants of voting by competence were formed: by the most competent unit, weighted average participation of all participants, and group voting with coefficient revaluation by the Group Method of Data Handling. The results were compared with similar voting by random forest trees. A 5% improvement in classification quality is shown.

查看原文本刊更多论文

随机森林树投票函数形成的定位方法——以解决结核类型鉴别问题为例

本文提出了一种新的随机森林树投票技术——投票函数形成的位置法(PAVFF)。与现有的随机森林树投票组织形式相比，本文提出改变投票主体，使用随机森林树的单个有限元作为投票主体，并根据新投票单元的能力确定权重。投票函数中的每棵森林树由其单独的分支(投票单元)表示，并在树有限元验证阶段分配相应的能力等级。此外，我们提出了在投票过程中组织接收单位的不同机制。药物敏感型和耐药型结核病的鉴别问题的例子表明了新机制的有效性。任务特征空间由患者肺部CT扫描的感兴趣区域(ROI)纹理特征组成。初始特征空间由几个纹理特征矩阵的元素组成。从超过50万个输入特征中，选择一些最优的集合形成随机森林树。为此，我们使用了类内和类间方差选择技术，并通过使用组合相关准则的遗传算法进行最终选择。在对投票单元(树的有限元)进行验证后，形成了3种能力投票的变体:最具能力的单位投票、所有参与者的加权平均参与投票、采用数据处理的分组方法进行系数重估的分组投票。结果与随机森林树木的类似投票进行了比较。结果表明，分类质量提高了5%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT)

自引率

0.00%

发文量