{"title":"OFHR: Online Streaming Feature Selection With Hierarchical Structure Based on Relief","authors":"Chenxi Wang, Xiaoqing Zhang, Jinkun Chen, Yu Mao, Shaozi Li, Yaojin Lin","doi":"10.1109/ITME53901.2021.00038","DOIUrl":null,"url":null,"abstract":"Hierarchical classification learning, an emerging classification task in machine learning, is an essential topic. In which various feature selection algorithms have been proposed to select informative features for hierarchical classification. How-ever, existing hierarchical feature selection algorithms consider that the feature space of data is completely obtained in advance, and neglect the uncertainty and dynamism, i.e., feature arrives dynamically in an online manner. In this paper, we present an online streaming feature selection framework with hierarchical structure. First, we apply the closeness matrix between internal nodes to the Relief algorithm, which can calculate the weights of the dynamic features. Second, significant features are dynamically selected for each internal node by considering the hierarchical relationships and feature weights between nodes in the tree structure. Moreover, we perform redundant analysis of features by calculating the covariance between features, and then obtain a superior online feature subset for each internal node. Finally, the proposed algorithm is compared with six online streaming feature selection methods on six hierarchical data sets. The experimental results prove that our algorithm can improve the classification accuracy of the classifier by 10% compared to the suboptimal algorithms, which indicates that the algorithm outperforms other comparative algorithms in hierarchical data sets.","PeriodicalId":6774,"journal":{"name":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","volume":"1 1","pages":"140-145"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th International Conference on Information Technology in Medicine and Education (ITME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITME53901.2021.00038","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Hierarchical classification learning, an emerging classification task in machine learning, is an essential topic. In which various feature selection algorithms have been proposed to select informative features for hierarchical classification. How-ever, existing hierarchical feature selection algorithms consider that the feature space of data is completely obtained in advance, and neglect the uncertainty and dynamism, i.e., feature arrives dynamically in an online manner. In this paper, we present an online streaming feature selection framework with hierarchical structure. First, we apply the closeness matrix between internal nodes to the Relief algorithm, which can calculate the weights of the dynamic features. Second, significant features are dynamically selected for each internal node by considering the hierarchical relationships and feature weights between nodes in the tree structure. Moreover, we perform redundant analysis of features by calculating the covariance between features, and then obtain a superior online feature subset for each internal node. Finally, the proposed algorithm is compared with six online streaming feature selection methods on six hierarchical data sets. The experimental results prove that our algorithm can improve the classification accuracy of the classifier by 10% compared to the suboptimal algorithms, which indicates that the algorithm outperforms other comparative algorithms in hierarchical data sets.