Ming Liu, Jianqiang Du, Zhiqing Li, Jigen Luo, Bin Nie, Mengting Zhang
{"title":"Hybrid Multistage Feature Selection Method and its Application in Chinese Medicine","authors":"Ming Liu, Jianqiang Du, Zhiqing Li, Jigen Luo, Bin Nie, Mengting Zhang","doi":"10.1109/ISBP57705.2023.10061301","DOIUrl":null,"url":null,"abstract":"The experimental data on traditional Chinese medicine efficacy has many irrelevant and redundant features, and different feature combinations have different effects. Therefore, we propose a hybrid multistage feature selection algorithm based on approximate Markov blanket and improved black widow algorithm. The first stage remove irrelevant features by the maximum information coefficient. The second stage delete redundant features from clustered searched by approximate Markov blanket by Lasso algorithm to avoid information loss. The third stage search the optimal feature subset by improved black widow algorithm that used the fast reproduction strategy, the child eating mother strategy and the population restriction strategy. The proposed approach is tested on the basic material data of traditional Chinese medicine and 9 UCI datasets, and compared with other feature selection algorithms. The experimental results show that the algorithm can obtain a small number of feature subsets with high accuracy, and has good time performance.","PeriodicalId":309634,"journal":{"name":"2023 International Conference on Intelligent Supercomputing and BioPharma (ISBP)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Intelligent Supercomputing and BioPharma (ISBP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISBP57705.2023.10061301","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The experimental data on traditional Chinese medicine efficacy has many irrelevant and redundant features, and different feature combinations have different effects. Therefore, we propose a hybrid multistage feature selection algorithm based on approximate Markov blanket and improved black widow algorithm. The first stage remove irrelevant features by the maximum information coefficient. The second stage delete redundant features from clustered searched by approximate Markov blanket by Lasso algorithm to avoid information loss. The third stage search the optimal feature subset by improved black widow algorithm that used the fast reproduction strategy, the child eating mother strategy and the population restriction strategy. The proposed approach is tested on the basic material data of traditional Chinese medicine and 9 UCI datasets, and compared with other feature selection algorithms. The experimental results show that the algorithm can obtain a small number of feature subsets with high accuracy, and has good time performance.