{"title":"基于互信息和二元灰狼瀑布模型的特征选择","authors":"Bibhuprasad Sahu, Sujata Dash","doi":"10.1109/APSIT58554.2023.10201689","DOIUrl":null,"url":null,"abstract":"This article aims to identify a way to predict cancer by analyzing gene expression data from microarrays. The focus is on selecting specific features through metaheuristic search algorithms that can help determine the optimal global and local features using population and neighborhood-based methods. Essentially, the goal is to use advanced technology to identify potential biomarkers for cancer prediction, through a novel computer-aided diagnostic tool for the classification of cancer samples using gene expression data. The model is known as JMR-CR with waterfall GWO and operates in two distinct phases. In the initial phase, the JMI-CR algorithm is employed to select the most relevant features from the dataset by utilizing a novel mutual information technique called joint mutual information. In the second phase, the waterfall grey wolf optimization algorithm is used to identify the optimal features. To assess the performance of the proposed model is evaluated using two classification algorithms, namely support vector machine (SVM) and K nearest neighbor (KNN). The proposed model offers several advantages. It addresses the challenges posed by higher dimensionality and class imbalance problems by utilizing the waterfall GWO model, resulting in increased classification accuracy. The model is tested on various cancer microarray gene expression datasets, and the experimental results demonstrate that the proposed hybrid model outperforms other existing models in terms of generalization performance and testing accuracy.","PeriodicalId":170044,"journal":{"name":"2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)","volume":"333 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature Selection With Novel Mutual Information and Binary Grey Wolf Waterfall Model\",\"authors\":\"Bibhuprasad Sahu, Sujata Dash\",\"doi\":\"10.1109/APSIT58554.2023.10201689\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article aims to identify a way to predict cancer by analyzing gene expression data from microarrays. The focus is on selecting specific features through metaheuristic search algorithms that can help determine the optimal global and local features using population and neighborhood-based methods. Essentially, the goal is to use advanced technology to identify potential biomarkers for cancer prediction, through a novel computer-aided diagnostic tool for the classification of cancer samples using gene expression data. The model is known as JMR-CR with waterfall GWO and operates in two distinct phases. In the initial phase, the JMI-CR algorithm is employed to select the most relevant features from the dataset by utilizing a novel mutual information technique called joint mutual information. In the second phase, the waterfall grey wolf optimization algorithm is used to identify the optimal features. To assess the performance of the proposed model is evaluated using two classification algorithms, namely support vector machine (SVM) and K nearest neighbor (KNN). The proposed model offers several advantages. It addresses the challenges posed by higher dimensionality and class imbalance problems by utilizing the waterfall GWO model, resulting in increased classification accuracy. The model is tested on various cancer microarray gene expression datasets, and the experimental results demonstrate that the proposed hybrid model outperforms other existing models in terms of generalization performance and testing accuracy.\",\"PeriodicalId\":170044,\"journal\":{\"name\":\"2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)\",\"volume\":\"333 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APSIT58554.2023.10201689\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIT58554.2023.10201689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature Selection With Novel Mutual Information and Binary Grey Wolf Waterfall Model
This article aims to identify a way to predict cancer by analyzing gene expression data from microarrays. The focus is on selecting specific features through metaheuristic search algorithms that can help determine the optimal global and local features using population and neighborhood-based methods. Essentially, the goal is to use advanced technology to identify potential biomarkers for cancer prediction, through a novel computer-aided diagnostic tool for the classification of cancer samples using gene expression data. The model is known as JMR-CR with waterfall GWO and operates in two distinct phases. In the initial phase, the JMI-CR algorithm is employed to select the most relevant features from the dataset by utilizing a novel mutual information technique called joint mutual information. In the second phase, the waterfall grey wolf optimization algorithm is used to identify the optimal features. To assess the performance of the proposed model is evaluated using two classification algorithms, namely support vector machine (SVM) and K nearest neighbor (KNN). The proposed model offers several advantages. It addresses the challenges posed by higher dimensionality and class imbalance problems by utilizing the waterfall GWO model, resulting in increased classification accuracy. The model is tested on various cancer microarray gene expression datasets, and the experimental results demonstrate that the proposed hybrid model outperforms other existing models in terms of generalization performance and testing accuracy.