{"title":"基于信息增益和Coati优化算法的基因选择微阵列基因表达数据分类。","authors":"Sarah Osama, A. Ali, Hassan Shaban","doi":"10.21608/kjis.2023.216661.1013","DOIUrl":null,"url":null,"abstract":"Gene expression data has become an essen2al tool for cancer classifica2on because it provides substan2al insights into the underlying mechanisms of cancer progression. However, the high-dimensional nature of microarray gene expression data presents a significant challenge. This paper introduces a new method called IG-COA, which combines Informa2on Gain (IG) approach and Coa2 Op2miza2on Algorithm (COA), to iden2fy the biomarkers genes. COA is a recent algorithm that has not been previously examined for feature or gene selec2on, to the best of our knowledge. Firstly, the IG method is used because using COA directly on microarray datasets is ineffec2ve and can make it challenging to train a classifier accurately. Secondly, the COA algorithm is u2lized to select the op2mal subset of genes from the previously selected ones. The effec2veness of the suggested IG-COA method with a Support Vector Machine is tested on several microarray gene expression datasets, and it exceeds other state-of-the-art methods.","PeriodicalId":115907,"journal":{"name":"Kafrelsheikh Journal of Information Sciences","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A hybrid of Information gain and a Coati Optimization Algorithm for gene selection in microarray gene expression data classification.\",\"authors\":\"Sarah Osama, A. Ali, Hassan Shaban\",\"doi\":\"10.21608/kjis.2023.216661.1013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Gene expression data has become an essen2al tool for cancer classifica2on because it provides substan2al insights into the underlying mechanisms of cancer progression. However, the high-dimensional nature of microarray gene expression data presents a significant challenge. This paper introduces a new method called IG-COA, which combines Informa2on Gain (IG) approach and Coa2 Op2miza2on Algorithm (COA), to iden2fy the biomarkers genes. COA is a recent algorithm that has not been previously examined for feature or gene selec2on, to the best of our knowledge. Firstly, the IG method is used because using COA directly on microarray datasets is ineffec2ve and can make it challenging to train a classifier accurately. Secondly, the COA algorithm is u2lized to select the op2mal subset of genes from the previously selected ones. The effec2veness of the suggested IG-COA method with a Support Vector Machine is tested on several microarray gene expression datasets, and it exceeds other state-of-the-art methods.\",\"PeriodicalId\":115907,\"journal\":{\"name\":\"Kafrelsheikh Journal of Information Sciences\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Kafrelsheikh Journal of Information Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21608/kjis.2023.216661.1013\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Kafrelsheikh Journal of Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/kjis.2023.216661.1013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
基因表达数据已经成为癌症分类的重要工具,因为它为癌症进展的潜在机制提供了实质性的见解。然而,微阵列基因表达数据的高维性提出了一个重大挑战。本文介绍了一种结合Informa2on Gain (IG)法和Coa2 Op2miza2on算法(COA)的生物标记基因鉴定新方法IG-COA。据我们所知,COA是一种最近的算法,以前还没有对特征或基因选择进行过研究。首先,使用IG方法是因为直接在微阵列数据集上使用COA是无效的,并且会给准确训练分类器带来挑战。其次,利用COA算法从先前选择的基因中选择最优基因子集;基于支持向量机的igg - coa方法的有效性在多个微阵列基因表达数据集上进行了测试,并且超过了其他最先进的方法。
A hybrid of Information gain and a Coati Optimization Algorithm for gene selection in microarray gene expression data classification.
Gene expression data has become an essen2al tool for cancer classifica2on because it provides substan2al insights into the underlying mechanisms of cancer progression. However, the high-dimensional nature of microarray gene expression data presents a significant challenge. This paper introduces a new method called IG-COA, which combines Informa2on Gain (IG) approach and Coa2 Op2miza2on Algorithm (COA), to iden2fy the biomarkers genes. COA is a recent algorithm that has not been previously examined for feature or gene selec2on, to the best of our knowledge. Firstly, the IG method is used because using COA directly on microarray datasets is ineffec2ve and can make it challenging to train a classifier accurately. Secondly, the COA algorithm is u2lized to select the op2mal subset of genes from the previously selected ones. The effec2veness of the suggested IG-COA method with a Support Vector Machine is tested on several microarray gene expression datasets, and it exceeds other state-of-the-art methods.