{"title":"A particle swarm optimization based gene identification technique for classification of cancer subgroups","authors":"Subhajit Kar, Kaushik Das Sharma, M. Maitra","doi":"10.1109/CIEC.2016.7513800","DOIUrl":null,"url":null,"abstract":"Microarray gene expression data generally consist of huge number of genes compared to very less number of samples available. Therefore it is a stimulating task to identify a small subset of relevant genes from microarray gene expression data where the identified genes can solely be used for accurately classifying the cancer subgroups. Therefore, in this paper a computationally efficient but accurate gene identification technique has been proposed. At the onset the t-test method has been utilized to reduce the dimension of the dataset and then the proposed particle swarm optimization based approach has been employed to find useful genes. The proposed method has been applied on the small round blue cell tumor (SRBCT) data to classify the four subgroups specifically neuroblastoma, non-Hodgkin lymphoma, rhabdomyosarcoma and Ewing sarcoma. The results demonstrate that the proposed technique could identify only fourteen genes that can be efficiently exploited for the diagnostic prediction task with high accuracy.","PeriodicalId":443343,"journal":{"name":"2016 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIEC.2016.7513800","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Microarray gene expression data generally consist of huge number of genes compared to very less number of samples available. Therefore it is a stimulating task to identify a small subset of relevant genes from microarray gene expression data where the identified genes can solely be used for accurately classifying the cancer subgroups. Therefore, in this paper a computationally efficient but accurate gene identification technique has been proposed. At the onset the t-test method has been utilized to reduce the dimension of the dataset and then the proposed particle swarm optimization based approach has been employed to find useful genes. The proposed method has been applied on the small round blue cell tumor (SRBCT) data to classify the four subgroups specifically neuroblastoma, non-Hodgkin lymphoma, rhabdomyosarcoma and Ewing sarcoma. The results demonstrate that the proposed technique could identify only fourteen genes that can be efficiently exploited for the diagnostic prediction task with high accuracy.