{"title":"Biclustering of Gene Expression Data Using Genetic Algorithm","authors":"Anupam Chakraborty, Hitashyam Maka","doi":"10.1109/CIBCB.2005.1594893","DOIUrl":null,"url":null,"abstract":"The biclustering problem of gene expression data deals with finding a subset of genes which exhibit similar expression patterns along a subset of conditions. Most of the current algorithms use a statistically predefined threshold as an input parameter for biclustering. This threshold defines the maximum allowable dissimilarity between the cells of a bicluster and is very hard to determine beforehand. Hence we propose two genetic algorithms that embed greedy algorithm as local search procedure and find the best biclusters independent of this threshold score. We also establish that the HScore of a bicluster under the additive model approximately follows chi-square distribution. We found that these genetic algorithms outperformed other greedy algorithms on yeast and lymphoma datasets.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"68","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2005.1594893","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 68
Abstract
The biclustering problem of gene expression data deals with finding a subset of genes which exhibit similar expression patterns along a subset of conditions. Most of the current algorithms use a statistically predefined threshold as an input parameter for biclustering. This threshold defines the maximum allowable dissimilarity between the cells of a bicluster and is very hard to determine beforehand. Hence we propose two genetic algorithms that embed greedy algorithm as local search procedure and find the best biclusters independent of this threshold score. We also establish that the HScore of a bicluster under the additive model approximately follows chi-square distribution. We found that these genetic algorithms outperformed other greedy algorithms on yeast and lymphoma datasets.