Genetic algorithm based detection of general linear biclusters
Cuong To, Alan Wee-Chung Liew
2014 International Conference on Machine Learning and Cybernetics, 2014-07-13
DOI: 10.1109/ICMLC.2014.7009667 (https://doi.org/10.1109/ICMLC.2014.7009667)
Citations: 4
Abstract
Clustering methods classify patterns into clusters using the entire set of attributes in the similarity measurement. In many cases, however, patterns are similar under only a subset of attributes. The class of methods that cluster patterns based on subsets of attributes is called biclustering. Biclustering simultaneously groups both the rows and the columns of a data matrix and has been applied in various fields, especially to gene expression data. However, the biclustering problem is inherently intractable and computationally complex. In recent years, several biclustering algorithms based on a linear coherent model have been proposed. In this paper, we introduce a novel GA-based algorithm that uses hyperplanes to describe the linear relationships between rows (genes) in a sub-matrix (bicluster). The performance of our algorithm is tested on simulated data and gene expression data and compared with several other biclustering methods.
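To make the idea of a linear bicluster concrete, the following minimal sketch checks whether a chosen set of rows and columns forms a sub-matrix in which every row is an affine function of the first selected row. This is only one simple reading of a "linear coherent" pattern, assumed here for illustration; it does not reproduce the paper's hyperplane model or its genetic-algorithm search, and the names `fit_affine`, `max_residual`, `is_linear_bicluster`, the tolerance `tol`, and the toy matrix `data` are all hypothetical.

```python
from typing import List, Sequence, Tuple


def fit_affine(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Least-squares fit of y ~= a * x + b; returns (a, b)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    var_x = sum((xi - mean_x) ** 2 for xi in x)
    if var_x == 0.0:
        # Constant reference row: only an offset can be fitted.
        return 0.0, mean_y
    cov_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    a = cov_xy / var_x
    return a, mean_y - a * mean_x


def max_residual(matrix: List[List[float]],
                 rows: Sequence[int],
                 cols: Sequence[int]) -> float:
    """Largest absolute deviation of any selected row from its affine fit
    against the first selected row, restricted to the selected columns."""
    if not rows or len(cols) < 2:
        raise ValueError("need at least one row and two columns")
    ref = [matrix[rows[0]][c] for c in cols]
    worst = 0.0
    for r in rows[1:]:
        y = [matrix[r][c] for c in cols]
        a, b = fit_affine(ref, y)
        worst = max(worst, max(abs(yi - (a * xi + b)) for xi, yi in zip(ref, y)))
    return worst


def is_linear_bicluster(matrix: List[List[float]],
                        rows: Sequence[int],
                        cols: Sequence[int],
                        tol: float = 1e-6) -> bool:
    """True if every selected row is (within tol) affine in the first one."""
    return max_residual(matrix, rows, cols) <= tol


# Toy matrix (made up for illustration): on columns 0..2,
# row 1 = 2 * row 0 + 1 and row 2 = -0.5 * row 0 + 1.
data = [
    [1.0, 2.0, 3.0, 9.0],
    [3.0, 5.0, 7.0, 0.0],
    [0.5, 0.0, -0.5, 4.0],
    [4.0, 1.0, 8.0, 2.0],
]
```

With this toy matrix, `is_linear_bicluster(data, [0, 1, 2], [0, 1, 2])` holds, while adding column 3 or row 3 to the selection breaks the pattern. It shows why a similarity measure computed over all attributes would miss such a group: the rows agree only on a subset of columns.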