{"title":"A new clustering method for microarray data analysis","authors":"Louxin Zhang, Song Zhu","doi":"10.1109/CSB.2002.1039349","DOIUrl":null,"url":null,"abstract":"A novel clustering approach is introduced to overcome missing data and inconsistency of gene expression levels under different conditions in the stage of clustering. It is based on the so-called smooth score, which is defined for measuring the deviation of the expression level of a gene and the average expression level of all the genes involved under a condition. We present an efficient greedy algorithm for finding clusters with a smooth score below a threshold after studying its computational complexity. The algorithm was tested intensively on random matrices and yeast data. It was shown to perform it well in finding co-regulation patterns in a test with the yeast data.","PeriodicalId":87204,"journal":{"name":"Proceedings. IEEE Computer Society Bioinformatics Conference","volume":"1 1","pages":"268-275"},"PeriodicalIF":0.0000,"publicationDate":"2002-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/CSB.2002.1039349","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE Computer Society Bioinformatics Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSB.2002.1039349","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
A novel clustering approach is introduced to overcome missing data and inconsistency of gene expression levels under different conditions in the stage of clustering. It is based on the so-called smooth score, which is defined for measuring the deviation of the expression level of a gene and the average expression level of all the genes involved under a condition. We present an efficient greedy algorithm for finding clusters with a smooth score below a threshold after studying its computational complexity. The algorithm was tested intensively on random matrices and yeast data. It was shown to perform it well in finding co-regulation patterns in a test with the yeast data.