{"title":"A clustering algorithm for gene expression data using wavelet packet decomposition","authors":"A. Rao","doi":"10.1109/ACSSC.2002.1197198","DOIUrl":null,"url":null,"abstract":"Mining large amounts of data and deriving meaning from the mined data in bioinformatics is a computationally intensive and relevant issue. In this paper, an efficient algorithm to cluster genes into similar 'functional' groups is presented. This is a technique for extracting and characterizing rhythmic expression profiles from genome-wide DNA micro-array hybridization data. These patterns are clues to discovering rhythmic genes implicated in cell-cycle, circadian, or other biological processes. These functionalities are discussed. A signal-processing approach to this problem is presented. The information theoretic criterion for identifying those genes exhibiting maximum variation in behavior is explored. The genes are clustered and then relationships are derived for the proposition of a temporal cell-cycle model governing regulatory behavior. The human fibroblast and yeast data set are presently considered for analysis.","PeriodicalId":284950,"journal":{"name":"Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACSSC.2002.1197198","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Mining large amounts of data and deriving meaning from the mined data in bioinformatics is a computationally intensive and relevant issue. In this paper, an efficient algorithm to cluster genes into similar 'functional' groups is presented. This is a technique for extracting and characterizing rhythmic expression profiles from genome-wide DNA micro-array hybridization data. These patterns are clues to discovering rhythmic genes implicated in cell-cycle, circadian, or other biological processes. These functionalities are discussed. A signal-processing approach to this problem is presented. The information theoretic criterion for identifying those genes exhibiting maximum variation in behavior is explored. The genes are clustered and then relationships are derived for the proposition of a temporal cell-cycle model governing regulatory behavior. The human fibroblast and yeast data set are presently considered for analysis.