{"title":"Clustering DNA methylation expressions using nonparametric beta mixture model","authors":"Lin Zhang, Jia Meng, Hui Liu, Yufei Huang","doi":"10.1109/GENSiPS.2011.6169472","DOIUrl":null,"url":null,"abstract":"The problem of defining the clustering structure in DNA methylation expressions is considered. A Dirichlet process beta mixture model (DPBMM) is proposed that models the DNA methylation data array. The model allows automatic learning of the cluster structure parameters such as the cluster mixing proportion, the models of each cluster, and especially the number of clusters. To enable the learning, we proposed a Gibbs sampling algorithm for computing the posterior distributions, hence the estimates of the parameters. We investigate the performance of the proposed clustering algorithm via simulation.","PeriodicalId":181666,"journal":{"name":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GENSiPS.2011.6169472","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The problem of defining the clustering structure in DNA methylation expressions is considered. A Dirichlet process beta mixture model (DPBMM) is proposed that models the DNA methylation data array. The model allows automatic learning of the cluster structure parameters such as the cluster mixing proportion, the models of each cluster, and especially the number of clusters. To enable the learning, we proposed a Gibbs sampling algorithm for computing the posterior distributions, hence the estimates of the parameters. We investigate the performance of the proposed clustering algorithm via simulation.