{"title":"Interactive Exploration of Hierarchical Density Clusters in Gene Expression Data","authors":"Tran Van Long, L. Linsen","doi":"10.1109/KSE.2010.22","DOIUrl":null,"url":null,"abstract":"Clustering gene expression data is an important task in bioinformatics research and biomedical applications. In this paper, we present an effective clustering algorithm for gene expression data. The clustering algorithm is based on the analysis of data's density distribution. We propose an intersecting partition of gene expression data into the supports of data points. Density clusters are maximally connected regions at certain density levels, and thus, can be organized in a hierarchical structure. For interactive visual exploration, we use a 2D radial layout of the hierarchical density cluster tree with linked as well as embedded views of parallel coordinates and heat maps. Our system supports the understanding of the distribution of density clusters and the patterns of the density clusters. Experimental results for common gene expression data sets shows the effectiveness and scalability of the algorithm.","PeriodicalId":158823,"journal":{"name":"2010 Second International Conference on Knowledge and Systems Engineering","volume":"35 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Second International Conference on Knowledge and Systems Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE.2010.22","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Clustering gene expression data is an important task in bioinformatics research and biomedical applications. In this paper, we present an effective clustering algorithm for gene expression data. The clustering algorithm is based on the analysis of data's density distribution. We propose an intersecting partition of gene expression data into the supports of data points. Density clusters are maximally connected regions at certain density levels, and thus, can be organized in a hierarchical structure. For interactive visual exploration, we use a 2D radial layout of the hierarchical density cluster tree with linked as well as embedded views of parallel coordinates and heat maps. Our system supports the understanding of the distribution of density clusters and the patterns of the density clusters. Experimental results for common gene expression data sets shows the effectiveness and scalability of the algorithm.