微阵列数据分析的分形聚类

Lu-Yong Wang, A. Balasubramanian, A. Chakraborty, D. Comaniciu
{"title":"微阵列数据分析的分形聚类","authors":"Lu-Yong Wang, A. Balasubramanian, A. Chakraborty, D. Comaniciu","doi":"10.1109/CSBW.2005.66","DOIUrl":null,"url":null,"abstract":"DNA microarray experiments generate a substantial amount of information about global gene expression. Gene expression profiles can be represented as points in multi-dimensional space. It is essential to identify relevant groups of genes in biomedical research. Clustering is helpful in pattern recognition in gene expression profiles. Some clustering techniques have been introduced. However, these traditional methods mainly utilize shape-based assumption or distance metric to cluster the points in multi-dimension linear Euclidean space. Poor consistence with the functional annotation of genes is shown in their validation study. A fractal clustering method to cluster genes using intrinsic (fractal) dimension from modern geometry is proposed. Fractal dimension is used to characterize the degree of self similarity among the points in the clusters. The main idea of fractal clustering is to group points in a cluster in such a way that none of the points in the cluster changes the cluster's intrinsic dimension radically. Hausdorff fractal dimension is computed through the means of the box-counting plot algorithm, since it is the fastest and also robust enough. This method is assessed using validation assessment using public microarray dataset. It shows that this method is superior in identifying functional related gene groups than other traditional methods.","PeriodicalId":123531,"journal":{"name":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","volume":"658 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Fractal clustering for microarray data analysis\",\"authors\":\"Lu-Yong Wang, A. Balasubramanian, A. Chakraborty, D. Comaniciu\",\"doi\":\"10.1109/CSBW.2005.66\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"DNA microarray experiments generate a substantial amount of information about global gene expression. Gene expression profiles can be represented as points in multi-dimensional space. It is essential to identify relevant groups of genes in biomedical research. Clustering is helpful in pattern recognition in gene expression profiles. Some clustering techniques have been introduced. However, these traditional methods mainly utilize shape-based assumption or distance metric to cluster the points in multi-dimension linear Euclidean space. Poor consistence with the functional annotation of genes is shown in their validation study. A fractal clustering method to cluster genes using intrinsic (fractal) dimension from modern geometry is proposed. Fractal dimension is used to characterize the degree of self similarity among the points in the clusters. The main idea of fractal clustering is to group points in a cluster in such a way that none of the points in the cluster changes the cluster's intrinsic dimension radically. Hausdorff fractal dimension is computed through the means of the box-counting plot algorithm, since it is the fastest and also robust enough. This method is assessed using validation assessment using public microarray dataset. It shows that this method is superior in identifying functional related gene groups than other traditional methods.\",\"PeriodicalId\":123531,\"journal\":{\"name\":\"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)\",\"volume\":\"658 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSBW.2005.66\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSBW.2005.66","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

DNA微阵列实验产生了大量关于全球基因表达的信息。基因表达谱可以用多维空间中的点来表示。在生物医学研究中,识别相关的基因群是必不可少的。聚类有助于基因表达谱的模式识别。介绍了一些聚类技术。然而,这些传统方法主要是利用基于形状的假设或距离度量来聚类多维线性欧几里德空间中的点。验证研究显示,与基因功能注释的一致性较差。提出了一种基于现代几何的分形维数聚类基因的分形聚类方法。分形维数用来表征聚类中点之间的自相似程度。分形聚类的主要思想是对聚类中的点进行分组,使聚类中的任何一个点都不会从根本上改变聚类的内在维数。Hausdorff分形维数是通过盒数图算法计算的,因为它是最快的,也足够鲁棒。该方法通过使用公共微阵列数据集的验证评估进行评估。结果表明,该方法在鉴定功能相关基因群方面优于其他传统方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Fractal clustering for microarray data analysis
DNA microarray experiments generate a substantial amount of information about global gene expression. Gene expression profiles can be represented as points in multi-dimensional space. It is essential to identify relevant groups of genes in biomedical research. Clustering is helpful in pattern recognition in gene expression profiles. Some clustering techniques have been introduced. However, these traditional methods mainly utilize shape-based assumption or distance metric to cluster the points in multi-dimension linear Euclidean space. Poor consistence with the functional annotation of genes is shown in their validation study. A fractal clustering method to cluster genes using intrinsic (fractal) dimension from modern geometry is proposed. Fractal dimension is used to characterize the degree of self similarity among the points in the clusters. The main idea of fractal clustering is to group points in a cluster in such a way that none of the points in the cluster changes the cluster's intrinsic dimension radically. Hausdorff fractal dimension is computed through the means of the box-counting plot algorithm, since it is the fastest and also robust enough. This method is assessed using validation assessment using public microarray dataset. It shows that this method is superior in identifying functional related gene groups than other traditional methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信