Iterative rank based methods for clustering

Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003

Pub Date: 2003-08-11

DOI: 10.1109/CSB.2003.1227379

S. Perrey, H. Brinck, A. Zielesny

Citations: 1

Abstract

Recently, a new clustering algorithm was developed that is useful in phylogenetic systematics and taxonomy. It derives a hierarchy from (dis)similarity data in a simple and rather natural way by transforming a given dissimilarity iteratively. Each iteration step consists of ranking the objects under consideration according to their pairwise dissimilarity and calculating the Euclidean distance between the resulting rank vectors. We investigate alterations of this order of steps and also replace the Euclidean distance with standard statistical measures for series of estimates. We evaluate the resulting procedures on biological and other data sets whose underlying cluster systems differ in structure. This reveals the potential and the limits of this kind of iterative approach.
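The iteration step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`rank_row`, `rank_step`, `iterate_ranks`), the use of average ranks for ties, the stopping tolerance, and the iteration cap are all hypothetical choices that the abstract does not specify. The `distance` parameter is included only to show where the Euclidean distance could be swapped for another measure.

```python
import math


def rank_row(row):
    """Rank the values in one row (1-based); tied values share their mean rank.

    Tie handling by mean rank is an assumption, not taken from the paper.
    """
    order = sorted(range(len(row)), key=lambda j: row[j])
    ranks = [0.0] * len(row)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next value equals the current one.
        while end + 1 < len(order) and row[order[end + 1]] == row[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def rank_step(d, distance=math.dist):
    """One iteration: rank each object's dissimilarities, then compare rank vectors."""
    ranks = [rank_row(row) for row in d]
    n = len(d)
    return [[distance(ranks[i], ranks[j]) for j in range(n)] for i in range(n)]


def iterate_ranks(d, max_iter=50, tol=1e-12):
    """Repeat rank_step until the matrix stops changing or max_iter is reached."""
    for _ in range(max_iter):
        new_d = rank_step(d)
        unchanged = all(
            abs(a - b) <= tol
            for row_new, row_old in zip(new_d, d)
            for a, b in zip(row_new, row_old)
        )
        if unchanged:
            return new_d
        d = new_d
    return d


# Toy dissimilarity matrix with two obvious groups: {0, 1} and {2, 3}.
toy = [
    [0, 1, 10, 11],
    [1, 0, 9, 10],
    [10, 9, 0, 1],
    [11, 10, 1, 0],
]
result = iterate_ranks(toy)
```

On this toy input the transformed matrix keeps objects 0 and 1 (and 2 and 3) closer to each other than to the other group. A hierarchy would then be read off the transformed dissimilarity by a separate step, which is not shown here.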


