{"title":"Iterative rank based methods for clustering","authors":"S. Perrey, H. Brinck, A. Zielesny","doi":"10.1109/CSB.2003.1227379","DOIUrl":null,"url":null,"abstract":"Recently a new clustering algorithm was developed, useful in phylogenetic systematics and taxonomy. It derives a hierarchy from (dis)similarity data on a simple and rather natural way. It transforms a given dissimilarity by an iterative approach. Each iteration step consists of ranking the objects under consideration according to their pairwise dissimilarity and calculating the Euclidian distance of the resulting rank vectors. We investigate alterations of this order of steps as well as substitute the Euclidian distance by standard statistical measures for series of estimates. We evaluate the resulting different procedures on biological and other data sets of different structure regarding their underlying cluster systems. Thereby, potentials and limits of this kind of iterative approach become obvious.","PeriodicalId":147883,"journal":{"name":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSB.2003.1227379","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Recently a new clustering algorithm was developed, useful in phylogenetic systematics and taxonomy. It derives a hierarchy from (dis)similarity data on a simple and rather natural way. It transforms a given dissimilarity by an iterative approach. Each iteration step consists of ranking the objects under consideration according to their pairwise dissimilarity and calculating the Euclidian distance of the resulting rank vectors. We investigate alterations of this order of steps as well as substitute the Euclidian distance by standard statistical measures for series of estimates. We evaluate the resulting different procedures on biological and other data sets of different structure regarding their underlying cluster systems. Thereby, potentials and limits of this kind of iterative approach become obvious.