{"title":"Intrinsic K-means clustering over homogeneous manifolds","authors":"Chao Tan, Huan Zhao, Han Ding","doi":"10.1007/s10044-024-01330-8","DOIUrl":null,"url":null,"abstract":"<p>The original K-means algorithm is widely applied for clustering in Euclidean spaces. Nevertheless, due to the non-flat characteristics of the Riemannian manifold, standard Euclidean K-means algorithms yield inferior results on such data. To address this issue, this paper presents an intrinsic K-means clustering algorithm on homogeneous manifolds based on the geodesic distance. It allows the development of K-means-based methods for frequently occurring non-vector spaces in robotics, such as directional vector modelling <span>\\(\\mathbb {S}^2\\)</span> and pose estimation <span>\\(\\mathbb {S}^3\\)</span>. First, the Riemannian metric of the homogeneous manifold is delivered; on this basis, the intrinsic K-means is proposed using Karcher mean, and its convergence is proved. Then, differences between the proposed algorithm and four projection-based algorithms, such as embedding projection, stereographic projection, central projection and logarithmic projection, are discussed by investigating their distance preservation on manifolds. Finally, to evaluate the effectiveness of the proposed algorithm, it is compared with the projection-based algorithms on <span>\\(\\mathbb {S}^n\\)</span>. The results show that the intrinsic K-means achieves better clustering results, where the clustering accuracy of the proposed method is improved by 47% and 27% on average on artificial <span>\\(\\mathbb {S}^2\\)</span> and <span>\\(\\mathbb {S}^3\\)</span> datasets, respectively. Meanwhile, the noise immunity of the proposed algorithm becomes more evident with the noise ratio increase.</p>","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"15 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Analysis and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10044-024-01330-8","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The original K-means algorithm is widely applied for clustering in Euclidean spaces. Nevertheless, due to the non-flat characteristics of the Riemannian manifold, standard Euclidean K-means algorithms yield inferior results on such data. To address this issue, this paper presents an intrinsic K-means clustering algorithm on homogeneous manifolds based on the geodesic distance. It allows the development of K-means-based methods for frequently occurring non-vector spaces in robotics, such as directional vector modelling \(\mathbb {S}^2\) and pose estimation \(\mathbb {S}^3\). First, the Riemannian metric of the homogeneous manifold is delivered; on this basis, the intrinsic K-means is proposed using Karcher mean, and its convergence is proved. Then, differences between the proposed algorithm and four projection-based algorithms, such as embedding projection, stereographic projection, central projection and logarithmic projection, are discussed by investigating their distance preservation on manifolds. Finally, to evaluate the effectiveness of the proposed algorithm, it is compared with the projection-based algorithms on \(\mathbb {S}^n\). The results show that the intrinsic K-means achieves better clustering results, where the clustering accuracy of the proposed method is improved by 47% and 27% on average on artificial \(\mathbb {S}^2\) and \(\mathbb {S}^3\) datasets, respectively. Meanwhile, the noise immunity of the proposed algorithm becomes more evident with the noise ratio increase.
期刊介绍:
The journal publishes high quality articles in areas of fundamental research in intelligent pattern analysis and applications in computer science and engineering. It aims to provide a forum for original research which describes novel pattern analysis techniques and industrial applications of the current technology. In addition, the journal will also publish articles on pattern analysis applications in medical imaging. The journal solicits articles that detail new technology and methods for pattern recognition and analysis in applied domains including, but not limited to, computer vision and image processing, speech analysis, robotics, multimedia, document analysis, character recognition, knowledge engineering for pattern recognition, fractal analysis, and intelligent control. The journal publishes articles on the use of advanced pattern recognition and analysis methods including statistical techniques, neural networks, genetic algorithms, fuzzy pattern recognition, machine learning, and hardware implementations which are either relevant to the development of pattern analysis as a research area or detail novel pattern analysis applications. Papers proposing new classifier systems or their development, pattern analysis systems for real-time applications, fuzzy and temporal pattern recognition and uncertainty management in applied pattern recognition are particularly solicited.