Jinglong Wang , Yu Zhang , Changju Liu , Jiangtao Xu
{"title":"AHDPC:自适应双曲密度峰值聚类","authors":"Jinglong Wang , Yu Zhang , Changju Liu , Jiangtao Xu","doi":"10.1016/j.eswa.2025.130065","DOIUrl":null,"url":null,"abstract":"<div><div>Non-uniformly distributed datasets are common in real-world, and density peak clustering (DPC) methods are used on these datasets due to their superior clustering performance. However, existing DPC relies on linearly growing Euclidean distance, causing misleading similarity between points from different clusters and limiting the improvement of accuracy. To overcome this limitation, this study introduces an adaptive hyperbolic density peak clustering algorithm (AHDPC) by extending DPC into hyperbolic space. First, linear Euclidean distance is replaced with exponentially growing hyperbolic distance to enhance density difference between different points. Then, to overcome the misclassification of points at the junction of high-density and low-density regions and errors from extreme hyperbolic distance, a novel adaptive weighting strategy is proposed, it dynamically adjusts hyperbolic distance by building the trace of the global covariance matrix, the Euclidean norm, the maximum pairwise distance, and point-to-center deviation. Finally, an adaptively cutoff distance method based on a segmented search strategy is developed to eliminate manual tuning, and an exponential density function replaces the gaussian kernel to improve computational efficiency. AHDPC not only overcomes the deficiencies of Euclidean space but also mitigates the restrictive aspects of hyperbolic space. Extensive experiments on synthetic and real datasets, the olivetti faces dataset, and medical image datasets demonstrate that AHDPC outperforms state-of-the-art methods in clustering accuracy. Results also show that AHDPC produces more discriminative decision graph for identifying cluster centers and enhances the accuracy of categorisation of non-center points. The advantages of its robustness and adaptive weight in improving the clustering performance are also confirmed.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"299 ","pages":"Article 130065"},"PeriodicalIF":7.5000,"publicationDate":"2025-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AHDPC: Adaptively hyperbolic density peak clustering\",\"authors\":\"Jinglong Wang , Yu Zhang , Changju Liu , Jiangtao Xu\",\"doi\":\"10.1016/j.eswa.2025.130065\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Non-uniformly distributed datasets are common in real-world, and density peak clustering (DPC) methods are used on these datasets due to their superior clustering performance. However, existing DPC relies on linearly growing Euclidean distance, causing misleading similarity between points from different clusters and limiting the improvement of accuracy. To overcome this limitation, this study introduces an adaptive hyperbolic density peak clustering algorithm (AHDPC) by extending DPC into hyperbolic space. First, linear Euclidean distance is replaced with exponentially growing hyperbolic distance to enhance density difference between different points. Then, to overcome the misclassification of points at the junction of high-density and low-density regions and errors from extreme hyperbolic distance, a novel adaptive weighting strategy is proposed, it dynamically adjusts hyperbolic distance by building the trace of the global covariance matrix, the Euclidean norm, the maximum pairwise distance, and point-to-center deviation. Finally, an adaptively cutoff distance method based on a segmented search strategy is developed to eliminate manual tuning, and an exponential density function replaces the gaussian kernel to improve computational efficiency. AHDPC not only overcomes the deficiencies of Euclidean space but also mitigates the restrictive aspects of hyperbolic space. Extensive experiments on synthetic and real datasets, the olivetti faces dataset, and medical image datasets demonstrate that AHDPC outperforms state-of-the-art methods in clustering accuracy. Results also show that AHDPC produces more discriminative decision graph for identifying cluster centers and enhances the accuracy of categorisation of non-center points. The advantages of its robustness and adaptive weight in improving the clustering performance are also confirmed.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"299 \",\"pages\":\"Article 130065\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-10-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425036814\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425036814","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
AHDPC: Adaptively hyperbolic density peak clustering
Non-uniformly distributed datasets are common in real-world, and density peak clustering (DPC) methods are used on these datasets due to their superior clustering performance. However, existing DPC relies on linearly growing Euclidean distance, causing misleading similarity between points from different clusters and limiting the improvement of accuracy. To overcome this limitation, this study introduces an adaptive hyperbolic density peak clustering algorithm (AHDPC) by extending DPC into hyperbolic space. First, linear Euclidean distance is replaced with exponentially growing hyperbolic distance to enhance density difference between different points. Then, to overcome the misclassification of points at the junction of high-density and low-density regions and errors from extreme hyperbolic distance, a novel adaptive weighting strategy is proposed, it dynamically adjusts hyperbolic distance by building the trace of the global covariance matrix, the Euclidean norm, the maximum pairwise distance, and point-to-center deviation. Finally, an adaptively cutoff distance method based on a segmented search strategy is developed to eliminate manual tuning, and an exponential density function replaces the gaussian kernel to improve computational efficiency. AHDPC not only overcomes the deficiencies of Euclidean space but also mitigates the restrictive aspects of hyperbolic space. Extensive experiments on synthetic and real datasets, the olivetti faces dataset, and medical image datasets demonstrate that AHDPC outperforms state-of-the-art methods in clustering accuracy. Results also show that AHDPC produces more discriminative decision graph for identifying cluster centers and enhances the accuracy of categorisation of non-center points. The advantages of its robustness and adaptive weight in improving the clustering performance are also confirmed.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.