{"title":"An integrated interpretation and clustering model based on attribute grouping","authors":"Liang Chen, Leming Sun, Caiming Zhong","doi":"10.1007/s10489-025-06262-2","DOIUrl":null,"url":null,"abstract":"<div><p>Clustering is a technique in unsupervised learning used to group unlabeled data. However, traditional clustering algorithms cannot provide explanations for the clustering process and its results, which limits their applicability in certain fields. Existing methods to address the lack of interpretability in clustering algorithms typically focus on explaining the results after the clustering process is complete. Few studies explore embedding interpretability directly into the clustering process, and most of these methods rely on data prototypes to express interpretability, which often leads to explanations that are not intuitive and user-friendly. To address this, a feature-based method is proposed to embed interpretability into the clustering process. This approach provides users with intuitive and easy-to-understand explanations and introduces a new direction for research on embedding interpretability into clustering. The method operates in two stages: in the first stage, all attributes are grouped; in the second stage, an optimization formula is used to complete both the clustering and the weighting of each attribute group. The proposed method was evaluated on multiple synthetic and real-world datasets and compared with other methods. The experimental results show that the method improves clustering accuracy by approximately 5 percent and interpretability by around 40 percent compared to existing approaches.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 7","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10489-025-06262-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06262-2","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Clustering is a technique in unsupervised learning used to group unlabeled data. However, traditional clustering algorithms cannot provide explanations for the clustering process and its results, which limits their applicability in certain fields. Existing methods to address the lack of interpretability in clustering algorithms typically focus on explaining the results after the clustering process is complete. Few studies explore embedding interpretability directly into the clustering process, and most of these methods rely on data prototypes to express interpretability, which often leads to explanations that are not intuitive and user-friendly. To address this, a feature-based method is proposed to embed interpretability into the clustering process. This approach provides users with intuitive and easy-to-understand explanations and introduces a new direction for research on embedding interpretability into clustering. The method operates in two stages: in the first stage, all attributes are grouped; in the second stage, an optimization formula is used to complete both the clustering and the weighting of each attribute group. The proposed method was evaluated on multiple synthetic and real-world datasets and compared with other methods. The experimental results show that the method improves clustering accuracy by approximately 5 percent and interpretability by around 40 percent compared to existing approaches.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.