Yao Li , Ming Chi , Wei Lu , Xiaodong Liu , Witold Pedrycz
{"title":"基于模糊相似性和扩散增强的上下文亲和力的数据流聚类","authors":"Yao Li , Ming Chi , Wei Lu , Xiaodong Liu , Witold Pedrycz","doi":"10.1016/j.ins.2025.122690","DOIUrl":null,"url":null,"abstract":"<div><div>Data stream clustering provides an effective method for recognizing underlying patterns in potentially unbounded sequences of data objects. Existing data stream clustering methods primarily encounter two key issues: (1) the inadequate evaluation of relationships between data objects within fixed-size landmark windows, leading to degraded clustering quality; and (2) the absence of efficient mechanisms for transferring useful knowledge from previous windows to the current window, weakening the model’s adaptability to data stream evolution. To address these issues, a data stream clustering method based on axiomatic fuzzy set theory via a diffusion process is first proposed. First, the proposed method employs axiomatic fuzzy set theory to measure the relationships between data objects within the window, capturing similarity information to more accurately reveal the underlying data distribution. Second, an efficient diffusion process enhances pairwise affinities through contextual propagation, which significantly improves connectivity within clusters. Finally, the learned affinity matrix is applied to spectral clustering for data stream clustering. Over time, we update the dynamic set according to the distance between data objects and cluster centers. This dynamic set retains representative data objects and effectively transfers previously learned knowledge to the current landmark window. Experimental results on four datasets and seven algorithms demonstrate the effectiveness and robustness of the proposed method.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"723 ","pages":"Article 122690"},"PeriodicalIF":6.8000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data stream clustering via fuzzy similarity and diffusion-enhanced contextual affinity\",\"authors\":\"Yao Li , Ming Chi , Wei Lu , Xiaodong Liu , Witold Pedrycz\",\"doi\":\"10.1016/j.ins.2025.122690\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Data stream clustering provides an effective method for recognizing underlying patterns in potentially unbounded sequences of data objects. Existing data stream clustering methods primarily encounter two key issues: (1) the inadequate evaluation of relationships between data objects within fixed-size landmark windows, leading to degraded clustering quality; and (2) the absence of efficient mechanisms for transferring useful knowledge from previous windows to the current window, weakening the model’s adaptability to data stream evolution. To address these issues, a data stream clustering method based on axiomatic fuzzy set theory via a diffusion process is first proposed. First, the proposed method employs axiomatic fuzzy set theory to measure the relationships between data objects within the window, capturing similarity information to more accurately reveal the underlying data distribution. Second, an efficient diffusion process enhances pairwise affinities through contextual propagation, which significantly improves connectivity within clusters. Finally, the learned affinity matrix is applied to spectral clustering for data stream clustering. Over time, we update the dynamic set according to the distance between data objects and cluster centers. This dynamic set retains representative data objects and effectively transfers previously learned knowledge to the current landmark window. Experimental results on four datasets and seven algorithms demonstrate the effectiveness and robustness of the proposed method.</div></div>\",\"PeriodicalId\":51063,\"journal\":{\"name\":\"Information Sciences\",\"volume\":\"723 \",\"pages\":\"Article 122690\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0020025525008230\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525008230","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Data stream clustering via fuzzy similarity and diffusion-enhanced contextual affinity
Data stream clustering provides an effective method for recognizing underlying patterns in potentially unbounded sequences of data objects. Existing data stream clustering methods primarily encounter two key issues: (1) the inadequate evaluation of relationships between data objects within fixed-size landmark windows, leading to degraded clustering quality; and (2) the absence of efficient mechanisms for transferring useful knowledge from previous windows to the current window, weakening the model’s adaptability to data stream evolution. To address these issues, a data stream clustering method based on axiomatic fuzzy set theory via a diffusion process is first proposed. First, the proposed method employs axiomatic fuzzy set theory to measure the relationships between data objects within the window, capturing similarity information to more accurately reveal the underlying data distribution. Second, an efficient diffusion process enhances pairwise affinities through contextual propagation, which significantly improves connectivity within clusters. Finally, the learned affinity matrix is applied to spectral clustering for data stream clustering. Over time, we update the dynamic set according to the distance between data objects and cluster centers. This dynamic set retains representative data objects and effectively transfers previously learned knowledge to the current landmark window. Experimental results on four datasets and seven algorithms demonstrate the effectiveness and robustness of the proposed method.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.