Flavio Corradini, Vincenzo Nucci, Marco Piangerelli, Barbara Re
{"title":"可解释漂移适应混合特征的在线聚类","authors":"Flavio Corradini, Vincenzo Nucci, Marco Piangerelli, Barbara Re","doi":"10.1016/j.iswa.2025.200510","DOIUrl":null,"url":null,"abstract":"<div><div>In the era of big data, the rapid pace and variability of information have become increasingly evident, particularly in areas like seasonal trends and manufacturing processes. The dynamic nature of the environments that produce these data means that their behavior is time-dependent. Consequently, treating data streams as static entities is no longer effective. This has led to the concept of data drift, which refers to shifts in data distribution over time. Stream processing algorithms are designed to detect these changes promptly and adjust to the newly emerging data patterns. In our research, we introduce FURAKI, an innovative online clustering algorithm that incorporates drift detection. It employs a binary tree structure and is capable of handling both single-feature and mixed-feature data from unbounded streams. We conducted extensive testing of FURAKI against state-of-the-art algorithms using various datasets. Our findings reveal that FURAKI outperforms the state-of-the-art algorithms in the considered datasets.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"26 ","pages":"Article 200510"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Online clustering with interpretable drift adaptation to mixed features\",\"authors\":\"Flavio Corradini, Vincenzo Nucci, Marco Piangerelli, Barbara Re\",\"doi\":\"10.1016/j.iswa.2025.200510\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In the era of big data, the rapid pace and variability of information have become increasingly evident, particularly in areas like seasonal trends and manufacturing processes. The dynamic nature of the environments that produce these data means that their behavior is time-dependent. Consequently, treating data streams as static entities is no longer effective. This has led to the concept of data drift, which refers to shifts in data distribution over time. Stream processing algorithms are designed to detect these changes promptly and adjust to the newly emerging data patterns. In our research, we introduce FURAKI, an innovative online clustering algorithm that incorporates drift detection. It employs a binary tree structure and is capable of handling both single-feature and mixed-feature data from unbounded streams. We conducted extensive testing of FURAKI against state-of-the-art algorithms using various datasets. Our findings reveal that FURAKI outperforms the state-of-the-art algorithms in the considered datasets.</div></div>\",\"PeriodicalId\":100684,\"journal\":{\"name\":\"Intelligent Systems with Applications\",\"volume\":\"26 \",\"pages\":\"Article 200510\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-04-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligent Systems with Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2667305325000365\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Systems with Applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667305325000365","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Online clustering with interpretable drift adaptation to mixed features
In the era of big data, the rapid pace and variability of information have become increasingly evident, particularly in areas like seasonal trends and manufacturing processes. The dynamic nature of the environments that produce these data means that their behavior is time-dependent. Consequently, treating data streams as static entities is no longer effective. This has led to the concept of data drift, which refers to shifts in data distribution over time. Stream processing algorithms are designed to detect these changes promptly and adjust to the newly emerging data patterns. In our research, we introduce FURAKI, an innovative online clustering algorithm that incorporates drift detection. It employs a binary tree structure and is capable of handling both single-feature and mixed-feature data from unbounded streams. We conducted extensive testing of FURAKI against state-of-the-art algorithms using various datasets. Our findings reveal that FURAKI outperforms the state-of-the-art algorithms in the considered datasets.