{"title":"Fuzzy clustering of time series based on trend feature information granulation","authors":"Bin Yu, Chongyan Wu","doi":"10.1016/j.fss.2025.109522","DOIUrl":null,"url":null,"abstract":"<div><div>Clustering is a means to mine valuable information from complex and massive time series data sets, and information granulation is a new strategy to simulate human thinking and solve complex problems. The combination of the two provides a new perspective for knowledge discovery of time series. In this paper, the feature extraction of time series is carried out, and the trend feature is abstractly represented by information granulation to reduce the data scale. Then, a fuzzy C-means clustering algorithm of time series based on trend feature information granules is proposed. First, hodrick prescott (HP) filtering is used to process the raw time series data, removing noise and redundancy. Secondly, the global continuous fitting function is obtained by polynomial curve fitting (PCF) to the time series data. Thirdly, the polynomial function derivative (PFD) values of each time point are obtained by the function as the trend feature, and the time series is transformed into the trend feature series. Then, the feature sequence is segmented optimally. According to the principle of reasonable particle size, the feature sequence is represented by a group of information granule, and the dimensionality is reduced to transform the trend feature information granule (TFIG) sequence. Finally, the fuzzy clustering of time series is realized in the transformed representation space (information granule space). In the process of clustering, a trend feature information granule-based dynamic time warping (TFIG-DTW) algorithm is developed for calculating the distance of two equal-length or unequal-length granular time series, and weighted DTW barycenter averaging (wDBA) is extended to fuzzy C-means (FCM) algorithm to update cluster prototype. Finally, UCR time series database and stock data set are used as experimental objects to verify the effectiveness and superiority of fuzzy clustering method.</div></div>","PeriodicalId":55130,"journal":{"name":"Fuzzy Sets and Systems","volume":"519 ","pages":"Article 109522"},"PeriodicalIF":2.7000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuzzy Sets and Systems","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165011425002611","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Clustering is a means to mine valuable information from complex and massive time series data sets, and information granulation is a new strategy to simulate human thinking and solve complex problems. The combination of the two provides a new perspective for knowledge discovery of time series. In this paper, the feature extraction of time series is carried out, and the trend feature is abstractly represented by information granulation to reduce the data scale. Then, a fuzzy C-means clustering algorithm of time series based on trend feature information granules is proposed. First, hodrick prescott (HP) filtering is used to process the raw time series data, removing noise and redundancy. Secondly, the global continuous fitting function is obtained by polynomial curve fitting (PCF) to the time series data. Thirdly, the polynomial function derivative (PFD) values of each time point are obtained by the function as the trend feature, and the time series is transformed into the trend feature series. Then, the feature sequence is segmented optimally. According to the principle of reasonable particle size, the feature sequence is represented by a group of information granule, and the dimensionality is reduced to transform the trend feature information granule (TFIG) sequence. Finally, the fuzzy clustering of time series is realized in the transformed representation space (information granule space). In the process of clustering, a trend feature information granule-based dynamic time warping (TFIG-DTW) algorithm is developed for calculating the distance of two equal-length or unequal-length granular time series, and weighted DTW barycenter averaging (wDBA) is extended to fuzzy C-means (FCM) algorithm to update cluster prototype. Finally, UCR time series database and stock data set are used as experimental objects to verify the effectiveness and superiority of fuzzy clustering method.
期刊介绍:
Since its launching in 1978, the journal Fuzzy Sets and Systems has been devoted to the international advancement of the theory and application of fuzzy sets and systems. The theory of fuzzy sets now encompasses a well organized corpus of basic notions including (and not restricted to) aggregation operations, a generalized theory of relations, specific measures of information content, a calculus of fuzzy numbers. Fuzzy sets are also the cornerstone of a non-additive uncertainty theory, namely possibility theory, and of a versatile tool for both linguistic and numerical modeling: fuzzy rule-based systems. Numerous works now combine fuzzy concepts with other scientific disciplines as well as modern technologies.
In mathematics fuzzy sets have triggered new research topics in connection with category theory, topology, algebra, analysis. Fuzzy sets are also part of a recent trend in the study of generalized measures and integrals, and are combined with statistical methods. Furthermore, fuzzy sets have strong logical underpinnings in the tradition of many-valued logics.