{"title":"Two-stage time-series clustering approach under reducing time cost requirement","authors":"N. Manakova, V. Tkachenko","doi":"10.1109/TCSET49122.2020.235513","DOIUrl":null,"url":null,"abstract":"Clustering is an essential task of unsupervised learning, which is valuable as a specific data mining tool and as an auxiliary stage of numerous highly demanded tasks, including recognizing structures, tuning of forecast parameters, detecting anomalies, and others. Significantly data-driven, especially of specific data such as time-series considered here, as well as with an impressive growth of the volume data, the computational cost becomes a vital critical issue. In the research presented, the authors developed a two-step approach to clustering based on the split of a massive dataset into two unequal parts under the control of the clusterability metric through the instance-based and feature-based combination of time-series clustering. The conducted experimental study on the well-known test data set confirmed the competitiveness of the proposed method under the conditions of the requirement to reduce time costs.","PeriodicalId":389689,"journal":{"name":"2020 IEEE 15th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 15th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TCSET49122.2020.235513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Clustering is an essential task of unsupervised learning, which is valuable as a specific data mining tool and as an auxiliary stage of numerous highly demanded tasks, including recognizing structures, tuning of forecast parameters, detecting anomalies, and others. Significantly data-driven, especially of specific data such as time-series considered here, as well as with an impressive growth of the volume data, the computational cost becomes a vital critical issue. In the research presented, the authors developed a two-step approach to clustering based on the split of a massive dataset into two unequal parts under the control of the clusterability metric through the instance-based and feature-based combination of time-series clustering. The conducted experimental study on the well-known test data set confirmed the competitiveness of the proposed method under the conditions of the requirement to reduce time costs.