{"title":"降低时间成本要求的两阶段时间序列聚类方法","authors":"N. Manakova, V. Tkachenko","doi":"10.1109/TCSET49122.2020.235513","DOIUrl":null,"url":null,"abstract":"Clustering is an essential task of unsupervised learning, which is valuable as a specific data mining tool and as an auxiliary stage of numerous highly demanded tasks, including recognizing structures, tuning of forecast parameters, detecting anomalies, and others. Significantly data-driven, especially of specific data such as time-series considered here, as well as with an impressive growth of the volume data, the computational cost becomes a vital critical issue. In the research presented, the authors developed a two-step approach to clustering based on the split of a massive dataset into two unequal parts under the control of the clusterability metric through the instance-based and feature-based combination of time-series clustering. The conducted experimental study on the well-known test data set confirmed the competitiveness of the proposed method under the conditions of the requirement to reduce time costs.","PeriodicalId":389689,"journal":{"name":"2020 IEEE 15th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Two-stage time-series clustering approach under reducing time cost requirement\",\"authors\":\"N. Manakova, V. Tkachenko\",\"doi\":\"10.1109/TCSET49122.2020.235513\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering is an essential task of unsupervised learning, which is valuable as a specific data mining tool and as an auxiliary stage of numerous highly demanded tasks, including recognizing structures, tuning of forecast parameters, detecting anomalies, and others. Significantly data-driven, especially of specific data such as time-series considered here, as well as with an impressive growth of the volume data, the computational cost becomes a vital critical issue. In the research presented, the authors developed a two-step approach to clustering based on the split of a massive dataset into two unequal parts under the control of the clusterability metric through the instance-based and feature-based combination of time-series clustering. The conducted experimental study on the well-known test data set confirmed the competitiveness of the proposed method under the conditions of the requirement to reduce time costs.\",\"PeriodicalId\":389689,\"journal\":{\"name\":\"2020 IEEE 15th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 15th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TCSET49122.2020.235513\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 15th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TCSET49122.2020.235513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Two-stage time-series clustering approach under reducing time cost requirement
Clustering is an essential task of unsupervised learning, which is valuable as a specific data mining tool and as an auxiliary stage of numerous highly demanded tasks, including recognizing structures, tuning of forecast parameters, detecting anomalies, and others. Significantly data-driven, especially of specific data such as time-series considered here, as well as with an impressive growth of the volume data, the computational cost becomes a vital critical issue. In the research presented, the authors developed a two-step approach to clustering based on the split of a massive dataset into two unequal parts under the control of the clusterability metric through the instance-based and feature-based combination of time-series clustering. The conducted experimental study on the well-known test data set confirmed the competitiveness of the proposed method under the conditions of the requirement to reduce time costs.