Attila Egri, I. Horváth, Ferenc Kovács, Roland Molontay, K. Varga
2017 IEEE 21st International Conference on Intelligent Engineering Systems (INES), 2017-10-01. DOI: 10.1109/ines.2017.8118563 (https://doi.org/10.1109/ines.2017.8118563)
Cross-correlation based clustering and dimension reduction of multivariate time series
In this paper, we investigate dimension reduction possibilities for multidimensional time series data and introduce a graph-based clustering approach that uses the cross-correlation between time series. The proposed solution consists of two main steps: a novel similarity measure for quantifying cross-correlation, and a graph-based clustering technique. We compare both parts to existing techniques, including with respect to noise tolerance, and find that our solution performs better in a noisy environment. We apply the proposed solution to the performance metrics of a specific data processing system in order to identify and efficiently visualize connections among the collected metrics. The introduced method provides a more balanced clustering than classic methods and is suitable for revealing dependencies and connections among performance-metric time series.
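The two-step idea described in the abstract (a cross-correlation based similarity, followed by clustering on a graph) can be illustrated with a minimal sketch. The abstract does not define the paper's similarity measure or its clustering technique, so the code below substitutes two simple stand-ins and should not be read as the authors' method: the similarity is assumed to be the largest absolute Pearson correlation over a window of lags, and the clustering is assumed to be the connected components of a thresholded similarity graph. The names `max_abs_cross_correlation` and `cluster_by_threshold`, and the parameters `max_lag` and `threshold`, are hypothetical.

```python
import numpy as np


def max_abs_cross_correlation(x, y, max_lag):
    """Largest |Pearson r| between x and y over lags in [-max_lag, max_lag].

    Stand-in similarity (an assumption, not the paper's measure).
    Both series must have the same length.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(x)
    best = 0.0
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            a, b = x[lag:], y[:n - lag]   # pairs x[t + lag] with y[t]
        else:
            a, b = x[:n + lag], y[-lag:]  # pairs x[t] with y[t - lag]
        if a.std() == 0 or b.std() == 0:
            continue  # correlation is undefined for a constant segment
        best = max(best, abs(np.corrcoef(a, b)[0, 1]))
    return best


def cluster_by_threshold(series, max_lag=5, threshold=0.8):
    """Link series whose similarity is >= threshold and return one label per
    series, given by the connected component it falls into (union-find)."""
    m = len(series)
    parent = list(range(m))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(m):
        for j in range(i + 1, m):
            sim = max_abs_cross_correlation(series[i], series[j], max_lag)
            if sim >= threshold:
                parent[find(i)] = find(j)

    # Relabel component roots as 0, 1, 2, ... in order of first appearance.
    relabel = {}
    return [relabel.setdefault(find(i), len(relabel)) for i in range(m)]


# Synthetic data (made up for illustration, not from the paper).
rng = np.random.default_rng(0)
t = np.arange(200)
base = np.sin(2 * np.pi * t / 25)
shifted = np.roll(base, 3) + 0.05 * rng.standard_normal(200)  # lagged copy
other = rng.standard_normal(200)                               # unrelated noise
other_neg = -2.0 * other + 0.05 * rng.standard_normal(200)     # inverted, scaled copy

series = [base, shifted, other, other_neg]
labels = cluster_by_threshold(series, max_lag=5, threshold=0.8)
print(labels)
```

In this toy setup the lagged sine pair and the inverted noise pair should each end up in their own cluster. Keeping one representative series per cluster label is then one simple way to reduce the number of dimensions, under the assumption that strongly cross-correlated metrics carry largely redundant information.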