{"title":"基于聚类表示的时间序列动态时间翘曲改进模型","authors":"Mina Younan, E. H. Houssein, M. Elhoseny, A. Ali","doi":"10.1109/ICCES51560.2020.9334608","DOIUrl":null,"url":null,"abstract":"Revolution of Smart Things (SThs) connected to the Internet to build Internet of Things (IoT) applications, causes a flood of data streams every moment. Main root causes of massive SThs integration for increasing accuracy of sensed features and for enabling fault tolerance. In general, resulting deluge of real-time data streams has the property of five V of the big data (i.e., volume, velocity, variety, veracity, and value). Such properties make mining and analysis of massive and heterogeneous data be challenging tasks. In our previous work, we present three novel data reduction models based on Dynamic Time Warping (DTW) for enabling balanced indexing in the IoT. This paper presents two extensions to improve the Hybrid algorithm (ClRe 3.0) using DTW warped path. First extension (ClRe 3.1): targets improving accuracy of indexed clusters representatives by taking the average of individual warped items and keeping only 50% of the warped items for each warped slot. Second extension (ClRe 3.2): targets decreasing size of indexed clusters representatives as possible by compensating every warped slot by its corresponding item keeping only common items with minimum distances. The proposed extensions are explained using real samples and evaluated using Szeged-weather dataset as well. The evaluation results proves that ClRe 3.1 could enhance the accuracy of ClRe 3.0 by approximate 9% in average, keeping indexes sizes as possible as fitted (i.e., < the average length of all datasets). In case of indexing only highly common readings, ClRe 3.2 out-performs other extensions in decreasing indexes sizes.","PeriodicalId":247183,"journal":{"name":"2020 15th International Conference on Computer Engineering and Systems (ICCES)","volume":"157 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved Models for Time Series Cluster Representation Based Dynamic Time Warping\",\"authors\":\"Mina Younan, E. H. Houssein, M. Elhoseny, A. Ali\",\"doi\":\"10.1109/ICCES51560.2020.9334608\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Revolution of Smart Things (SThs) connected to the Internet to build Internet of Things (IoT) applications, causes a flood of data streams every moment. Main root causes of massive SThs integration for increasing accuracy of sensed features and for enabling fault tolerance. In general, resulting deluge of real-time data streams has the property of five V of the big data (i.e., volume, velocity, variety, veracity, and value). Such properties make mining and analysis of massive and heterogeneous data be challenging tasks. In our previous work, we present three novel data reduction models based on Dynamic Time Warping (DTW) for enabling balanced indexing in the IoT. This paper presents two extensions to improve the Hybrid algorithm (ClRe 3.0) using DTW warped path. First extension (ClRe 3.1): targets improving accuracy of indexed clusters representatives by taking the average of individual warped items and keeping only 50% of the warped items for each warped slot. Second extension (ClRe 3.2): targets decreasing size of indexed clusters representatives as possible by compensating every warped slot by its corresponding item keeping only common items with minimum distances. The proposed extensions are explained using real samples and evaluated using Szeged-weather dataset as well. The evaluation results proves that ClRe 3.1 could enhance the accuracy of ClRe 3.0 by approximate 9% in average, keeping indexes sizes as possible as fitted (i.e., < the average length of all datasets). In case of indexing only highly common readings, ClRe 3.2 out-performs other extensions in decreasing indexes sizes.\",\"PeriodicalId\":247183,\"journal\":{\"name\":\"2020 15th International Conference on Computer Engineering and Systems (ICCES)\",\"volume\":\"157 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 15th International Conference on Computer Engineering and Systems (ICCES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCES51560.2020.9334608\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 15th International Conference on Computer Engineering and Systems (ICCES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCES51560.2020.9334608","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved Models for Time Series Cluster Representation Based Dynamic Time Warping
Revolution of Smart Things (SThs) connected to the Internet to build Internet of Things (IoT) applications, causes a flood of data streams every moment. Main root causes of massive SThs integration for increasing accuracy of sensed features and for enabling fault tolerance. In general, resulting deluge of real-time data streams has the property of five V of the big data (i.e., volume, velocity, variety, veracity, and value). Such properties make mining and analysis of massive and heterogeneous data be challenging tasks. In our previous work, we present three novel data reduction models based on Dynamic Time Warping (DTW) for enabling balanced indexing in the IoT. This paper presents two extensions to improve the Hybrid algorithm (ClRe 3.0) using DTW warped path. First extension (ClRe 3.1): targets improving accuracy of indexed clusters representatives by taking the average of individual warped items and keeping only 50% of the warped items for each warped slot. Second extension (ClRe 3.2): targets decreasing size of indexed clusters representatives as possible by compensating every warped slot by its corresponding item keeping only common items with minimum distances. The proposed extensions are explained using real samples and evaluated using Szeged-weather dataset as well. The evaluation results proves that ClRe 3.1 could enhance the accuracy of ClRe 3.0 by approximate 9% in average, keeping indexes sizes as possible as fitted (i.e., < the average length of all datasets). In case of indexing only highly common readings, ClRe 3.2 out-performs other extensions in decreasing indexes sizes.