{"title":"Efficient Tensor Completion for Internet Traffic Data Recovery","authors":"Yuanyuan Li, K. Yu, Xiaofei Wu","doi":"10.1145/3291842.3291912","DOIUrl":null,"url":null,"abstract":"In real-world traffic data, missing data is inevitable for traffic collection problem and some other reasons. Therefore, it is increasingly critical to complete the whole traffic data. To make full use of hidden spatial-temporal structures of the Internet traffic data, we extend to traffic matrix (user, server) to a three-way tensor (user, server, time). This paper proposed a novel algorithm to recover the missing data, named TSVT (Tucker decomposition and tensor singular value thresholding). We exploit the proposed the Tucker decomposition and low rank tensor completion (LRTC). Based on LRTC, we extend the known singular value thresholding (SVT) to the tensor case, and combine Tucker decomposition to optimize the performance of the algorithm. Based on Internet traffic data, we model two different tensors, and apply our TSVT to infer the unobserved data of the traffic. Furthermore, we apply TSVT algorithm to our Internet data. Numerical experiments on Synthetic data verify our method and the application for completing DPI traffic data proves the effectiveness of the method.","PeriodicalId":283197,"journal":{"name":"Proceedings of the 2nd International Conference on Telecommunications and Communication Engineering","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd International Conference on Telecommunications and Communication Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3291842.3291912","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In real-world traffic data, missing data is inevitable for traffic collection problem and some other reasons. Therefore, it is increasingly critical to complete the whole traffic data. To make full use of hidden spatial-temporal structures of the Internet traffic data, we extend to traffic matrix (user, server) to a three-way tensor (user, server, time). This paper proposed a novel algorithm to recover the missing data, named TSVT (Tucker decomposition and tensor singular value thresholding). We exploit the proposed the Tucker decomposition and low rank tensor completion (LRTC). Based on LRTC, we extend the known singular value thresholding (SVT) to the tensor case, and combine Tucker decomposition to optimize the performance of the algorithm. Based on Internet traffic data, we model two different tensors, and apply our TSVT to infer the unobserved data of the traffic. Furthermore, we apply TSVT algorithm to our Internet data. Numerical experiments on Synthetic data verify our method and the application for completing DPI traffic data proves the effectiveness of the method.
在现实交通数据中,由于交通采集问题等原因,数据缺失是不可避免的。因此,完整的交通数据变得越来越重要。为了充分利用互联网流量数据隐藏的时空结构,我们将流量矩阵(用户、服务器)扩展为一个三向张量(用户、服务器、时间)。本文提出了一种新的恢复缺失数据的算法,称为TSVT (Tucker decomposition and tensor singular value threshold)。我们利用提出的Tucker分解和低秩张量补全(LRTC)。在LRTC的基础上,将已知的奇异值阈值(SVT)扩展到张量情况,并结合Tucker分解优化算法的性能。基于互联网流量数据,我们建立了两个不同的张量模型,并应用我们的TSVT来推断流量的未观测数据。此外,我们将TSVT算法应用于我们的互联网数据。在综合数据上的数值实验验证了该方法的有效性,在DPI交通数据上的应用也证明了该方法的有效性。