Data Augmentation for Tensor Completion via Embedding Gradient Tracking

IF 3.7 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS
Han Deng, Yuhui Li, Songyou Xie, Yang Yang, Yaqin Liu
{"title":"Data Augmentation for Tensor Completion via Embedding Gradient Tracking","authors":"Han Deng, Yuhui Li, Songyou Xie, Yang Yang, Yaqin Liu","doi":"10.1109/CSCloud-EdgeCom58631.2023.00044","DOIUrl":null,"url":null,"abstract":"How can we enhance the performance of neural tensor completion models for sparse data recovery? The task of tensor completion is crucial for network monitoring since it is the basis of network operation and management. Neural tensor completion models can exploit the multi-dimensional structure of tensors to recover missing entries in partially observed tensors. However, they can produce inaccurate estimations as real-world tensors are very sparse. These models tend to overfit a small amount of data, leading to limited accuracy on recovered data. In this paper, we propose LightTracker, a general data augmentation framework that enhances the prediction accuracy of neural tensor completion models. Specifically, LightTracker first trains a neural tensor completion model and tracks the gradient from the view of embeddings. After training, we use the tracked entity gradient to determine the importance of unknown tensor entries. Then, we aggregate the entries’ importance to calculate the importance of each entity. Finally, LightTracker creates an augmented tensor by mixing observed data and predicted entries with the highest importance. Experiments conducted on two network traffic datasets show that LightTracker effectively enhanced the imputation accuracy of different tensor completion baselines, showing its generality and effectiveness.","PeriodicalId":56007,"journal":{"name":"Journal of Cloud Computing-Advances Systems and Applications","volume":"41 1","pages":"210-216"},"PeriodicalIF":3.7000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cloud Computing-Advances Systems and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/CSCloud-EdgeCom58631.2023.00044","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

How can we enhance the performance of neural tensor completion models for sparse data recovery? The task of tensor completion is crucial for network monitoring since it is the basis of network operation and management. Neural tensor completion models can exploit the multi-dimensional structure of tensors to recover missing entries in partially observed tensors. However, they can produce inaccurate estimations as real-world tensors are very sparse. These models tend to overfit a small amount of data, leading to limited accuracy on recovered data. In this paper, we propose LightTracker, a general data augmentation framework that enhances the prediction accuracy of neural tensor completion models. Specifically, LightTracker first trains a neural tensor completion model and tracks the gradient from the view of embeddings. After training, we use the tracked entity gradient to determine the importance of unknown tensor entries. Then, we aggregate the entries’ importance to calculate the importance of each entity. Finally, LightTracker creates an augmented tensor by mixing observed data and predicted entries with the highest importance. Experiments conducted on two network traffic datasets show that LightTracker effectively enhanced the imputation accuracy of different tensor completion baselines, showing its generality and effectiveness.
基于嵌入梯度跟踪的张量补全数据增强
如何提高神经张量补全模型在稀疏数据恢复中的性能?张量补全任务是网络运行管理的基础,对网络监控至关重要。神经张量补全模型可以利用张量的多维结构来恢复部分观测张量中的缺失项。然而,它们可能产生不准确的估计,因为真实世界的张量非常稀疏。这些模型倾向于过拟合少量数据,导致恢复数据的精度有限。在本文中,我们提出LightTracker,一个通用的数据增强框架,提高神经张量补全模型的预测精度。具体来说,LightTracker首先训练一个神经张量补全模型,并从嵌入的角度跟踪梯度。训练后,我们使用跟踪的实体梯度来确定未知张量条目的重要性。然后,我们汇总条目的重要性来计算每个实体的重要性。最后,LightTracker通过混合观测数据和最重要的预测条目来创建增强张量。在两个网络流量数据集上进行的实验表明,LightTracker有效地提高了不同张量补全基线的补全精度,显示了其通用性和有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Cloud Computing-Advances Systems and Applications
Journal of Cloud Computing-Advances Systems and Applications Computer Science-Computer Networks and Communications
CiteScore
6.80
自引率
7.50%
发文量
76
审稿时长
75 days
期刊介绍: The Journal of Cloud Computing: Advances, Systems and Applications (JoCCASA) will publish research articles on all aspects of Cloud Computing. Principally, articles will address topics that are core to Cloud Computing, focusing on the Cloud applications, the Cloud systems, and the advances that will lead to the Clouds of the future. Comprehensive review and survey articles that offer up new insights, and lay the foundations for further exploratory and experimental work, are also relevant.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信