Rec-PF: Data-Driven Large-Scale Deep Learning Recommendation Model Training Optimization Based on Tensor-Train Embedding Table With Photovoltaic Forecast

IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Yunfeng Li;Zheng Wang;Chenhao Ren;Xiaoming Hou;Shengli Zhang
{"title":"Rec-PF: Data-Driven Large-Scale Deep Learning Recommendation Model Training Optimization Based on Tensor-Train Embedding Table With Photovoltaic Forecast","authors":"Yunfeng Li;Zheng Wang;Chenhao Ren;Xiaoming Hou;Shengli Zhang","doi":"10.1109/TSMC.2024.3485960","DOIUrl":null,"url":null,"abstract":"Photovoltaic (PV) power forecasting is important for promoting the integration of renewable energy sources. However, neural network-based methods, particularly deep learning for PV power forecasting, face challenges with computational and memory requirements when dealing with industry-scale datasets. To address this, we introduce Rec-PF, a robust computational framework employing the tensor-train (TT) technique. This framework aims to streamline the training process of massive deep learning recommendation models (DLRMs) on constrained resources. Rec-PF employs a high-performance compressed embedding table, enhancing TT decomposition using key computing primitives. It serves as a drop-in replacement for the PyTorch API. Additionally, Rec-PF utilizes an index reordering technique to leverage local and global information from training inputs, thereby enhancing performance. Furthermore, Rec-PF adopts a pipeline training model, eliminating the need for communication between training workers and host memory. We are pioneers in applying DLRM to PV power prediction to reduce training time without compromising accuracy. Our approach demonstrates a twofold improvement in training time compared to methods that do not incorporate our approach. To better demonstrate the enhanced performance of the algorithm, we specifically compare its efficiency with other frameworks using datasets commonly employed in recommender systems. Comprehensive experiments indicate that Rec-PF is capable of processing the largest publicly accessible DLRM and PV datasets on a single GPU, offering a threefold acceleration compared to state-of-the-art DLRM and PV frameworks.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 1","pages":"573-586"},"PeriodicalIF":8.6000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10752362/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Photovoltaic (PV) power forecasting is important for promoting the integration of renewable energy sources. However, neural network-based methods, particularly deep learning for PV power forecasting, face challenges with computational and memory requirements when dealing with industry-scale datasets. To address this, we introduce Rec-PF, a robust computational framework employing the tensor-train (TT) technique. This framework aims to streamline the training process of massive deep learning recommendation models (DLRMs) on constrained resources. Rec-PF employs a high-performance compressed embedding table, enhancing TT decomposition using key computing primitives. It serves as a drop-in replacement for the PyTorch API. Additionally, Rec-PF utilizes an index reordering technique to leverage local and global information from training inputs, thereby enhancing performance. Furthermore, Rec-PF adopts a pipeline training model, eliminating the need for communication between training workers and host memory. We are pioneers in applying DLRM to PV power prediction to reduce training time without compromising accuracy. Our approach demonstrates a twofold improvement in training time compared to methods that do not incorporate our approach. To better demonstrate the enhanced performance of the algorithm, we specifically compare its efficiency with other frameworks using datasets commonly employed in recommender systems. Comprehensive experiments indicate that Rec-PF is capable of processing the largest publicly accessible DLRM and PV datasets on a single GPU, offering a threefold acceleration compared to state-of-the-art DLRM and PV frameworks.
基于张量-训练嵌入表和光伏预测的数据驱动大规模深度学习推荐模型训练优化
光伏发电功率预测对促进可再生能源并网具有重要意义。然而,基于神经网络的方法,特别是用于光伏发电预测的深度学习,在处理工业规模的数据集时,面临着计算和内存需求的挑战。为了解决这个问题,我们引入了Rec-PF,一种采用张量训练(TT)技术的鲁棒计算框架。该框架旨在简化在受限资源上的大规模深度学习推荐模型(dlrm)的训练过程。Rec-PF采用高性能压缩嵌入表,使用关键计算原语增强TT分解。它可以作为PyTorch API的临时替代品。此外,Rec-PF利用索引重新排序技术来利用来自训练输入的局部和全局信息,从而提高性能。此外,Rec-PF采用流水线培训模型,消除了培训工作者与主机内存之间的通信需求。我们是将DLRM应用于光伏功率预测的先驱,在不影响准确性的情况下减少培训时间。与没有采用我们方法的方法相比,我们的方法在训练时间上有了两倍的改进。为了更好地展示该算法的增强性能,我们特别使用推荐系统中常用的数据集将其与其他框架的效率进行了比较。综合实验表明,Rec-PF能够在单个GPU上处理最大的可公开访问的DLRM和PV数据集,与最先进的DLRM和PV框架相比,提供三倍的加速。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Systems Man Cybernetics-Systems
IEEE Transactions on Systems Man Cybernetics-Systems AUTOMATION & CONTROL SYSTEMS-COMPUTER SCIENCE, CYBERNETICS
CiteScore
18.50
自引率
11.50%
发文量
812
审稿时长
6 months
期刊介绍: The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信