Rec-PF: Data-Driven Large-Scale Deep Learning Recommendation Model Training Optimization Based on Tensor-Train Embedding Table With Photovoltaic Forecast

IF 8.6 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Systems Man Cybernetics-Systems Pub Date : 2024-11-13 DOI:10.1109/TSMC.2024.3485960

Yunfeng Li;Zheng Wang;Chenhao Ren;Xiaoming Hou;Shengli Zhang

{"title":"Rec-PF: Data-Driven Large-Scale Deep Learning Recommendation Model Training Optimization Based on Tensor-Train Embedding Table With Photovoltaic Forecast","authors":"Yunfeng Li;Zheng Wang;Chenhao Ren;Xiaoming Hou;Shengli Zhang","doi":"10.1109/TSMC.2024.3485960","DOIUrl":null,"url":null,"abstract":"Photovoltaic (PV) power forecasting is important for promoting the integration of renewable energy sources. However, neural network-based methods, particularly deep learning for PV power forecasting, face challenges with computational and memory requirements when dealing with industry-scale datasets. To address this, we introduce Rec-PF, a robust computational framework employing the tensor-train (TT) technique. This framework aims to streamline the training process of massive deep learning recommendation models (DLRMs) on constrained resources. Rec-PF employs a high-performance compressed embedding table, enhancing TT decomposition using key computing primitives. It serves as a drop-in replacement for the PyTorch API. Additionally, Rec-PF utilizes an index reordering technique to leverage local and global information from training inputs, thereby enhancing performance. Furthermore, Rec-PF adopts a pipeline training model, eliminating the need for communication between training workers and host memory. We are pioneers in applying DLRM to PV power prediction to reduce training time without compromising accuracy. Our approach demonstrates a twofold improvement in training time compared to methods that do not incorporate our approach. To better demonstrate the enhanced performance of the algorithm, we specifically compare its efficiency with other frameworks using datasets commonly employed in recommender systems. Comprehensive experiments indicate that Rec-PF is capable of processing the largest publicly accessible DLRM and PV datasets on a single GPU, offering a threefold acceleration compared to state-of-the-art DLRM and PV frameworks.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 1","pages":"573-586"},"PeriodicalIF":8.6000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10752362/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Photovoltaic (PV) power forecasting is important for promoting the integration of renewable energy sources. However, neural network-based methods, particularly deep learning for PV power forecasting, face challenges with computational and memory requirements when dealing with industry-scale datasets. To address this, we introduce Rec-PF, a robust computational framework employing the tensor-train (TT) technique. This framework aims to streamline the training process of massive deep learning recommendation models (DLRMs) on constrained resources. Rec-PF employs a high-performance compressed embedding table, enhancing TT decomposition using key computing primitives. It serves as a drop-in replacement for the PyTorch API. Additionally, Rec-PF utilizes an index reordering technique to leverage local and global information from training inputs, thereby enhancing performance. Furthermore, Rec-PF adopts a pipeline training model, eliminating the need for communication between training workers and host memory. We are pioneers in applying DLRM to PV power prediction to reduce training time without compromising accuracy. Our approach demonstrates a twofold improvement in training time compared to methods that do not incorporate our approach. To better demonstrate the enhanced performance of the algorithm, we specifically compare its efficiency with other frameworks using datasets commonly employed in recommender systems. Comprehensive experiments indicate that Rec-PF is capable of processing the largest publicly accessible DLRM and PV datasets on a single GPU, offering a threefold acceleration compared to state-of-the-art DLRM and PV frameworks.

查看原文本刊更多论文

基于张量-训练嵌入表和光伏预测的数据驱动大规模深度学习推荐模型训练优化

光伏发电功率预测对促进可再生能源并网具有重要意义。然而，基于神经网络的方法，特别是用于光伏发电预测的深度学习，在处理工业规模的数据集时，面临着计算和内存需求的挑战。为了解决这个问题，我们引入了Rec-PF，一种采用张量训练（TT）技术的鲁棒计算框架。该框架旨在简化在受限资源上的大规模深度学习推荐模型（dlrm）的训练过程。Rec-PF采用高性能压缩嵌入表，使用关键计算原语增强TT分解。它可以作为PyTorch API的临时替代品。此外，Rec-PF利用索引重新排序技术来利用来自训练输入的局部和全局信息，从而提高性能。此外，Rec-PF采用流水线培训模型，消除了培训工作者与主机内存之间的通信需求。我们是将DLRM应用于光伏功率预测的先驱，在不影响准确性的情况下减少培训时间。与没有采用我们方法的方法相比，我们的方法在训练时间上有了两倍的改进。为了更好地展示该算法的增强性能，我们特别使用推荐系统中常用的数据集将其与其他框架的效率进行了比较。综合实验表明，Rec-PF能够在单个GPU上处理最大的可公开访问的DLRM和PV数据集，与最先进的DLRM和PV框架相比，提供三倍的加速。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Systems Man Cybernetics-Systems AUTOMATION & CONTROL SYSTEMS-COMPUTER SCIENCE, CYBERNETICS

CiteScore

18.50

自引率

11.50%

发文量

812

审稿时长

6 months

期刊介绍： The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.