{"title":"Transformer-driven multi-agent deep reinforcement learning based point cloud video transmissions","authors":"Hai Lin, Xianfu Chen","doi":"10.1145/3566099.3569006","DOIUrl":null,"url":null,"abstract":"The point cloud videos, a medium for representing natural content in AR/VR with point clouds, have attracted a wide range of attention for its characteristics and have the potential to be the next generation of video technology. Given the high data volume, the point cloud video raises the challenge of intelligent transmission and resource scheduling in multi-user scenarios under time-varying system conditions. In this paper, we propose a multi-agent deep reinforcement learning (DRL) approach to optimize the expected long-term multi-user QoE and adopt a Field of View (FoV) prediction model with Transformer for high-accuracy FoV prediction. Over the time horizon, the proposed approach learns to select the tiles of the corresponding video in accordance with a proposed well-defined QoE model capable of quantifying users' satisfaction for transmissions in an iterative way. Under various settings, extensive numerical experiments based on real throughput data traces and different computation capabilities data demonstrate that the proposed approach is effective for long-term multi-agent point cloud video transmissions.","PeriodicalId":272675,"journal":{"name":"Proceedings of the 1st Workshop on Digital Twin & Edge AI for Industrial IoT","volume":"117 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st Workshop on Digital Twin & Edge AI for Industrial IoT","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3566099.3569006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The point cloud videos, a medium for representing natural content in AR/VR with point clouds, have attracted a wide range of attention for its characteristics and have the potential to be the next generation of video technology. Given the high data volume, the point cloud video raises the challenge of intelligent transmission and resource scheduling in multi-user scenarios under time-varying system conditions. In this paper, we propose a multi-agent deep reinforcement learning (DRL) approach to optimize the expected long-term multi-user QoE and adopt a Field of View (FoV) prediction model with Transformer for high-accuracy FoV prediction. Over the time horizon, the proposed approach learns to select the tiles of the corresponding video in accordance with a proposed well-defined QoE model capable of quantifying users' satisfaction for transmissions in an iterative way. Under various settings, extensive numerical experiments based on real throughput data traces and different computation capabilities data demonstrate that the proposed approach is effective for long-term multi-agent point cloud video transmissions.
点云视频作为一种用点云来表现AR/VR中自然内容的媒介,因其自身的特点引起了广泛的关注,具有成为下一代视频技术的潜力。由于数据量大,点云视频对时变系统条件下多用户场景下的智能传输和资源调度提出了挑战。在本文中,我们提出了一种多智能体深度强化学习(DRL)方法来优化预期的长期多用户QoE,并采用具有Transformer的Field of View (FoV)预测模型进行高精度的FoV预测。在时间范围内,所提出的方法根据所提出的定义良好的QoE模型学习选择相应视频的贴片,该模型能够以迭代的方式量化用户对传输的满意度。在不同设置下,基于真实吞吐量数据轨迹和不同计算能力数据的大量数值实验表明,该方法对于多智能体点云视频的长期传输是有效的。