QoS based Multi-Agent vs. Single-Agent Deep Reinforcement Learning for V2X Resource Allocation

Shubhangi Bhadauria, L. Ravichandran, Elke Roth-Mandutz, Georg Fischer

2021 IEEE Symposium On Future Telecommunication Technologies (SOFTT), published 2021-12-06. DOI: 10.1109/SOFTT54252.2021.9673150
Autonomous driving requires Vehicle-to-Everything (V2X) communication as standardized by the 3rd Generation Partnership Project (3GPP). Diverse use cases and service types are foreseen to be supported, including safety-critical ones such as lane merging and cooperative collision avoidance. The quality-of-service (QoS) requirements of each service type vary enormously in latency, reliability, data rate, and positioning accuracy. In this paper, we analyze and evaluate the performance of a QoS-aware decentralized resource allocation scheme using, first, a single-agent reinforcement learning (SARL) approach and, second, a multi-agent reinforcement learning (MARL) approach. In addition, the impact of multiple vehicular user equipments (V-UEs) supporting one or multiple services is considered. The QoS parameters considered here are the latency and the relative distance between the communicating V-UEs, which are mapped onto the priority to reflect the required packet delay budget (PDB). The goal is to maximize the throughput of all V2N links while meeting the V2V links' latency constraints of the supported services. Results from a system-level simulation of an urban scenario show that, compared to SARL, MARL improves throughput for V-UEs set up for both single and multiple services. For latency, however, SARL shows advantages, at least when multiple services per V-UE apply.
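The decentralized allocation the abstract describes — each V-UE independently selecting radio resources while trying to meet its delay budget — can be illustrated with a toy multi-agent Q-learning sketch. This is not the paper's method (the paper uses deep RL with a QoS-aware reward over V2N throughput and V2V latency); the environment, reward, and all parameters below are simplified assumptions for illustration only: each of four V-UEs repeatedly picks one of four resource blocks, and a collision-free pick (a crude stand-in for meeting its throughput/PDB requirement) earns a positive reward, so the independent learners tend toward a non-conflicting allocation.

```python
import random

random.seed(0)

N_UES, N_RBS, EPISODES = 4, 4, 3000  # toy sizes, chosen for illustration


class QAgent:
    """Stateless tabular Q-learner: one agent per V-UE, one action per
    resource block (RB). A minimal sketch, not the paper's deep-RL agent."""

    def __init__(self, n_actions, alpha=0.1, eps=0.2):
        self.q = [0.0] * n_actions  # estimated value of each RB
        self.alpha, self.eps = alpha, eps

    def act(self):
        # Epsilon-greedy: explore a random RB, otherwise pick the best-valued one.
        if random.random() < self.eps:
            return random.randrange(len(self.q))
        return max(range(len(self.q)), key=self.q.__getitem__)

    def update(self, action, reward):
        # Incremental average toward the observed reward.
        self.q[action] += self.alpha * (reward - self.q[action])


def step(choices):
    """Per-UE reward: +1 if the chosen RB is collision-free (proxy for
    meeting the PDB/throughput target), -1 on a collision."""
    counts = {}
    for c in choices:
        counts[c] = counts.get(c, 0) + 1
    return [1.0 if counts[c] == 1 else -1.0 for c in choices]


agents = [QAgent(N_RBS) for _ in range(N_UES)]
for _ in range(EPISODES):
    choices = [agent.act() for agent in agents]
    rewards = step(choices)
    for agent, action, reward in zip(agents, choices, rewards):
        agent.update(action, reward)

# Greedy evaluation: each V-UE commits to its highest-valued RB.
greedy = [max(range(N_RBS), key=agent.q.__getitem__) for agent in agents]
print("learned allocation:", greedy)
```

A SARL variant of the same sketch would replace the four independent learners with one central agent whose action is the joint assignment of all four V-UEs — which is the core trade-off the paper evaluates: centralized decisions coordinate better on latency, while per-UE agents scale and adapt per link.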