{"title":"A Hybrid Sensing and Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications","authors":"T. Şahin, Mate Boban, R. Khalili, A. Wolisz","doi":"10.1109/IEEECONF44664.2019.9048691","DOIUrl":null,"url":null,"abstract":"Vehicle-to-vehicle (V2V) communications performance depends significantly on the approach taken to schedule the radio resources. When the infrastructure is available, so far the best performing V2V scheduling algorithms are based on centralized approach. In case there is no infrastructure, sensing the resources in a distributed manner to determine whether a specific resource is free performs well. We propose a hybrid solution, where a centralized reinforcement learning (RL) algorithm provides a candidate subset of resources, whereas a distributed sensing mechanism, running on each vehicle, makes the final resource selection. We evaluate the performance of the proposed approach in an out-of-coverage setting and show that it outperforms the state-of-the-art algorithms in highly dynamic scenarios by using the best of both worlds: RL agent provides optimized long-term resource allocations, while the distributed sensing handles temporary and unforeseen network conditions that can not be predicted effectively.","PeriodicalId":6684,"journal":{"name":"2019 53rd Asilomar Conference on Signals, Systems, and Computers","volume":"72 1","pages":"1136-1143"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 53rd Asilomar Conference on Signals, Systems, and Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEEECONF44664.2019.9048691","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
Vehicle-to-vehicle (V2V) communications performance depends significantly on how radio resources are scheduled. When infrastructure is available, the best-performing V2V scheduling algorithms to date are centralized. When no infrastructure is present, sensing the resources in a distributed manner to determine whether a specific resource is free performs well. We propose a hybrid solution in which a centralized reinforcement learning (RL) algorithm provides a candidate subset of resources, while a distributed sensing mechanism running on each vehicle makes the final resource selection. We evaluate the proposed approach in an out-of-coverage setting and show that it outperforms state-of-the-art algorithms in highly dynamic scenarios by combining the best of both worlds: the RL agent provides optimized long-term resource allocations, while distributed sensing handles temporary and unforeseen network conditions that cannot be predicted effectively.
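The two-stage selection the abstract describes can be sketched as follows. This is a minimal illustration, not the paper's actual algorithm: the function name `hybrid_select`, the energy threshold, and the fallback rule are all assumptions introduced here. The centralized RL agent is assumed to have already narrowed the resource pool to a candidate subset; each vehicle then applies its local sensing measurements to pick one resource from that subset.

```python
import random

def hybrid_select(rl_candidates, sensed_energy, threshold=0.5):
    """Two-stage hybrid resource selection (illustrative sketch).

    rl_candidates: resource IDs proposed by the centralized RL agent.
    sensed_energy: vehicle-local map of resource ID -> measured energy,
                   where lower energy suggests the resource is free.
    threshold:     hypothetical energy level below which a resource
                   is treated as unoccupied.
    """
    # Distributed sensing step: keep only candidates sensed as free.
    free = [r for r in rl_candidates if sensed_energy.get(r, 0.0) < threshold]
    if free:
        # Break ties randomly among resources that appear free.
        return random.choice(free)
    # If nothing looks free, fall back to the least-occupied candidate,
    # so the vehicle still transmits within the RL-provided subset.
    return min(rl_candidates, key=lambda r: sensed_energy.get(r, 1.0))
```

The key design point mirrored here is that sensing only ever chooses *within* the RL-provided subset, so the long-term allocation decided centrally is respected while short-term occupancy seen locally decides the final pick.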