RAMRL: Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning

2023 IEEE International Conference on Mobility, Operations, Services and Technologies (MOST) Pub Date : 2022-07-21 DOI:10.1109/MOST57249.2023.00011

G. Bagwe, Xiaoyong Yuan, Xianhao Chen, Lan Zhang

{"title":"RAMRL: Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning","authors":"G. Bagwe, Xiaoyong Yuan, Xianhao Chen, Lan Zhang","doi":"10.1109/MOST57249.2023.00011","DOIUrl":null,"url":null,"abstract":"Despite the success of AI-enabled onboard perception, on-ramp merging has been one of the main challenges for autonomous driving. Due to limited sensing range of onboard sensors, a merging vehicle can hardly observe main road conditions and merge properly. By leveraging the wireless communications between connected and automated vehicles (CAVs), a merging CAV has potential to proactively obtain the intentions of nearby vehicles. However, CAVs can be prone to inaccurate observations, such as the noisy basic safety messages (BSM) and poor quality surveillance images. In this paper, we present a novel approach for Robust on-ramp merge of CAVs via Augmented and Multi-modal Reinforcement Learning, named by RAMRL. Specifically, we formulate the on-ramp merging problem as a Markov decision process (MDP) by taking driving safety, comfort driving behavior, and traffic efficiency into account. To provide reliable merging maneuvers, we simultaneously leverage BSM and surveillance images for multi-modal observation, which is used to learn a policy model through proximal policy optimization (PPO). Moreover, to improve data efficiency and provide better generalization performance, we train the policy model with augmented data (e.g., noisy BSM and noisy surveillance images). Extensive experiments are conducted with Simulation of Urban MObility (SUMO) platform under two typical merging scenarios. Experimental results demonstrate the effectiveness and efficiency of our robust on-ramp merging design.","PeriodicalId":338621,"journal":{"name":"2023 IEEE International Conference on Mobility, Operations, Services and Technologies (MOST)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Mobility, Operations, Services and Technologies (MOST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MOST57249.2023.00011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Despite the success of AI-enabled onboard perception, on-ramp merging has been one of the main challenges for autonomous driving. Due to limited sensing range of onboard sensors, a merging vehicle can hardly observe main road conditions and merge properly. By leveraging the wireless communications between connected and automated vehicles (CAVs), a merging CAV has potential to proactively obtain the intentions of nearby vehicles. However, CAVs can be prone to inaccurate observations, such as the noisy basic safety messages (BSM) and poor quality surveillance images. In this paper, we present a novel approach for Robust on-ramp merge of CAVs via Augmented and Multi-modal Reinforcement Learning, named by RAMRL. Specifically, we formulate the on-ramp merging problem as a Markov decision process (MDP) by taking driving safety, comfort driving behavior, and traffic efficiency into account. To provide reliable merging maneuvers, we simultaneously leverage BSM and surveillance images for multi-modal observation, which is used to learn a policy model through proximal policy optimization (PPO). Moreover, to improve data efficiency and provide better generalization performance, we train the policy model with augmented data (e.g., noisy BSM and noisy surveillance images). Extensive experiments are conducted with Simulation of Urban MObility (SUMO) platform under two typical merging scenarios. Experimental results demonstrate the effectiveness and efficiency of our robust on-ramp merging design.

查看原文本刊更多论文

RAMRL:通过增强多模态强化学习实现稳健的入口匝道合并

尽管人工智能的车载感知技术取得了成功，但入匝道合并一直是自动驾驶面临的主要挑战之一。由于车载传感器的感知范围有限，合并车辆很难观察到主路状况并进行正常合并。通过利用联网汽车和自动驾驶汽车(CAV)之间的无线通信，合并后的CAV有可能主动获取附近车辆的意图。然而，自动驾驶汽车可能容易产生不准确的观测结果，例如嘈杂的基本安全信息(BSM)和质量差的监视图像。在本文中，我们提出了一种通过增强和多模态强化学习实现自动驾驶汽车鲁棒入匝道合并的新方法，称为RAMRL。具体而言，我们将入匝道合并问题表述为考虑驾驶安全、舒适驾驶行为和交通效率的马尔可夫决策过程(MDP)。为了提供可靠的合并机动，我们同时利用BSM和监视图像进行多模态观测，并通过近端策略优化(PPO)来学习策略模型。此外，为了提高数据效率和提供更好的泛化性能，我们使用增强数据(例如，带噪声的BSM和带噪声的监控图像)训练策略模型。在城市交通仿真(SUMO)平台上进行了两种典型合并场景下的大量实验。实验结果证明了稳健的匝道进匝道合并设计的有效性和高效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE International Conference on Mobility, Operations, Services and Technologies (MOST)

自引率

0.00%

发文量