RAMRL: Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning

G. Bagwe, Xiaoyong Yuan, Xianhao Chen, Lan Zhang
{"title":"RAMRL: Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning","authors":"G. Bagwe, Xiaoyong Yuan, Xianhao Chen, Lan Zhang","doi":"10.1109/MOST57249.2023.00011","DOIUrl":null,"url":null,"abstract":"Despite the success of AI-enabled onboard perception, on-ramp merging has been one of the main challenges for autonomous driving. Due to limited sensing range of onboard sensors, a merging vehicle can hardly observe main road conditions and merge properly. By leveraging the wireless communications between connected and automated vehicles (CAVs), a merging CAV has potential to proactively obtain the intentions of nearby vehicles. However, CAVs can be prone to inaccurate observations, such as the noisy basic safety messages (BSM) and poor quality surveillance images. In this paper, we present a novel approach for Robust on-ramp merge of CAVs via Augmented and Multi-modal Reinforcement Learning, named by RAMRL. Specifically, we formulate the on-ramp merging problem as a Markov decision process (MDP) by taking driving safety, comfort driving behavior, and traffic efficiency into account. To provide reliable merging maneuvers, we simultaneously leverage BSM and surveillance images for multi-modal observation, which is used to learn a policy model through proximal policy optimization (PPO). Moreover, to improve data efficiency and provide better generalization performance, we train the policy model with augmented data (e.g., noisy BSM and noisy surveillance images). Extensive experiments are conducted with Simulation of Urban MObility (SUMO) platform under two typical merging scenarios. Experimental results demonstrate the effectiveness and efficiency of our robust on-ramp merging design.","PeriodicalId":338621,"journal":{"name":"2023 IEEE International Conference on Mobility, Operations, Services and Technologies (MOST)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Mobility, Operations, Services and Technologies (MOST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MOST57249.2023.00011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Despite the success of AI-enabled onboard perception, on-ramp merging has been one of the main challenges for autonomous driving. Due to limited sensing range of onboard sensors, a merging vehicle can hardly observe main road conditions and merge properly. By leveraging the wireless communications between connected and automated vehicles (CAVs), a merging CAV has potential to proactively obtain the intentions of nearby vehicles. However, CAVs can be prone to inaccurate observations, such as the noisy basic safety messages (BSM) and poor quality surveillance images. In this paper, we present a novel approach for Robust on-ramp merge of CAVs via Augmented and Multi-modal Reinforcement Learning, named by RAMRL. Specifically, we formulate the on-ramp merging problem as a Markov decision process (MDP) by taking driving safety, comfort driving behavior, and traffic efficiency into account. To provide reliable merging maneuvers, we simultaneously leverage BSM and surveillance images for multi-modal observation, which is used to learn a policy model through proximal policy optimization (PPO). Moreover, to improve data efficiency and provide better generalization performance, we train the policy model with augmented data (e.g., noisy BSM and noisy surveillance images). Extensive experiments are conducted with Simulation of Urban MObility (SUMO) platform under two typical merging scenarios. Experimental results demonstrate the effectiveness and efficiency of our robust on-ramp merging design.
RAMRL:通过增强多模态强化学习实现稳健的入口匝道合并
尽管人工智能的车载感知技术取得了成功,但入匝道合并一直是自动驾驶面临的主要挑战之一。由于车载传感器的感知范围有限,合并车辆很难观察到主路状况并进行正常合并。通过利用联网汽车和自动驾驶汽车(CAV)之间的无线通信,合并后的CAV有可能主动获取附近车辆的意图。然而,自动驾驶汽车可能容易产生不准确的观测结果,例如嘈杂的基本安全信息(BSM)和质量差的监视图像。在本文中,我们提出了一种通过增强和多模态强化学习实现自动驾驶汽车鲁棒入匝道合并的新方法,称为RAMRL。具体而言,我们将入匝道合并问题表述为考虑驾驶安全、舒适驾驶行为和交通效率的马尔可夫决策过程(MDP)。为了提供可靠的合并机动,我们同时利用BSM和监视图像进行多模态观测,并通过近端策略优化(PPO)来学习策略模型。此外,为了提高数据效率和提供更好的泛化性能,我们使用增强数据(例如,带噪声的BSM和带噪声的监控图像)训练策略模型。在城市交通仿真(SUMO)平台上进行了两种典型合并场景下的大量实验。实验结果证明了稳健的匝道进匝道合并设计的有效性和高效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信