Optimal Safety Planning and Driving Decision-Making for Multiple Autonomous Vehicles: A Learning Based Approach

Abu Jafar Md Muzahid, M. Rahim, Saydul Akbar Murad, S. F. Kamarulzaman, Md. Arafatur Rahman
{"title":"Optimal Safety Planning and Driving Decision-Making for Multiple Autonomous Vehicles: A Learning Based Approach","authors":"Abu Jafar Md Muzahid, M. Rahim, Saydul Akbar Murad, S. F. Kamarulzaman, Md. Arafatur Rahman","doi":"10.1109/ETCCE54784.2021.9689820","DOIUrl":null,"url":null,"abstract":"In the early diffusion stage of autonomous vehicle systems, the controlling of vehicles through exacting decision-making to reduce the number of collisions is a major problem. This paper offers a DRL-based safety planning decision-making scheme in an emergency that leads to both the first and multiple collisions. Firstly, the lane-changing process and braking method are thoroughly analyzed, taking into account the critical aspects of developing an autonomous driving safety scheme. Secondly, we propose a DRL strategy that specifies the optimum driving techniques. We use a multiple-goal reward system to balance the accomplishment rewards from cooperative and competitive approaches, accident severity, and passenger comfort. Thirdly, the deep deterministic policy gradient (DDPG), a basic actor-critic (AC) technique, is used to mitigate the numerous collision problems. This approach can improve the efficacy of the optimal strategy while remaining stable for ongoing control mechanisms. In an emergency, the agent car can adapt optimum driving behaviors to enhance driving safety when adequately trained strategies. Extensive simulations show our concept’s effectiveness and worth in learning efficiency, decision accuracy, and safety.","PeriodicalId":208038,"journal":{"name":"2021 Emerging Technology in Computing, Communication and Electronics (ETCCE)","volume":"12 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Emerging Technology in Computing, Communication and Electronics (ETCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETCCE54784.2021.9689820","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

In the early diffusion stage of autonomous vehicle systems, the controlling of vehicles through exacting decision-making to reduce the number of collisions is a major problem. This paper offers a DRL-based safety planning decision-making scheme in an emergency that leads to both the first and multiple collisions. Firstly, the lane-changing process and braking method are thoroughly analyzed, taking into account the critical aspects of developing an autonomous driving safety scheme. Secondly, we propose a DRL strategy that specifies the optimum driving techniques. We use a multiple-goal reward system to balance the accomplishment rewards from cooperative and competitive approaches, accident severity, and passenger comfort. Thirdly, the deep deterministic policy gradient (DDPG), a basic actor-critic (AC) technique, is used to mitigate the numerous collision problems. This approach can improve the efficacy of the optimal strategy while remaining stable for ongoing control mechanisms. In an emergency, the agent car can adapt optimum driving behaviors to enhance driving safety when adequately trained strategies. Extensive simulations show our concept’s effectiveness and worth in learning efficiency, decision accuracy, and safety.
多辆自动驾驶汽车的最优安全规划和驾驶决策:基于学习的方法
在自动驾驶汽车系统的早期扩散阶段,通过严格的决策来控制车辆以减少碰撞次数是一个主要问题。本文提出了一种基于drl的首次碰撞和多次碰撞的紧急事故安全规划决策方案。首先,考虑到开发自动驾驶安全方案的关键方面,对变道过程和制动方法进行了深入分析。其次,我们提出了一个指定最佳驾驶技术的DRL策略。我们使用一个多目标奖励系统来平衡合作和竞争方式、事故严重性和乘客舒适度的成就奖励。第三,采用深度确定性策略梯度(deep deterministic policy gradient, DDPG)这一基本的行为者批评(actor- critical, AC)技术来缓解大量的碰撞问题。这种方法可以提高最优策略的有效性,同时保持持续控制机制的稳定性。在紧急情况下,通过适当的策略训练,智能车可以适应最优的驾驶行为以提高驾驶安全性。大量的仿真实验证明了该概念在学习效率、决策准确性和安全性方面的有效性和价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信