多交通场景下更安全的自动驾驶填充动作选择强化学习算法

Fan Yang, Xueyuan Li, Qi Liu, Chaoyang Liu, Zirui Li, Yong Liu
{"title":"多交通场景下更安全的自动驾驶填充动作选择强化学习算法","authors":"Fan Yang, Xueyuan Li, Qi Liu, Chaoyang Liu, Zirui Li, Yong Liu","doi":"10.1109/IV55152.2023.10186804","DOIUrl":null,"url":null,"abstract":"Learning-based algorithms are gradually emerging in the field of autonomous driving due to their powerful data processing capabilities. Researchers in the field of intelligent vehicle planning and decision-making are gradually using reinforcement learning algorithms to solve related problems. The safety research of reinforcement learning algorithms is significant and widely concerned. The main reason for the safety problem of the existing reinforcement learning algorithm is that there is still a bias in the safety judgment of the current environment, and it is impossible to make directional improvements by modifying the network and training method. In this paper, an action judgment network is designed as a standard to select the optimal action, which can assist the algorithm to judge environmental safety more deeply. Firstly, the action judgment network takes the state space and action as input, and the output is the safety state of the vehicle after the action. Secondly, this work establishes the required database to train the action judgment network through deep learning and achieves the highest accuracy of 98%. Finally, the proposed algorithm is tested in three scenarios: single-lane, intersection, and roundabout. This algorithm can judge the actions according to the reinforcement learning q value table order until the optimal and safe action is selected. The results show that the newly proposed algorithm can greatly improve the safety of the algorithm without affecting vehicle speed.","PeriodicalId":195148,"journal":{"name":"2023 IEEE Intelligent Vehicles Symposium (IV)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Filling Action Selection Reinforcement Learning Algorithm for Safer Autonomous Driving in Multi-Traffic Scenes\",\"authors\":\"Fan Yang, Xueyuan Li, Qi Liu, Chaoyang Liu, Zirui Li, Yong Liu\",\"doi\":\"10.1109/IV55152.2023.10186804\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Learning-based algorithms are gradually emerging in the field of autonomous driving due to their powerful data processing capabilities. Researchers in the field of intelligent vehicle planning and decision-making are gradually using reinforcement learning algorithms to solve related problems. The safety research of reinforcement learning algorithms is significant and widely concerned. The main reason for the safety problem of the existing reinforcement learning algorithm is that there is still a bias in the safety judgment of the current environment, and it is impossible to make directional improvements by modifying the network and training method. In this paper, an action judgment network is designed as a standard to select the optimal action, which can assist the algorithm to judge environmental safety more deeply. Firstly, the action judgment network takes the state space and action as input, and the output is the safety state of the vehicle after the action. Secondly, this work establishes the required database to train the action judgment network through deep learning and achieves the highest accuracy of 98%. Finally, the proposed algorithm is tested in three scenarios: single-lane, intersection, and roundabout. This algorithm can judge the actions according to the reinforcement learning q value table order until the optimal and safe action is selected. The results show that the newly proposed algorithm can greatly improve the safety of the algorithm without affecting vehicle speed.\",\"PeriodicalId\":195148,\"journal\":{\"name\":\"2023 IEEE Intelligent Vehicles Symposium (IV)\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE Intelligent Vehicles Symposium (IV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IV55152.2023.10186804\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Intelligent Vehicles Symposium (IV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IV55152.2023.10186804","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

基于学习的算法由于其强大的数据处理能力,在自动驾驶领域逐渐兴起。智能车辆规划与决策领域的研究人员正在逐步使用强化学习算法来解决相关问题。强化学习算法的安全性研究意义重大,受到广泛关注。现有强化学习算法存在安全性问题的主要原因是对当前环境的安全性判断仍然存在偏差,无法通过修改网络和训练方法进行定向改进。本文设计了一个行为判断网络作为选择最优行为的标准,帮助算法对环境安全进行更深入的判断。首先,动作判断网络以状态空间和动作作为输入,输出为动作后车辆的安全状态。其次,本文建立了所需的数据库,通过深度学习训练动作判断网络,达到了98%的最高准确率。最后,在单车道、交叉路口和环形交叉路口三种场景下对该算法进行了测试。该算法可以根据强化学习q值表顺序判断动作,直到选择出最优且安全的动作。结果表明,该算法在不影响车速的情况下,大大提高了算法的安全性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Filling Action Selection Reinforcement Learning Algorithm for Safer Autonomous Driving in Multi-Traffic Scenes
Learning-based algorithms are gradually emerging in the field of autonomous driving due to their powerful data processing capabilities. Researchers in the field of intelligent vehicle planning and decision-making are gradually using reinforcement learning algorithms to solve related problems. The safety research of reinforcement learning algorithms is significant and widely concerned. The main reason for the safety problem of the existing reinforcement learning algorithm is that there is still a bias in the safety judgment of the current environment, and it is impossible to make directional improvements by modifying the network and training method. In this paper, an action judgment network is designed as a standard to select the optimal action, which can assist the algorithm to judge environmental safety more deeply. Firstly, the action judgment network takes the state space and action as input, and the output is the safety state of the vehicle after the action. Secondly, this work establishes the required database to train the action judgment network through deep learning and achieves the highest accuracy of 98%. Finally, the proposed algorithm is tested in three scenarios: single-lane, intersection, and roundabout. This algorithm can judge the actions according to the reinforcement learning q value table order until the optimal and safe action is selected. The results show that the newly proposed algorithm can greatly improve the safety of the algorithm without affecting vehicle speed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信