RL-based path planning for controller performance validation

Lukas Schichler, Karin Festl, M. Stolz, D. Watzenig
{"title":"RL-based path planning for controller performance validation","authors":"Lukas Schichler, Karin Festl, M. Stolz, D. Watzenig","doi":"10.1109/MED59994.2023.10185811","DOIUrl":null,"url":null,"abstract":"Autonomous vehicles (AVs) will be part of everyday life in the near future. In order to accelerate this process, many subsystems need to be optimised and validated. One of the most important subsystem of AVs is the steering controller. It’s task is to keep the vehicle on track, which is the reason, why many steering controllers have been designed for a large variety of applications. However, the validation of such controllers is a labour-intensive task, which is why in this paper, an Artificial Intelligence (AI) is trained to find an edge case path that brings the steering controller to its limits. This path is a sufficient substitute for a large set of paths and enables fast validation of steering controllers. This contribution describes the development of a reinforcement learning (RL) based path planner using the PPO-Algorithm to train a so called agent. Comparing the resulting key feature maps shows that the agent adapts to each controllers characteristics during the learning process. The result is demonstrated for three different state of the art path tracking controllers. For each controller the agent finds a path that leads to the controllers failure within seconds.","PeriodicalId":270226,"journal":{"name":"2023 31st Mediterranean Conference on Control and Automation (MED)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 31st Mediterranean Conference on Control and Automation (MED)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MED59994.2023.10185811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Autonomous vehicles (AVs) will be part of everyday life in the near future. In order to accelerate this process, many subsystems need to be optimised and validated. One of the most important subsystem of AVs is the steering controller. It’s task is to keep the vehicle on track, which is the reason, why many steering controllers have been designed for a large variety of applications. However, the validation of such controllers is a labour-intensive task, which is why in this paper, an Artificial Intelligence (AI) is trained to find an edge case path that brings the steering controller to its limits. This path is a sufficient substitute for a large set of paths and enables fast validation of steering controllers. This contribution describes the development of a reinforcement learning (RL) based path planner using the PPO-Algorithm to train a so called agent. Comparing the resulting key feature maps shows that the agent adapts to each controllers characteristics during the learning process. The result is demonstrated for three different state of the art path tracking controllers. For each controller the agent finds a path that leads to the controllers failure within seconds.
基于rl的控制器性能验证路径规划
在不久的将来,自动驾驶汽车(av)将成为日常生活的一部分。为了加速这一过程,需要对许多子系统进行优化和验证。自动驾驶汽车中最重要的子系统之一是转向控制器。它的任务是保持车辆在轨道上,这就是为什么许多转向控制器被设计用于各种各样的应用。然而,这种控制器的验证是一项劳动密集型任务,这就是为什么在本文中,人工智能(AI)被训练来找到一个边缘情况路径,使转向控制器达到其极限。这个路径是一个足够的替代大量的路径集,并能够快速验证转向控制器。该贡献描述了基于强化学习(RL)的路径规划器的开发,该路径规划器使用ppo算法来训练所谓的代理。对比得到的关键特征映射,可以看出智能体在学习过程中适应了每个控制器的特征。结果演示了三个不同状态的最先进的路径跟踪控制器。对于每个控制器,代理会在几秒钟内找到导致控制器故障的路径。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信