基于双临界DRL方法的深度驾驶精确控制

2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops) Pub Date : 2021-07-11 DOI:10.1109/ivworkshops54471.2021.9669204

Surbhi Gupta, Gaurav Singal, Deepak Garg

{"title":"基于双临界DRL方法的深度驾驶精确控制","authors":"Surbhi Gupta, Gaurav Singal, Deepak Garg","doi":"10.1109/ivworkshops54471.2021.9669204","DOIUrl":null,"url":null,"abstract":"Autonomous driving problems related to vehicle control using deep reinforcement learning (DRL) techniques, are still unsolved. DRL approaches have achieved notable results, its dependability on reward functions and defining the type of control actions are dominating factors of the objective, that controls its success. Several DRL approaches applied in the past consider a finite set of available actions to be controlled by the agent hence, it performs sharp actions. While real driving requires precision control capabilities that tend to apply safer and smoother actions. For incorporating such precision control capabilities, this paper considers the driving problem as a continuous control problem. For this, the gym-highway environments are used as these environments are controllable and customizable to simulate diverse driving scenarios. The simulation setup for parking is updated to resemble the complex scenario and for highway driving a novel reward function is designed to handle continuous actions. Dual critic based DRL approaches are applied as these approaches have shown remarkable performance in robotic locomotion control problems. The video results demonstrate the way different policies fulfil the objective.","PeriodicalId":256905,"journal":{"name":"2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Precise Control for Deep Driving using Dual Critic based DRL Approaches\",\"authors\":\"Surbhi Gupta, Gaurav Singal, Deepak Garg\",\"doi\":\"10.1109/ivworkshops54471.2021.9669204\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Autonomous driving problems related to vehicle control using deep reinforcement learning (DRL) techniques, are still unsolved. DRL approaches have achieved notable results, its dependability on reward functions and defining the type of control actions are dominating factors of the objective, that controls its success. Several DRL approaches applied in the past consider a finite set of available actions to be controlled by the agent hence, it performs sharp actions. While real driving requires precision control capabilities that tend to apply safer and smoother actions. For incorporating such precision control capabilities, this paper considers the driving problem as a continuous control problem. For this, the gym-highway environments are used as these environments are controllable and customizable to simulate diverse driving scenarios. The simulation setup for parking is updated to resemble the complex scenario and for highway driving a novel reward function is designed to handle continuous actions. Dual critic based DRL approaches are applied as these approaches have shown remarkable performance in robotic locomotion control problems. The video results demonstrate the way different policies fulfil the objective.\",\"PeriodicalId\":256905,\"journal\":{\"name\":\"2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)\",\"volume\":\"65 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ivworkshops54471.2021.9669204\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ivworkshops54471.2021.9669204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

与使用深度强化学习(DRL)技术的车辆控制相关的自动驾驶问题仍未得到解决。DRL方法已经取得了显著的成果，其对奖励函数的依赖性和控制动作类型的定义是控制目标成功的主要因素。过去应用的几种DRL方法考虑由代理控制的有限可用动作集，因此它执行尖锐动作。而真正的驾驶需要精确的控制能力，往往会应用更安全、更平稳的动作。为了结合这种精确控制能力，本文将驱动问题视为连续控制问题。为此，我们使用了健身房-高速公路环境，因为这些环境是可控和可定制的，可以模拟不同的驾驶场景。停车的模拟设置更新到类似于复杂的场景，高速公路驾驶设计了一个新的奖励函数来处理连续的动作。基于双批评的DRL方法在机器人运动控制问题中表现出了显著的性能，因此被应用。视频结果展示了不同政策实现目标的方式。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Precise Control for Deep Driving using Dual Critic based DRL Approaches

Autonomous driving problems related to vehicle control using deep reinforcement learning (DRL) techniques, are still unsolved. DRL approaches have achieved notable results, its dependability on reward functions and defining the type of control actions are dominating factors of the objective, that controls its success. Several DRL approaches applied in the past consider a finite set of available actions to be controlled by the agent hence, it performs sharp actions. While real driving requires precision control capabilities that tend to apply safer and smoother actions. For incorporating such precision control capabilities, this paper considers the driving problem as a continuous control problem. For this, the gym-highway environments are used as these environments are controllable and customizable to simulate diverse driving scenarios. The simulation setup for parking is updated to resemble the complex scenario and for highway driving a novel reward function is designed to handle continuous actions. Dual critic based DRL approaches are applied as these approaches have shown remarkable performance in robotic locomotion control problems. The video results demonstrate the way different policies fulfil the objective.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE Intelligent Vehicles Symposium Workshops (IV Workshops)

自引率

0.00%

发文量