Authors: Lászlo Szőke, S. Aradi, Tamás Bécsi, P. Gáspár
DOI: 10.1109/INES49302.2020.9147185
Published in: 2020 IEEE 24th International Conference on Intelligent Engineering Systems (INES), July 2020
Driving on Highway by Using Reinforcement Learning with CNN and LSTM Networks
This work presents a powerful and intelligent driver agent, designed to operate in a preset highway situation and trained with Policy Gradient Reinforcement Learning (RL). Our goal is to create an agent that is capable of navigating safely in changing highway traffic and of successfully traversing the defined section while keeping the reference speed. At the same time, we create a state representation that extracts information from images of the actual highway situation. The algorithm uses a Convolutional Neural Network (CNN) with Long Short-Term Memory (LSTM) layers as a function approximator for the agent, with a discrete action space at the control level, e.g., acceleration and lane change. Simulation of Urban MObility (SUMO), an open-source microscopic traffic simulator, is chosen as our simulation environment. It provides an open interface for real-time interaction with the agent. The agent can learn from the numerous driving and highway situations that are created and fed to it. The representation becomes more general by randomizing and customizing the behavior of the other road users in the simulation, so the agent's experience becomes much more diverse. The article briefly describes the modeling environment, the details of the learning agent, and the reward scheme. After evaluating the experience gained from training, further plans and optimization ideas are outlined.
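As a rough illustration of the training setup the abstract describes — a policy-gradient agent choosing among discrete control actions while trying to hold a reference speed — the sketch below implements REINFORCE with a plain linear-softmax policy and a toy speed-tracking environment. The linear policy stands in for the paper's CNN+LSTM network and the toy environment stands in for SUMO; every class, parameter, and reward value here is invented for this example, not taken from the paper.

```python
import numpy as np

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

class LinearPolicy:
    """Stand-in for the paper's CNN+LSTM approximator: maps a state
    vector to a probability distribution over discrete actions."""
    def __init__(self, n_features, n_actions, lr=0.01, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.normal(scale=0.1, size=(n_actions, n_features))
        self.lr = lr

    def act(self, state, rng):
        probs = softmax(self.W @ state)
        action = rng.choice(len(probs), p=probs)
        return action, probs

    def update(self, trajectory, gamma=0.99):
        # REINFORCE: follow grad log pi(a|s), weighted by the whitened
        # discounted return from each timestep (whitening reduces variance).
        G, returns = 0.0, []
        for *_, r in reversed(trajectory):
            G = r + gamma * G
            returns.append(G)
        returns = np.array(returns[::-1])
        adv = (returns - returns.mean()) / (returns.std() + 1e-8)
        for (s, a, probs, _), A in zip(trajectory, adv):
            grad = -np.outer(probs, s)   # d log pi / dW ...
            grad[a] += s                 # ... = (onehot(a) - probs) outer s
            self.W += self.lr * A * grad

class SpeedTrackEnv:
    """Toy stand-in for SUMO: hold a reference speed on a straight road.
    Actions: 0 = accelerate, 1 = brake, 2 = hold."""
    def __init__(self, v_ref=30.0):
        self.v_ref = v_ref

    def reset(self):
        self.v = 20.0
        return self._obs()

    def _obs(self):
        return np.array([(self.v_ref - self.v) / 10.0, 1.0])  # error + bias

    def step(self, action):
        self.v += (+1.0, -1.0, 0.0)[action]
        reward = -abs(self.v_ref - self.v) / 10.0  # penalize speed error
        return self._obs(), reward

env = SpeedTrackEnv()
policy = LinearPolicy(n_features=2, n_actions=3)
rng = np.random.default_rng(1)
for episode in range(300):
    s, traj = env.reset(), []
    for _ in range(20):
        a, probs = policy.act(s, rng)
        s_next, r = env.step(a)
        traj.append((s, a, probs, r))
        s = s_next
    policy.update(traj)

# After training, the policy should favor accelerating over braking
# whenever the current speed is below the reference.
p_final = softmax(policy.W @ env.reset())
```

In the paper the state is image-based and the action set also covers lane changes, so the function approximator is a CNN feeding LSTM layers rather than a single linear map; the policy-gradient update itself has the same shape as above.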