基于深度强化学习的可变限速控制:一种可能的实现

2020 International Symposium ELMAR Pub Date : 2020-09-01 DOI:10.1109/ELMAR49956.2020.9219031

M. Gregurić, K. Kušić, Filip Vrbanić, E. Ivanjko

{"title":"基于深度强化学习的可变限速控制:一种可能的实现","authors":"M. Gregurić, K. Kušić, Filip Vrbanić, E. Ivanjko","doi":"10.1109/ELMAR49956.2020.9219031","DOIUrl":null,"url":null,"abstract":"Today’s urban motorways cannot fulfill their purpose to simultaneously serve transit and local urban traffic with a high Level of Service. In the case of urban motorway infrastructure, the traditional “build only” approach is not always possible due to the lack of space. This study is focused on the Variable Speed Limit Control (VSLC) as one of the traffic control methods applicable for any type of motorway and Q-learning as one commonly used approach for designing learning based VSLC algorithms. The drawback of this methodology is the representation and exploration of the large state-action space as it is the case in its application for VSLC. This study introduces a Deep Q-Network to approximate the Q-function and presents a novel learning approach for the VSLC application with possibility to track vehicles on the microscopic level. The proposed reward function steers the learning towards the improvement of reward and prevention of oscillation among consecutive speed limits.","PeriodicalId":235289,"journal":{"name":"2020 International Symposium ELMAR","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Variable Speed Limit Control Based on Deep Reinforcement Learning: A Possible Implementation\",\"authors\":\"M. Gregurić, K. Kušić, Filip Vrbanić, E. Ivanjko\",\"doi\":\"10.1109/ELMAR49956.2020.9219031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Today’s urban motorways cannot fulfill their purpose to simultaneously serve transit and local urban traffic with a high Level of Service. In the case of urban motorway infrastructure, the traditional “build only” approach is not always possible due to the lack of space. This study is focused on the Variable Speed Limit Control (VSLC) as one of the traffic control methods applicable for any type of motorway and Q-learning as one commonly used approach for designing learning based VSLC algorithms. The drawback of this methodology is the representation and exploration of the large state-action space as it is the case in its application for VSLC. This study introduces a Deep Q-Network to approximate the Q-function and presents a novel learning approach for the VSLC application with possibility to track vehicles on the microscopic level. The proposed reward function steers the learning towards the improvement of reward and prevention of oscillation among consecutive speed limits.\",\"PeriodicalId\":235289,\"journal\":{\"name\":\"2020 International Symposium ELMAR\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Symposium ELMAR\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ELMAR49956.2020.9219031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Symposium ELMAR","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELMAR49956.2020.9219031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

今天的城市高速公路不能满足其目的，同时为过境和本地城市交通提供高水平的服务。就城市高速公路基础设施而言，由于空间不足，传统的“只建造”方法并不总是可行的。本文研究了可变限速控制(VSLC)作为一种适用于任何类型高速公路的交通控制方法，而q -学习是设计基于学习的VSLC算法的一种常用方法。这种方法的缺点是对大状态-行动空间的表示和探索，就像它在VSLC中的应用一样。本研究引入深度q网络来近似q函数，并为VSLC应用提供了一种新的学习方法，可以在微观水平上跟踪车辆。所提出的奖励函数引导学习朝着改进奖励和防止连续限速之间振荡的方向发展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Variable Speed Limit Control Based on Deep Reinforcement Learning: A Possible Implementation

Today’s urban motorways cannot fulfill their purpose to simultaneously serve transit and local urban traffic with a high Level of Service. In the case of urban motorway infrastructure, the traditional “build only” approach is not always possible due to the lack of space. This study is focused on the Variable Speed Limit Control (VSLC) as one of the traffic control methods applicable for any type of motorway and Q-learning as one commonly used approach for designing learning based VSLC algorithms. The drawback of this methodology is the representation and exploration of the large state-action space as it is the case in its application for VSLC. This study introduces a Deep Q-Network to approximate the Q-function and presents a novel learning approach for the VSLC application with possibility to track vehicles on the microscopic level. The proposed reward function steers the learning towards the improvement of reward and prevention of oscillation among consecutive speed limits.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 International Symposium ELMAR

自引率

0.00%

发文量