基于深度强化学习的可变限速控制:一种可能的实现

M. Gregurić, K. Kušić, Filip Vrbanić, E. Ivanjko
{"title":"基于深度强化学习的可变限速控制:一种可能的实现","authors":"M. Gregurić, K. Kušić, Filip Vrbanić, E. Ivanjko","doi":"10.1109/ELMAR49956.2020.9219031","DOIUrl":null,"url":null,"abstract":"Today’s urban motorways cannot fulfill their purpose to simultaneously serve transit and local urban traffic with a high Level of Service. In the case of urban motorway infrastructure, the traditional “build only” approach is not always possible due to the lack of space. This study is focused on the Variable Speed Limit Control (VSLC) as one of the traffic control methods applicable for any type of motorway and Q-learning as one commonly used approach for designing learning based VSLC algorithms. The drawback of this methodology is the representation and exploration of the large state-action space as it is the case in its application for VSLC. This study introduces a Deep Q-Network to approximate the Q-function and presents a novel learning approach for the VSLC application with possibility to track vehicles on the microscopic level. The proposed reward function steers the learning towards the improvement of reward and prevention of oscillation among consecutive speed limits.","PeriodicalId":235289,"journal":{"name":"2020 International Symposium ELMAR","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Variable Speed Limit Control Based on Deep Reinforcement Learning: A Possible Implementation\",\"authors\":\"M. Gregurić, K. Kušić, Filip Vrbanić, E. Ivanjko\",\"doi\":\"10.1109/ELMAR49956.2020.9219031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Today’s urban motorways cannot fulfill their purpose to simultaneously serve transit and local urban traffic with a high Level of Service. In the case of urban motorway infrastructure, the traditional “build only” approach is not always possible due to the lack of space. This study is focused on the Variable Speed Limit Control (VSLC) as one of the traffic control methods applicable for any type of motorway and Q-learning as one commonly used approach for designing learning based VSLC algorithms. The drawback of this methodology is the representation and exploration of the large state-action space as it is the case in its application for VSLC. This study introduces a Deep Q-Network to approximate the Q-function and presents a novel learning approach for the VSLC application with possibility to track vehicles on the microscopic level. The proposed reward function steers the learning towards the improvement of reward and prevention of oscillation among consecutive speed limits.\",\"PeriodicalId\":235289,\"journal\":{\"name\":\"2020 International Symposium ELMAR\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Symposium ELMAR\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ELMAR49956.2020.9219031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Symposium ELMAR","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELMAR49956.2020.9219031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

摘要

今天的城市高速公路不能满足其目的,同时为过境和本地城市交通提供高水平的服务。就城市高速公路基础设施而言,由于空间不足,传统的“只建造”方法并不总是可行的。本文研究了可变限速控制(VSLC)作为一种适用于任何类型高速公路的交通控制方法,而q -学习是设计基于学习的VSLC算法的一种常用方法。这种方法的缺点是对大状态-行动空间的表示和探索,就像它在VSLC中的应用一样。本研究引入深度q网络来近似q函数,并为VSLC应用提供了一种新的学习方法,可以在微观水平上跟踪车辆。所提出的奖励函数引导学习朝着改进奖励和防止连续限速之间振荡的方向发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Variable Speed Limit Control Based on Deep Reinforcement Learning: A Possible Implementation
Today’s urban motorways cannot fulfill their purpose to simultaneously serve transit and local urban traffic with a high Level of Service. In the case of urban motorway infrastructure, the traditional “build only” approach is not always possible due to the lack of space. This study is focused on the Variable Speed Limit Control (VSLC) as one of the traffic control methods applicable for any type of motorway and Q-learning as one commonly used approach for designing learning based VSLC algorithms. The drawback of this methodology is the representation and exploration of the large state-action space as it is the case in its application for VSLC. This study introduces a Deep Q-Network to approximate the Q-function and presents a novel learning approach for the VSLC application with possibility to track vehicles on the microscopic level. The proposed reward function steers the learning towards the improvement of reward and prevention of oscillation among consecutive speed limits.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信