{"title":"基于深度强化学习的可变限速控制:一种可能的实现","authors":"M. Gregurić, K. Kušić, Filip Vrbanić, E. Ivanjko","doi":"10.1109/ELMAR49956.2020.9219031","DOIUrl":null,"url":null,"abstract":"Today’s urban motorways cannot fulfill their purpose to simultaneously serve transit and local urban traffic with a high Level of Service. In the case of urban motorway infrastructure, the traditional “build only” approach is not always possible due to the lack of space. This study is focused on the Variable Speed Limit Control (VSLC) as one of the traffic control methods applicable for any type of motorway and Q-learning as one commonly used approach for designing learning based VSLC algorithms. The drawback of this methodology is the representation and exploration of the large state-action space as it is the case in its application for VSLC. This study introduces a Deep Q-Network to approximate the Q-function and presents a novel learning approach for the VSLC application with possibility to track vehicles on the microscopic level. The proposed reward function steers the learning towards the improvement of reward and prevention of oscillation among consecutive speed limits.","PeriodicalId":235289,"journal":{"name":"2020 International Symposium ELMAR","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Variable Speed Limit Control Based on Deep Reinforcement Learning: A Possible Implementation\",\"authors\":\"M. Gregurić, K. Kušić, Filip Vrbanić, E. Ivanjko\",\"doi\":\"10.1109/ELMAR49956.2020.9219031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Today’s urban motorways cannot fulfill their purpose to simultaneously serve transit and local urban traffic with a high Level of Service. In the case of urban motorway infrastructure, the traditional “build only” approach is not always possible due to the lack of space. This study is focused on the Variable Speed Limit Control (VSLC) as one of the traffic control methods applicable for any type of motorway and Q-learning as one commonly used approach for designing learning based VSLC algorithms. The drawback of this methodology is the representation and exploration of the large state-action space as it is the case in its application for VSLC. This study introduces a Deep Q-Network to approximate the Q-function and presents a novel learning approach for the VSLC application with possibility to track vehicles on the microscopic level. The proposed reward function steers the learning towards the improvement of reward and prevention of oscillation among consecutive speed limits.\",\"PeriodicalId\":235289,\"journal\":{\"name\":\"2020 International Symposium ELMAR\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Symposium ELMAR\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ELMAR49956.2020.9219031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Symposium ELMAR","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELMAR49956.2020.9219031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Variable Speed Limit Control Based on Deep Reinforcement Learning: A Possible Implementation
Today’s urban motorways cannot fulfill their purpose to simultaneously serve transit and local urban traffic with a high Level of Service. In the case of urban motorway infrastructure, the traditional “build only” approach is not always possible due to the lack of space. This study is focused on the Variable Speed Limit Control (VSLC) as one of the traffic control methods applicable for any type of motorway and Q-learning as one commonly used approach for designing learning based VSLC algorithms. The drawback of this methodology is the representation and exploration of the large state-action space as it is the case in its application for VSLC. This study introduces a Deep Q-Network to approximate the Q-function and presents a novel learning approach for the VSLC application with possibility to track vehicles on the microscopic level. The proposed reward function steers the learning towards the improvement of reward and prevention of oscillation among consecutive speed limits.