{"title":"基于交叉口模型的深度q学习网络交通灯控制奖励函数设计","authors":"Tanghong Wu, Fanchen Kong, Peng Peng, Zichuan Fan","doi":"10.1109/ICRAE50850.2020.9310858","DOIUrl":null,"url":null,"abstract":"Reinforcement learning based control is an effective approach to traffic light control. In terms of the Deep Q-learning Network (DQN) based traffic light control method, the reward function should be carefully designed, as it affects the performance of the control method and it also brings the challenge in standardized design. Thus, the control method would fail to perform well if the reward function was established improperly. If the reward function was too complicated, it would be time-consuming for the algorithm. To solve this problem, we proposed an intersection-model based reward function in the DQN algorithm. We investigated the different types of urban road intersection structures and analyzed their features. Then we used those features to formulate the reward function. Our model was evaluated via the simulation dataset under different road situations and got less emergency stop with 14%~32% improvement while the number of passing vehicles was also a bit more than the baseline. There were also 22% and 7% speedup in the training process under the tow training process.","PeriodicalId":296832,"journal":{"name":"2020 5th International Conference on Robotics and Automation Engineering (ICRAE)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Road Intersection Model Based Reward Function Design in Deep Q-Learning Network for Traffic Light Control\",\"authors\":\"Tanghong Wu, Fanchen Kong, Peng Peng, Zichuan Fan\",\"doi\":\"10.1109/ICRAE50850.2020.9310858\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reinforcement learning based control is an effective approach to traffic light control. In terms of the Deep Q-learning Network (DQN) based traffic light control method, the reward function should be carefully designed, as it affects the performance of the control method and it also brings the challenge in standardized design. Thus, the control method would fail to perform well if the reward function was established improperly. If the reward function was too complicated, it would be time-consuming for the algorithm. To solve this problem, we proposed an intersection-model based reward function in the DQN algorithm. We investigated the different types of urban road intersection structures and analyzed their features. Then we used those features to formulate the reward function. Our model was evaluated via the simulation dataset under different road situations and got less emergency stop with 14%~32% improvement while the number of passing vehicles was also a bit more than the baseline. There were also 22% and 7% speedup in the training process under the tow training process.\",\"PeriodicalId\":296832,\"journal\":{\"name\":\"2020 5th International Conference on Robotics and Automation Engineering (ICRAE)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th International Conference on Robotics and Automation Engineering (ICRAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICRAE50850.2020.9310858\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Robotics and Automation Engineering (ICRAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAE50850.2020.9310858","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Road Intersection Model Based Reward Function Design in Deep Q-Learning Network for Traffic Light Control
Reinforcement learning based control is an effective approach to traffic light control. In terms of the Deep Q-learning Network (DQN) based traffic light control method, the reward function should be carefully designed, as it affects the performance of the control method and it also brings the challenge in standardized design. Thus, the control method would fail to perform well if the reward function was established improperly. If the reward function was too complicated, it would be time-consuming for the algorithm. To solve this problem, we proposed an intersection-model based reward function in the DQN algorithm. We investigated the different types of urban road intersection structures and analyzed their features. Then we used those features to formulate the reward function. Our model was evaluated via the simulation dataset under different road situations and got less emergency stop with 14%~32% improvement while the number of passing vehicles was also a bit more than the baseline. There were also 22% and 7% speedup in the training process under the tow training process.