{"title":"基于彩虹深度强化学习智能体的交通拥堵改进解决方案","authors":"Mahmoud Nawar, Ahmed M. Fares, A. Al-sammak","doi":"10.1109/JAC-ECC48896.2019.9051262","DOIUrl":null,"url":null,"abstract":"While traffic congestion hits severely the world economy, adaptive traffic signal systems would efficiently provide potential solutions. In this paper, we propose a deep reinforcement learning system to control the signal lights in an isolated intersection. The proposed system uses a deep convolutional neural network to extract the crucial features from the environment state that is described by raw traffic information; i.e., vehicles positions, speeds, and waiting times. Besides, the system utilizes a multi-objective reward and the Rainbow agent which provides further space of enhancements to the conventional Deep Q-Networks agent. Extensive experiments illustrate that our proposed deep framework outperforms the baseline under a number of settings and traffic measures, including trip time, waiting time, fuel consumption, and stability.","PeriodicalId":351812,"journal":{"name":"2019 7th International Japan-Africa Conference on Electronics, Communications, and Computations, (JAC-ECC)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Rainbow Deep Reinforcement Learning Agent for Improved Solution of the Traffic Congestion\",\"authors\":\"Mahmoud Nawar, Ahmed M. Fares, A. Al-sammak\",\"doi\":\"10.1109/JAC-ECC48896.2019.9051262\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"While traffic congestion hits severely the world economy, adaptive traffic signal systems would efficiently provide potential solutions. In this paper, we propose a deep reinforcement learning system to control the signal lights in an isolated intersection. The proposed system uses a deep convolutional neural network to extract the crucial features from the environment state that is described by raw traffic information; i.e., vehicles positions, speeds, and waiting times. Besides, the system utilizes a multi-objective reward and the Rainbow agent which provides further space of enhancements to the conventional Deep Q-Networks agent. Extensive experiments illustrate that our proposed deep framework outperforms the baseline under a number of settings and traffic measures, including trip time, waiting time, fuel consumption, and stability.\",\"PeriodicalId\":351812,\"journal\":{\"name\":\"2019 7th International Japan-Africa Conference on Electronics, Communications, and Computations, (JAC-ECC)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 7th International Japan-Africa Conference on Electronics, Communications, and Computations, (JAC-ECC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/JAC-ECC48896.2019.9051262\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 7th International Japan-Africa Conference on Electronics, Communications, and Computations, (JAC-ECC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JAC-ECC48896.2019.9051262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Rainbow Deep Reinforcement Learning Agent for Improved Solution of the Traffic Congestion
While traffic congestion hits severely the world economy, adaptive traffic signal systems would efficiently provide potential solutions. In this paper, we propose a deep reinforcement learning system to control the signal lights in an isolated intersection. The proposed system uses a deep convolutional neural network to extract the crucial features from the environment state that is described by raw traffic information; i.e., vehicles positions, speeds, and waiting times. Besides, the system utilizes a multi-objective reward and the Rainbow agent which provides further space of enhancements to the conventional Deep Q-Networks agent. Extensive experiments illustrate that our proposed deep framework outperforms the baseline under a number of settings and traffic measures, including trip time, waiting time, fuel consumption, and stability.