{"title":"Intelligent Traffic Control System using Deep Reinforcement Learning","authors":"A. R, M. Krishnan, Akshay Kekuda","doi":"10.1109/ICITIIT54346.2022.9744226","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a deep reinforcement learning based traffic signal controller. We use the recently developed Distributional Reinforcement Learning with Quantile Regression (QR-DQN) algorithm to design a risk-sensitive approach to traffic signal control. A neural network is used to estimate the value distribution of state-action pairs. A novel control policy that gives variable weightage to the risk of an action depending on the congestion state of the system, effectively minimizes congestion in the network. Our results show that our algorithm outperforms conventional approaches and also classic RL based ones.","PeriodicalId":184353,"journal":{"name":"2022 International Conference on Innovative Trends in Information Technology (ICITIIT)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Innovative Trends in Information Technology (ICITIIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITIIT54346.2022.9744226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we propose a deep reinforcement learning based traffic signal controller. We use the recently developed Distributional Reinforcement Learning with Quantile Regression (QR-DQN) algorithm to design a risk-sensitive approach to traffic signal control. A neural network is used to estimate the value distribution of state-action pairs. A novel control policy that gives variable weightage to the risk of an action depending on the congestion state of the system, effectively minimizes congestion in the network. Our results show that our algorithm outperforms conventional approaches and also classic RL based ones.