{"title":"Edge-based Situ-aware Reinforcement Learning for Traffic Congestion Mitigation","authors":"Chen-Yeou Yu, Wensheng Zhang, Carl K. Chang","doi":"10.1109/ISC255366.2022.9922461","DOIUrl":null,"url":null,"abstract":"Traffic congestion may cause elongated travel time, increased fuel consumption and extra pollution. To mitigate congestion, we propose a new approach based on multi-agent reinforcement learning (RL) to learn policies dictating path selections for vehicles. The algorithm utilizes the interactions between RL agents with Q-Learning and edge servers in monitoring traffic at road intersections. As an important difference between this work and existing approaches, we take human desire and realistic rewards into account. Extensive simulation experiments show that the resulting mechanism is promising and more RL agents can be incentive to follow rerouting directions when congestion is detected. Also, this algorithm has comparable performance as the Dynamic Dijkstra Algorithm.","PeriodicalId":277015,"journal":{"name":"2022 IEEE International Smart Cities Conference (ISC2)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Smart Cities Conference (ISC2)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISC255366.2022.9922461","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Traffic congestion may cause elongated travel time, increased fuel consumption and extra pollution. To mitigate congestion, we propose a new approach based on multi-agent reinforcement learning (RL) to learn policies dictating path selections for vehicles. The algorithm utilizes the interactions between RL agents with Q-Learning and edge servers in monitoring traffic at road intersections. As an important difference between this work and existing approaches, we take human desire and realistic rewards into account. Extensive simulation experiments show that the resulting mechanism is promising and more RL agents can be incentive to follow rerouting directions when congestion is detected. Also, this algorithm has comparable performance as the Dynamic Dijkstra Algorithm.