{"title":"基于边缘的态势感知强化学习缓解交通拥堵","authors":"Chen-Yeou Yu, Wensheng Zhang, Carl K. Chang","doi":"10.1109/ISC255366.2022.9922461","DOIUrl":null,"url":null,"abstract":"Traffic congestion may cause elongated travel time, increased fuel consumption and extra pollution. To mitigate congestion, we propose a new approach based on multi-agent reinforcement learning (RL) to learn policies dictating path selections for vehicles. The algorithm utilizes the interactions between RL agents with Q-Learning and edge servers in monitoring traffic at road intersections. As an important difference between this work and existing approaches, we take human desire and realistic rewards into account. Extensive simulation experiments show that the resulting mechanism is promising and more RL agents can be incentive to follow rerouting directions when congestion is detected. Also, this algorithm has comparable performance as the Dynamic Dijkstra Algorithm.","PeriodicalId":277015,"journal":{"name":"2022 IEEE International Smart Cities Conference (ISC2)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Edge-based Situ-aware Reinforcement Learning for Traffic Congestion Mitigation\",\"authors\":\"Chen-Yeou Yu, Wensheng Zhang, Carl K. Chang\",\"doi\":\"10.1109/ISC255366.2022.9922461\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traffic congestion may cause elongated travel time, increased fuel consumption and extra pollution. To mitigate congestion, we propose a new approach based on multi-agent reinforcement learning (RL) to learn policies dictating path selections for vehicles. The algorithm utilizes the interactions between RL agents with Q-Learning and edge servers in monitoring traffic at road intersections. As an important difference between this work and existing approaches, we take human desire and realistic rewards into account. Extensive simulation experiments show that the resulting mechanism is promising and more RL agents can be incentive to follow rerouting directions when congestion is detected. Also, this algorithm has comparable performance as the Dynamic Dijkstra Algorithm.\",\"PeriodicalId\":277015,\"journal\":{\"name\":\"2022 IEEE International Smart Cities Conference (ISC2)\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Smart Cities Conference (ISC2)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISC255366.2022.9922461\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Smart Cities Conference (ISC2)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISC255366.2022.9922461","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Edge-based Situ-aware Reinforcement Learning for Traffic Congestion Mitigation
Traffic congestion may cause elongated travel time, increased fuel consumption and extra pollution. To mitigate congestion, we propose a new approach based on multi-agent reinforcement learning (RL) to learn policies dictating path selections for vehicles. The algorithm utilizes the interactions between RL agents with Q-Learning and edge servers in monitoring traffic at road intersections. As an important difference between this work and existing approaches, we take human desire and realistic rewards into account. Extensive simulation experiments show that the resulting mechanism is promising and more RL agents can be incentive to follow rerouting directions when congestion is detected. Also, this algorithm has comparable performance as the Dynamic Dijkstra Algorithm.