Traffic efficiency and fairness optimisation for autonomous intersection management based on reinforcement learning
Authors: Yuanyuan Wu, David Z. W. Wang, Feng Zhu
Journal: Transportmetrica A-Transport Science, Volume 21, Issue 1, Pages 247-271 (Journal Article, published 2025-01-02)
DOI: 10.1080/23249935.2023.2232047
URL: https://www.sciencedirect.com/org/science/article/pii/S2324993523001859
Impact factor: 3.6 (JCR Q2, Transportation)
Autonomous Intersection Management (AIM) for high-level Connected and Automated Vehicles (CAVs) has evolved from rule-based to optimisation-based policies. However, at congested major-minor intersections, optimising solely for efficiency can negatively impact vehicle fairness. This study addresses this issue by proposing a deep reinforcement learning approach that optimises both traffic efficiency and fairness for AIM. In the modelled multi-objective Markov decision process, traffic fairness is measured by the difference between the crossing order and the approaching order of CAVs, while traffic efficiency is measured by average travel time. Because the preferences over the objectives are unknown, the Bellman optimality equation is generalised to obtain the optimal policies over the space of all possible preferences during the iterative training process. The effectiveness of the proposed method is evaluated at a simulated real-world intersection and compared with three benchmark policies, including the fairest policy for AIM: first-come-first-served. The learned policies perform best in reducing overall average vehicle delay and demonstrate outstanding performance in balancing traffic fairness and efficiency.
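The abstract names the two objectives but not their exact formulas, so the following is only a minimal sketch of one plausible reading. It assumes fairness is scored as the mean absolute difference between each vehicle's rank in the approaching order and its rank in the crossing order, that efficiency is the plain mean of travel times, and that a preference is a pair of non-negative weights used for linear scalarisation. The function names (`order_deviation`, `average_travel_time`, `scalarised_cost`) and the vehicle IDs are hypothetical and are not taken from the paper.

```python
from typing import Sequence, Tuple


def order_deviation(approach_order: Sequence[str],
                    crossing_order: Sequence[str]) -> float:
    """Mean absolute rank difference between approaching and crossing order.

    Assumed fairness measure: 0.0 means vehicles cross exactly in the order
    they approached (first-come-first-served); larger values mean more
    reordering.
    """
    if not approach_order:
        raise ValueError("at least one vehicle is required")
    if sorted(approach_order) != sorted(crossing_order):
        raise ValueError("both orders must contain the same vehicles")
    # Map each vehicle ID to its position in the crossing sequence.
    crossing_rank = {vid: rank for rank, vid in enumerate(crossing_order)}
    total = sum(abs(rank - crossing_rank[vid])
                for rank, vid in enumerate(approach_order))
    return total / len(approach_order)


def average_travel_time(travel_times: Sequence[float]) -> float:
    """Assumed efficiency measure: arithmetic mean of travel times (seconds)."""
    if not travel_times:
        raise ValueError("at least one travel time is required")
    return sum(travel_times) / len(travel_times)


def scalarised_cost(avg_time: float, deviation: float,
                    preference: Tuple[float, float]) -> float:
    """Combine both objectives with an assumed linear preference weighting."""
    w_efficiency, w_fairness = preference
    return w_efficiency * avg_time + w_fairness * deviation


# Hypothetical example: vehicles b and c swap places while crossing.
approach = ["a", "b", "c", "d"]
crossing = ["a", "c", "b", "d"]
deviation = order_deviation(approach, crossing)      # 0.5
avg_time = average_travel_time([10.0, 20.0, 30.0])   # 20.0
cost = scalarised_cost(avg_time, deviation, (0.5, 0.5))
```

Under this reading, first-come-first-served always yields a deviation of 0.0, which matches its description above as the fairest benchmark. Sweeping the preference pair traces the trade-off between the two objectives.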
About the journal:
Transportmetrica A provides a forum for original discourse in transport science. The international journal's focus is on the scientific approach to transport research methodology and empirical analysis of moving people and goods. Papers related to all aspects of transportation are welcome. A rigorous peer review that involves editor screening and anonymous refereeing for submitted articles facilitates quality output.