Nasser Parishad, Mehmet Yildirimoglu, Mark Hickman
{"title":"多模态网络中的拥堵定价:深度强化学习的应用","authors":"Nasser Parishad, Mehmet Yildirimoglu, Mark Hickman","doi":"10.1016/j.trc.2025.105166","DOIUrl":null,"url":null,"abstract":"<div><div>Developing a real-time dynamic pricing mechanism that proactively generates toll profiles while incorporating demand elasticity and travellers’ heterogeneity remains a significant challenge. Many existing approaches suffer from low transferability and rely heavily on precise estimation of network parameters such as critical accumulation. This study introduces a data-driven, cordon-based pricing framework using reinforcement learning to optimise traffic flow and address these limitations. A multi-modal, trip-based Macroscopic Fundamental Diagram (MFD) simulation has been developed, capable of capturing individual mode choice decisions. Traveller heterogeneity is addressed through variations in origin and destination, trip length, departure time, and value of time (VoT). To establish a tolling strategy that maximises network outflow (minimises total travel time) and proactively addresses traffic congestion, a Double Deep Q-Network (DDQN) agent has been introduced. Remarkably, without prior knowledge of network parameters, the agent successfully regulates car accumulation at critical levels to maximise network outflow. Sensitivity analysis reveals that even with a 20% margin of error in input data, the agent remains effective in mitigating congestion. Additionally, the agent’s transferability has been evaluated under various traffic conditions and dynamics by introducing different demand profiles and MFD coefficients, demonstrating robust performance. Benchmark comparisons with a feedback controller across all scenarios further confirm that the DDQN agent consistently outperforms.</div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"177 ","pages":"Article 105166"},"PeriodicalIF":7.6000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Congestion pricing in multi-modal networks: An application of deep reinforcement learning\",\"authors\":\"Nasser Parishad, Mehmet Yildirimoglu, Mark Hickman\",\"doi\":\"10.1016/j.trc.2025.105166\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Developing a real-time dynamic pricing mechanism that proactively generates toll profiles while incorporating demand elasticity and travellers’ heterogeneity remains a significant challenge. Many existing approaches suffer from low transferability and rely heavily on precise estimation of network parameters such as critical accumulation. This study introduces a data-driven, cordon-based pricing framework using reinforcement learning to optimise traffic flow and address these limitations. A multi-modal, trip-based Macroscopic Fundamental Diagram (MFD) simulation has been developed, capable of capturing individual mode choice decisions. Traveller heterogeneity is addressed through variations in origin and destination, trip length, departure time, and value of time (VoT). To establish a tolling strategy that maximises network outflow (minimises total travel time) and proactively addresses traffic congestion, a Double Deep Q-Network (DDQN) agent has been introduced. Remarkably, without prior knowledge of network parameters, the agent successfully regulates car accumulation at critical levels to maximise network outflow. Sensitivity analysis reveals that even with a 20% margin of error in input data, the agent remains effective in mitigating congestion. Additionally, the agent’s transferability has been evaluated under various traffic conditions and dynamics by introducing different demand profiles and MFD coefficients, demonstrating robust performance. Benchmark comparisons with a feedback controller across all scenarios further confirm that the DDQN agent consistently outperforms.</div></div>\",\"PeriodicalId\":54417,\"journal\":{\"name\":\"Transportation Research Part C-Emerging Technologies\",\"volume\":\"177 \",\"pages\":\"Article 105166\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part C-Emerging Technologies\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0968090X25001706\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X25001706","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
Congestion pricing in multi-modal networks: An application of deep reinforcement learning
Developing a real-time dynamic pricing mechanism that proactively generates toll profiles while incorporating demand elasticity and travellers’ heterogeneity remains a significant challenge. Many existing approaches suffer from low transferability and rely heavily on precise estimation of network parameters such as critical accumulation. This study introduces a data-driven, cordon-based pricing framework using reinforcement learning to optimise traffic flow and address these limitations. A multi-modal, trip-based Macroscopic Fundamental Diagram (MFD) simulation has been developed, capable of capturing individual mode choice decisions. Traveller heterogeneity is addressed through variations in origin and destination, trip length, departure time, and value of time (VoT). To establish a tolling strategy that maximises network outflow (minimises total travel time) and proactively addresses traffic congestion, a Double Deep Q-Network (DDQN) agent has been introduced. Remarkably, without prior knowledge of network parameters, the agent successfully regulates car accumulation at critical levels to maximise network outflow. Sensitivity analysis reveals that even with a 20% margin of error in input data, the agent remains effective in mitigating congestion. Additionally, the agent’s transferability has been evaluated under various traffic conditions and dynamics by introducing different demand profiles and MFD coefficients, demonstrating robust performance. Benchmark comparisons with a feedback controller across all scenarios further confirm that the DDQN agent consistently outperforms.
期刊介绍:
Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.