基于宏观基本图的周边控制和路线引导的迭代自适应动态编程方法

Transportation Science Pub Date : 2024-05-17 DOI:10.1287/trsc.2023.0091

Can Chen, N. Geroliminis, Renxin Zhong

{"title":"基于宏观基本图的周边控制和路线引导的迭代自适应动态编程方法","authors":"Can Chen, N. Geroliminis, Renxin Zhong","doi":"10.1287/trsc.2023.0091","DOIUrl":null,"url":null,"abstract":"Macroscopic fundamental diagrams (MFDs) have been widely adopted to model the traffic flow of large-scale urban networks. Coupling perimeter control and regional route guidance (PCRG) is a promising strategy to decrease congestion heterogeneity and reduce delays in large-scale MFD-based urban networks. For MFD-based PCRG, one needs to distinguish between the dynamics of (a) the plant that represents reality and is used as the simulation tool and (b) the model that contains easier-to-measure states than the plant and is used for devising controllers, that is, the model-plant mismatch should be considered. Traditional model-based methods (e.g., model predictive control (MPC)) require an accurate representation of the plant dynamics as the prediction model. However, because of the inherent network uncertainties, such as uncertain dynamics of heterogeneity and demand disturbance, MFD parameters could be time-varying and uncertain. Conversely, existing data-driven methods (e.g., reinforcement learning) do not consider the model-plant mismatch and the limited access to plant-generated data, for example, subregional OD-specific accumulations. Therefore, we develop an iterative adaptive dynamic programming (IADP)-based method to address the limited data source induced by the model-plant mismatch. An actor-critic neural network structure is developed to circumvent the requirement of complete information on plant dynamics. Performance comparisons with other PCRG schemes under various scenarios are carried out. The numerical results indicate that the IADP controller trained with a limited data source can achieve comparable performance with the “benchmark” MPC approach using perfect measurements from the plant. The results also validate the IADP’s robustness against various uncertainties (e.g., demand noise, MFD error, and trip distance heterogeneity) when minimizing the total time spent in the urban network. These results demonstrate the great potential of the proposed scheme in improving the efficiency of multiregion MFD systems. Funding: The work was jointly funded by the National Natural Science Foundation of China under [Grant 72071214] (R. Zhong), the Dit4Tram project from the European Union’s Horizon 2020 Research and Innovation Programme under [Grant 953783] (N. Geroliminis), and the Research Student Attachment Programme of The Hong Kong Polytechnic University (C. Chen). Supplemental Material: The online appendix is available at https://doi.org/10.1287/trsc.2023.0091 .","PeriodicalId":510068,"journal":{"name":"Transportation Science","volume":"12 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Iterative Adaptive Dynamic Programming Approach for Macroscopic Fundamental Diagram-Based Perimeter Control and Route Guidance\",\"authors\":\"Can Chen, N. Geroliminis, Renxin Zhong\",\"doi\":\"10.1287/trsc.2023.0091\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Macroscopic fundamental diagrams (MFDs) have been widely adopted to model the traffic flow of large-scale urban networks. Coupling perimeter control and regional route guidance (PCRG) is a promising strategy to decrease congestion heterogeneity and reduce delays in large-scale MFD-based urban networks. For MFD-based PCRG, one needs to distinguish between the dynamics of (a) the plant that represents reality and is used as the simulation tool and (b) the model that contains easier-to-measure states than the plant and is used for devising controllers, that is, the model-plant mismatch should be considered. Traditional model-based methods (e.g., model predictive control (MPC)) require an accurate representation of the plant dynamics as the prediction model. However, because of the inherent network uncertainties, such as uncertain dynamics of heterogeneity and demand disturbance, MFD parameters could be time-varying and uncertain. Conversely, existing data-driven methods (e.g., reinforcement learning) do not consider the model-plant mismatch and the limited access to plant-generated data, for example, subregional OD-specific accumulations. Therefore, we develop an iterative adaptive dynamic programming (IADP)-based method to address the limited data source induced by the model-plant mismatch. An actor-critic neural network structure is developed to circumvent the requirement of complete information on plant dynamics. Performance comparisons with other PCRG schemes under various scenarios are carried out. The numerical results indicate that the IADP controller trained with a limited data source can achieve comparable performance with the “benchmark” MPC approach using perfect measurements from the plant. The results also validate the IADP’s robustness against various uncertainties (e.g., demand noise, MFD error, and trip distance heterogeneity) when minimizing the total time spent in the urban network. These results demonstrate the great potential of the proposed scheme in improving the efficiency of multiregion MFD systems. Funding: The work was jointly funded by the National Natural Science Foundation of China under [Grant 72071214] (R. Zhong), the Dit4Tram project from the European Union’s Horizon 2020 Research and Innovation Programme under [Grant 953783] (N. Geroliminis), and the Research Student Attachment Programme of The Hong Kong Polytechnic University (C. Chen). Supplemental Material: The online appendix is available at https://doi.org/10.1287/trsc.2023.0091 .\",\"PeriodicalId\":510068,\"journal\":{\"name\":\"Transportation Science\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1287/trsc.2023.0091\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/trsc.2023.0091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

宏观基本图（MFD）已被广泛用于模拟大规模城市网络的交通流。在基于宏观基本图的大规模城市网络中，将周边控制和区域路线引导（PCRG）结合起来，是减少拥堵异质性和延迟的一种有前途的策略。对于基于 MFD 的 PCRG，需要区分（a）代表现实并用作仿真工具的工厂的动态和（b）包含比工厂更易测量的状态并用于设计控制器的模型的动态，即应考虑模型与工厂的不匹配。传统的基于模型的方法（如模型预测控制 (MPC)）需要将工厂动态精确地表示为预测模型。然而，由于网络固有的不确定性，如不确定的异质性动态和需求干扰，MFD 参数可能是时变和不确定的。相反，现有的数据驱动方法（如强化学习）并没有考虑模型与工厂的不匹配以及获取工厂生成数据（如次区域 OD 特定累积）的有限性。因此，我们开发了一种基于迭代自适应动态编程（IADP）的方法，以解决由模型-植物不匹配引起的数据源有限的问题。我们开发了一种行为批判神经网络结构，以规避对植物动态完整信息的要求。在各种情况下与其他 PCRG 方案进行了性能比较。数值结果表明，使用有限数据源训练的 IADP 控制器可以达到与使用完美工厂测量数据的 "基准" MPC 方法相当的性能。结果还验证了 IADP 在最小化城市网络总耗时时，对各种不确定性（如需求噪声、MFD 误差和行程距离异质性）的鲁棒性。这些结果证明了所提出的方案在提高多区域多式联运系统效率方面的巨大潜力。资助：本研究由国家自然科学基金项目[批准号：72071214]（R. Zhong）、欧盟 "地平线 2020 研究与创新计划 "的 Dit4Tram 项目[批准号：953783]（N. Geroliminis）和香港理工大学研究学生实习计划（C. Chen）共同资助。补充材料：在线附录见 https://doi.org/10.1287/trsc.2023.0091 。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An Iterative Adaptive Dynamic Programming Approach for Macroscopic Fundamental Diagram-Based Perimeter Control and Route Guidance

Macroscopic fundamental diagrams (MFDs) have been widely adopted to model the traffic flow of large-scale urban networks. Coupling perimeter control and regional route guidance (PCRG) is a promising strategy to decrease congestion heterogeneity and reduce delays in large-scale MFD-based urban networks. For MFD-based PCRG, one needs to distinguish between the dynamics of (a) the plant that represents reality and is used as the simulation tool and (b) the model that contains easier-to-measure states than the plant and is used for devising controllers, that is, the model-plant mismatch should be considered. Traditional model-based methods (e.g., model predictive control (MPC)) require an accurate representation of the plant dynamics as the prediction model. However, because of the inherent network uncertainties, such as uncertain dynamics of heterogeneity and demand disturbance, MFD parameters could be time-varying and uncertain. Conversely, existing data-driven methods (e.g., reinforcement learning) do not consider the model-plant mismatch and the limited access to plant-generated data, for example, subregional OD-specific accumulations. Therefore, we develop an iterative adaptive dynamic programming (IADP)-based method to address the limited data source induced by the model-plant mismatch. An actor-critic neural network structure is developed to circumvent the requirement of complete information on plant dynamics. Performance comparisons with other PCRG schemes under various scenarios are carried out. The numerical results indicate that the IADP controller trained with a limited data source can achieve comparable performance with the “benchmark” MPC approach using perfect measurements from the plant. The results also validate the IADP’s robustness against various uncertainties (e.g., demand noise, MFD error, and trip distance heterogeneity) when minimizing the total time spent in the urban network. These results demonstrate the great potential of the proposed scheme in improving the efficiency of multiregion MFD systems. Funding: The work was jointly funded by the National Natural Science Foundation of China under [Grant 72071214] (R. Zhong), the Dit4Tram project from the European Union’s Horizon 2020 Research and Innovation Programme under [Grant 953783] (N. Geroliminis), and the Research Student Attachment Programme of The Hong Kong Polytechnic University (C. Chen). Supplemental Material: The online appendix is available at https://doi.org/10.1287/trsc.2023.0091 .

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Transportation Science

自引率

0.00%

发文量