不确定条件下跑道容量管理优化的随机无模型强化学习框架

IF 6.8 1区工程技术 Q1 ECONOMICS

Transportation Research Part A-Policy and Practice Pub Date : 2025-08-19 DOI:10.1016/j.tra.2025.104620

Lucas Orbolato Carvalho, Mayara Condé Rocha Murça

{"title":"不确定条件下跑道容量管理优化的随机无模型强化学习框架","authors":"Lucas Orbolato Carvalho, Mayara Condé Rocha Murça","doi":"10.1016/j.tra.2025.104620","DOIUrl":null,"url":null,"abstract":"<div><div>Air traffic operations are often subject to congestion due to rising air travel demand levels and capacity limitations at airport and airspace resources. These capacity constraints are frequently exacerbated by adverse weather conditions, one of the primary causes of flight delays and additional operational costs. To mitigate the impact of demand-capacity imbalances on overall aviation system performance, there is a pressing need for more advanced Air Traffic Flow Management (ATFM) processes, which must be able to better address the complexities and challenges arising from dynamic and stochastic operational environments. In recent years, machine learning techniques have emerged as promising tools to enhance ATFM decision-making, offering potential solutions to these challenges. This study investigates the application of different reinforcement learning (RL) approaches and algorithms for runway capacity management under uncertainty, including both runway configuration selection and airport service rate allocation decisions. The problem is formulated as a Markov Decision Process (MDP), and two approaches are proposed: data-based and forecast-based. Both approaches leverage a state-of-the-art model-free RL method, with the Maskable Proximal Policy Optimization (PPO) algorithm, which is compared to a traditional RL algorithm - Deep Q-Network (DQN). The results reveal that both algorithms perform similarly, with our stochastic forecast-based and incremental data-driven approaches outperforming traditional methods. These approaches offer notable reductions in delay costs compared to the baseline policy typically used in practice and yield results comparable to the best theoretical solutions derived from genetic algorithms. This study highlights two efficient methods for addressing runway capacity management challenges at airports and provides valuable insights into data-driven ATFM optimization and policy implications.</div></div>","PeriodicalId":49421,"journal":{"name":"Transportation Research Part A-Policy and Practice","volume":"200 ","pages":"Article 104620"},"PeriodicalIF":6.8000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A stochastic model-free reinforcement learning framework for optimizing runway capacity management under uncertainty\",\"authors\":\"Lucas Orbolato Carvalho, Mayara Condé Rocha Murça\",\"doi\":\"10.1016/j.tra.2025.104620\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Air traffic operations are often subject to congestion due to rising air travel demand levels and capacity limitations at airport and airspace resources. These capacity constraints are frequently exacerbated by adverse weather conditions, one of the primary causes of flight delays and additional operational costs. To mitigate the impact of demand-capacity imbalances on overall aviation system performance, there is a pressing need for more advanced Air Traffic Flow Management (ATFM) processes, which must be able to better address the complexities and challenges arising from dynamic and stochastic operational environments. In recent years, machine learning techniques have emerged as promising tools to enhance ATFM decision-making, offering potential solutions to these challenges. This study investigates the application of different reinforcement learning (RL) approaches and algorithms for runway capacity management under uncertainty, including both runway configuration selection and airport service rate allocation decisions. The problem is formulated as a Markov Decision Process (MDP), and two approaches are proposed: data-based and forecast-based. Both approaches leverage a state-of-the-art model-free RL method, with the Maskable Proximal Policy Optimization (PPO) algorithm, which is compared to a traditional RL algorithm - Deep Q-Network (DQN). The results reveal that both algorithms perform similarly, with our stochastic forecast-based and incremental data-driven approaches outperforming traditional methods. These approaches offer notable reductions in delay costs compared to the baseline policy typically used in practice and yield results comparable to the best theoretical solutions derived from genetic algorithms. This study highlights two efficient methods for addressing runway capacity management challenges at airports and provides valuable insights into data-driven ATFM optimization and policy implications.</div></div>\",\"PeriodicalId\":49421,\"journal\":{\"name\":\"Transportation Research Part A-Policy and Practice\",\"volume\":\"200 \",\"pages\":\"Article 104620\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part A-Policy and Practice\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0965856425002484\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part A-Policy and Practice","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0965856425002484","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 0

摘要

由于航空旅行需求水平的上升以及机场和空域资源的容量限制，空中交通运营经常受到拥堵的影响。恶劣的天气条件往往加剧了这些能力限制，这是造成航班延误和额外运营成本的主要原因之一。为了减轻需求-容量失衡对整个航空系统性能的影响，迫切需要更先进的空中交通流量管理（ATFM）流程，该流程必须能够更好地应对动态和随机操作环境带来的复杂性和挑战。近年来，机器学习技术已经成为增强ATFM决策的有前途的工具，为这些挑战提供了潜在的解决方案。本研究探讨了不同的强化学习（RL）方法和算法在不确定条件下跑道容量管理中的应用，包括跑道配置选择和机场服务费率分配决策。将该问题表述为马尔可夫决策过程（MDP），并提出了基于数据和基于预测的两种方法。这两种方法都利用了最先进的无模型RL方法，以及可屏蔽的近端策略优化（PPO）算法，该算法与传统的RL算法Deep Q-Network （DQN）进行了比较。结果表明，两种算法的表现相似，我们基于随机预测和增量数据驱动的方法优于传统方法。与实践中通常使用的基准策略相比，这些方法显著降低了延迟成本，并产生了与遗传算法得出的最佳理论解决方案相当的结果。本研究强调了解决机场跑道容量管理挑战的两种有效方法，并为数据驱动的ATFM优化和政策影响提供了有价值的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A stochastic model-free reinforcement learning framework for optimizing runway capacity management under uncertainty

Air traffic operations are often subject to congestion due to rising air travel demand levels and capacity limitations at airport and airspace resources. These capacity constraints are frequently exacerbated by adverse weather conditions, one of the primary causes of flight delays and additional operational costs. To mitigate the impact of demand-capacity imbalances on overall aviation system performance, there is a pressing need for more advanced Air Traffic Flow Management (ATFM) processes, which must be able to better address the complexities and challenges arising from dynamic and stochastic operational environments. In recent years, machine learning techniques have emerged as promising tools to enhance ATFM decision-making, offering potential solutions to these challenges. This study investigates the application of different reinforcement learning (RL) approaches and algorithms for runway capacity management under uncertainty, including both runway configuration selection and airport service rate allocation decisions. The problem is formulated as a Markov Decision Process (MDP), and two approaches are proposed: data-based and forecast-based. Both approaches leverage a state-of-the-art model-free RL method, with the Maskable Proximal Policy Optimization (PPO) algorithm, which is compared to a traditional RL algorithm - Deep Q-Network (DQN). The results reveal that both algorithms perform similarly, with our stochastic forecast-based and incremental data-driven approaches outperforming traditional methods. These approaches offer notable reductions in delay costs compared to the baseline policy typically used in practice and yield results comparable to the best theoretical solutions derived from genetic algorithms. This study highlights two efficient methods for addressing runway capacity management challenges at airports and provides valuable insights into data-driven ATFM optimization and policy implications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Transportation Research Part A-Policy and Practice 工程技术-运输科技

CiteScore

13.20

自引率

7.80%

发文量

257

审稿时长

9.8 months

期刊介绍： Transportation Research: Part A contains papers of general interest in all passenger and freight transportation modes: policy analysis, formulation and evaluation; planning; interaction with the political, socioeconomic and physical environment; design, management and evaluation of transportation systems. Topics are approached from any discipline or perspective: economics, engineering, sociology, psychology, etc. Case studies, survey and expository papers are included, as are articles which contribute to unification of the field, or to an understanding of the comparative aspects of different systems. Papers which assess the scope for technological innovation within a social or political framework are also published. The journal is international, and places equal emphasis on the problems of industrialized and non-industrialized regions. Part A''s aims and scope are complementary to Transportation Research Part B: Methodological, Part C: Emerging Technologies and Part D: Transport and Environment. Part E: Logistics and Transportation Review. Part F: Traffic Psychology and Behaviour. The complete set forms the most cohesive and comprehensive reference of current research in transportation science.