{"title":"不确定条件下跑道容量管理优化的随机无模型强化学习框架","authors":"Lucas Orbolato Carvalho, Mayara Condé Rocha Murça","doi":"10.1016/j.tra.2025.104620","DOIUrl":null,"url":null,"abstract":"<div><div>Air traffic operations are often subject to congestion due to rising air travel demand levels and capacity limitations at airport and airspace resources. These capacity constraints are frequently exacerbated by adverse weather conditions, one of the primary causes of flight delays and additional operational costs. To mitigate the impact of demand-capacity imbalances on overall aviation system performance, there is a pressing need for more advanced Air Traffic Flow Management (ATFM) processes, which must be able to better address the complexities and challenges arising from dynamic and stochastic operational environments. In recent years, machine learning techniques have emerged as promising tools to enhance ATFM decision-making, offering potential solutions to these challenges. This study investigates the application of different reinforcement learning (RL) approaches and algorithms for runway capacity management under uncertainty, including both runway configuration selection and airport service rate allocation decisions. The problem is formulated as a Markov Decision Process (MDP), and two approaches are proposed: data-based and forecast-based. Both approaches leverage a state-of-the-art model-free RL method, with the Maskable Proximal Policy Optimization (PPO) algorithm, which is compared to a traditional RL algorithm - Deep Q-Network (DQN). The results reveal that both algorithms perform similarly, with our stochastic forecast-based and incremental data-driven approaches outperforming traditional methods. These approaches offer notable reductions in delay costs compared to the baseline policy typically used in practice and yield results comparable to the best theoretical solutions derived from genetic algorithms. This study highlights two efficient methods for addressing runway capacity management challenges at airports and provides valuable insights into data-driven ATFM optimization and policy implications.</div></div>","PeriodicalId":49421,"journal":{"name":"Transportation Research Part A-Policy and Practice","volume":"200 ","pages":"Article 104620"},"PeriodicalIF":6.8000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A stochastic model-free reinforcement learning framework for optimizing runway capacity management under uncertainty\",\"authors\":\"Lucas Orbolato Carvalho, Mayara Condé Rocha Murça\",\"doi\":\"10.1016/j.tra.2025.104620\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Air traffic operations are often subject to congestion due to rising air travel demand levels and capacity limitations at airport and airspace resources. These capacity constraints are frequently exacerbated by adverse weather conditions, one of the primary causes of flight delays and additional operational costs. To mitigate the impact of demand-capacity imbalances on overall aviation system performance, there is a pressing need for more advanced Air Traffic Flow Management (ATFM) processes, which must be able to better address the complexities and challenges arising from dynamic and stochastic operational environments. In recent years, machine learning techniques have emerged as promising tools to enhance ATFM decision-making, offering potential solutions to these challenges. This study investigates the application of different reinforcement learning (RL) approaches and algorithms for runway capacity management under uncertainty, including both runway configuration selection and airport service rate allocation decisions. The problem is formulated as a Markov Decision Process (MDP), and two approaches are proposed: data-based and forecast-based. Both approaches leverage a state-of-the-art model-free RL method, with the Maskable Proximal Policy Optimization (PPO) algorithm, which is compared to a traditional RL algorithm - Deep Q-Network (DQN). The results reveal that both algorithms perform similarly, with our stochastic forecast-based and incremental data-driven approaches outperforming traditional methods. These approaches offer notable reductions in delay costs compared to the baseline policy typically used in practice and yield results comparable to the best theoretical solutions derived from genetic algorithms. This study highlights two efficient methods for addressing runway capacity management challenges at airports and provides valuable insights into data-driven ATFM optimization and policy implications.</div></div>\",\"PeriodicalId\":49421,\"journal\":{\"name\":\"Transportation Research Part A-Policy and Practice\",\"volume\":\"200 \",\"pages\":\"Article 104620\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part A-Policy and Practice\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0965856425002484\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part A-Policy and Practice","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0965856425002484","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
A stochastic model-free reinforcement learning framework for optimizing runway capacity management under uncertainty
Air traffic operations are often subject to congestion due to rising air travel demand levels and capacity limitations at airport and airspace resources. These capacity constraints are frequently exacerbated by adverse weather conditions, one of the primary causes of flight delays and additional operational costs. To mitigate the impact of demand-capacity imbalances on overall aviation system performance, there is a pressing need for more advanced Air Traffic Flow Management (ATFM) processes, which must be able to better address the complexities and challenges arising from dynamic and stochastic operational environments. In recent years, machine learning techniques have emerged as promising tools to enhance ATFM decision-making, offering potential solutions to these challenges. This study investigates the application of different reinforcement learning (RL) approaches and algorithms for runway capacity management under uncertainty, including both runway configuration selection and airport service rate allocation decisions. The problem is formulated as a Markov Decision Process (MDP), and two approaches are proposed: data-based and forecast-based. Both approaches leverage a state-of-the-art model-free RL method, with the Maskable Proximal Policy Optimization (PPO) algorithm, which is compared to a traditional RL algorithm - Deep Q-Network (DQN). The results reveal that both algorithms perform similarly, with our stochastic forecast-based and incremental data-driven approaches outperforming traditional methods. These approaches offer notable reductions in delay costs compared to the baseline policy typically used in practice and yield results comparable to the best theoretical solutions derived from genetic algorithms. This study highlights two efficient methods for addressing runway capacity management challenges at airports and provides valuable insights into data-driven ATFM optimization and policy implications.
期刊介绍:
Transportation Research: Part A contains papers of general interest in all passenger and freight transportation modes: policy analysis, formulation and evaluation; planning; interaction with the political, socioeconomic and physical environment; design, management and evaluation of transportation systems. Topics are approached from any discipline or perspective: economics, engineering, sociology, psychology, etc. Case studies, survey and expository papers are included, as are articles which contribute to unification of the field, or to an understanding of the comparative aspects of different systems. Papers which assess the scope for technological innovation within a social or political framework are also published. The journal is international, and places equal emphasis on the problems of industrialized and non-industrialized regions.
Part A''s aims and scope are complementary to Transportation Research Part B: Methodological, Part C: Emerging Technologies and Part D: Transport and Environment. Part E: Logistics and Transportation Review. Part F: Traffic Psychology and Behaviour. The complete set forms the most cohesive and comprehensive reference of current research in transportation science.