Maarten Beltman , Marta Ribeiro , Jasper de Wilde , Junzi Sun
{"title":"利用监督学习动态预测单个航班的航班起飞延误概率分布","authors":"Maarten Beltman , Marta Ribeiro , Jasper de Wilde , Junzi Sun","doi":"10.1016/j.jairtraman.2025.102788","DOIUrl":null,"url":null,"abstract":"<div><div>Punctuality is a key performance indicator for any airline, especially hub-and-spoke airlines, given their focus on short passenger connections. Flights that are delayed at departure need to compensate for lost time whilst airborne. Because fuelling takes place well before scheduled departure, predicted departure delays determine the planned fuel amounts for en-route speed optimization. To prevent unnecessary fuel burn, airlines benefit from highly accurate departure delay predictions. This study aims to extend previous work on airline departure delay forecasting to a dynamic and probabilistic domain, whilst incorporating novel day-of-operations airline information to further minimize prediction errors. Random Forest, CatBoost, and Deep Neural Network models are proposed for a case study on departure flights of a major hub-and-spoke airline from its hub airport between 1 January 2020 and 1 August 2023. The Random Forest model is selected for its probabilistic performance and high accuracy in predicting delays between 5 and 25 min, for which en-route speed optimization has the largest effect. At the 90 min prediction horizon, the model reaches a Mean Absolute Error of 8.46 min and a Root Mean Square Error of 11.91 min. For 76% of flights, the actual delay is within the predicted probability distribution range. Finally, this study puts a strong emphasis on explainability. Flight dispatchers are therefore provided with the main factors impacting the prediction, explaining the context of the flight. The versatility of the model is demonstrated in two shadow runs within the procedures of an international airline, where delays caused by familiar and unfamiliar factors were successfully predicted.</div></div>","PeriodicalId":14925,"journal":{"name":"Journal of Air Transport Management","volume":"126 ","pages":"Article 102788"},"PeriodicalIF":3.9000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dynamically forecasting airline departure delay probability distributions for individual flights using supervised learning\",\"authors\":\"Maarten Beltman , Marta Ribeiro , Jasper de Wilde , Junzi Sun\",\"doi\":\"10.1016/j.jairtraman.2025.102788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Punctuality is a key performance indicator for any airline, especially hub-and-spoke airlines, given their focus on short passenger connections. Flights that are delayed at departure need to compensate for lost time whilst airborne. Because fuelling takes place well before scheduled departure, predicted departure delays determine the planned fuel amounts for en-route speed optimization. To prevent unnecessary fuel burn, airlines benefit from highly accurate departure delay predictions. This study aims to extend previous work on airline departure delay forecasting to a dynamic and probabilistic domain, whilst incorporating novel day-of-operations airline information to further minimize prediction errors. Random Forest, CatBoost, and Deep Neural Network models are proposed for a case study on departure flights of a major hub-and-spoke airline from its hub airport between 1 January 2020 and 1 August 2023. The Random Forest model is selected for its probabilistic performance and high accuracy in predicting delays between 5 and 25 min, for which en-route speed optimization has the largest effect. At the 90 min prediction horizon, the model reaches a Mean Absolute Error of 8.46 min and a Root Mean Square Error of 11.91 min. For 76% of flights, the actual delay is within the predicted probability distribution range. Finally, this study puts a strong emphasis on explainability. Flight dispatchers are therefore provided with the main factors impacting the prediction, explaining the context of the flight. The versatility of the model is demonstrated in two shadow runs within the procedures of an international airline, where delays caused by familiar and unfamiliar factors were successfully predicted.</div></div>\",\"PeriodicalId\":14925,\"journal\":{\"name\":\"Journal of Air Transport Management\",\"volume\":\"126 \",\"pages\":\"Article 102788\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Air Transport Management\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0969699725000511\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Air Transport Management","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0969699725000511","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TRANSPORTATION","Score":null,"Total":0}
Dynamically forecasting airline departure delay probability distributions for individual flights using supervised learning
Punctuality is a key performance indicator for any airline, especially hub-and-spoke airlines, given their focus on short passenger connections. Flights that are delayed at departure need to compensate for lost time whilst airborne. Because fuelling takes place well before scheduled departure, predicted departure delays determine the planned fuel amounts for en-route speed optimization. To prevent unnecessary fuel burn, airlines benefit from highly accurate departure delay predictions. This study aims to extend previous work on airline departure delay forecasting to a dynamic and probabilistic domain, whilst incorporating novel day-of-operations airline information to further minimize prediction errors. Random Forest, CatBoost, and Deep Neural Network models are proposed for a case study on departure flights of a major hub-and-spoke airline from its hub airport between 1 January 2020 and 1 August 2023. The Random Forest model is selected for its probabilistic performance and high accuracy in predicting delays between 5 and 25 min, for which en-route speed optimization has the largest effect. At the 90 min prediction horizon, the model reaches a Mean Absolute Error of 8.46 min and a Root Mean Square Error of 11.91 min. For 76% of flights, the actual delay is within the predicted probability distribution range. Finally, this study puts a strong emphasis on explainability. Flight dispatchers are therefore provided with the main factors impacting the prediction, explaining the context of the flight. The versatility of the model is demonstrated in two shadow runs within the procedures of an international airline, where delays caused by familiar and unfamiliar factors were successfully predicted.
期刊介绍:
The Journal of Air Transport Management (JATM) sets out to address, through high quality research articles and authoritative commentary, the major economic, management and policy issues facing the air transport industry today. It offers practitioners and academics an international and dynamic forum for analysis and discussion of these issues, linking research and practice and stimulating interaction between the two. The refereed papers in the journal cover all the major sectors of the industry (airlines, airports, air traffic management) as well as related areas such as tourism management and logistics. Papers are blind reviewed, normally by two referees, chosen for their specialist knowledge. The journal provides independent, original and rigorous analysis in the areas of: • Policy, regulation and law • Strategy • Operations • Marketing • Economics and finance • Sustainability