Juan Pineda-Jaramillo , Claudia Munoz , Rodrigo Mesa-Arango , Carlos Gonzalez-Calderon , Anne Lange
{"title":"Integrating multiple data sources for improved flight delay prediction using explainable machine learning","authors":"Juan Pineda-Jaramillo , Claudia Munoz , Rodrigo Mesa-Arango , Carlos Gonzalez-Calderon , Anne Lange","doi":"10.1016/j.rtbm.2024.101161","DOIUrl":null,"url":null,"abstract":"<div><p>Flight delays negatively impact costs, customer satisfaction, and revenue in the aviation industry. As a result, it is critical to identify the factors that cause flight delays for each airport, as they can vary depending on various attributes associated with their operations.</p><p>This study proposes an explainable artificial intelligence (xAI) methodology for identifying the features that affect airport delays by integrating data from multiple sources and implementing explainable artificial intelligence. The methodology incorporates operational data, airport information, geographic data, and weather data combined and used to train a series of machine learning models. Furthermore, the SHAP and Sobol techniques are used to thoroughly analyze the features that influence flight delays for the specific case of the airport in Santiago, Chile.</p><p>The results show that a linear discriminant analysis model is best suited for predicting flight delays in this specific case study, and the features that have the most significant impact on delays are the international flight status, average temperature at the destination airport, wind speed, and average temperature at Santiago airport.</p><p>The proposed methodology could be applied by airlines that can collect data from multiple sources and conduct similar investigations, leading to the development of a decision support system to make better-informed decisions and reduce the impact of flight delays.</p></div>","PeriodicalId":47453,"journal":{"name":"Research in Transportation Business and Management","volume":"56 ","pages":"Article 101161"},"PeriodicalIF":4.1000,"publicationDate":"2024-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2210539524000634/pdfft?md5=959075ae048a998d7e8597ff18d4e5e0&pid=1-s2.0-S2210539524000634-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in Transportation Business and Management","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210539524000634","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BUSINESS","Score":null,"Total":0}
引用次数: 0
Abstract
Flight delays negatively impact costs, customer satisfaction, and revenue in the aviation industry. As a result, it is critical to identify the factors that cause flight delays for each airport, as they can vary depending on various attributes associated with their operations.
This study proposes an explainable artificial intelligence (xAI) methodology for identifying the features that affect airport delays by integrating data from multiple sources and implementing explainable artificial intelligence. The methodology incorporates operational data, airport information, geographic data, and weather data combined and used to train a series of machine learning models. Furthermore, the SHAP and Sobol techniques are used to thoroughly analyze the features that influence flight delays for the specific case of the airport in Santiago, Chile.
The results show that a linear discriminant analysis model is best suited for predicting flight delays in this specific case study, and the features that have the most significant impact on delays are the international flight status, average temperature at the destination airport, wind speed, and average temperature at Santiago airport.
The proposed methodology could be applied by airlines that can collect data from multiple sources and conduct similar investigations, leading to the development of a decision support system to make better-informed decisions and reduce the impact of flight delays.
期刊介绍:
Research in Transportation Business & Management (RTBM) will publish research on international aspects of transport management such as business strategy, communication, sustainability, finance, human resource management, law, logistics, marketing, franchising, privatisation and commercialisation. Research in Transportation Business & Management welcomes proposals for themed volumes from scholars in management, in relation to all modes of transport. Issues should be cross-disciplinary for one mode or single-disciplinary for all modes. We are keen to receive proposals that combine and integrate theories and concepts that are taken from or can be traced to origins in different disciplines or lessons learned from different modes and approaches to the topic. By facilitating the development of interdisciplinary or intermodal concepts, theories and ideas, and by synthesizing these for the journal''s audience, we seek to contribute to both scholarly advancement of knowledge and the state of managerial practice. Potential volume themes include: -Sustainability and Transportation Management- Transport Management and the Reduction of Transport''s Carbon Footprint- Marketing Transport/Branding Transportation- Benchmarking, Performance Measurement and Best Practices in Transport Operations- Franchising, Concessions and Alternate Governance Mechanisms for Transport Organisations- Logistics and the Integration of Transportation into Freight Supply Chains- Risk Management (or Asset Management or Transportation Finance or ...): Lessons from Multiple Modes- Engaging the Stakeholder in Transportation Governance- Reliability in the Freight Sector