Integrating multiple data sources for improved flight delay prediction using explainable machine learning

IF 4.1 2区 工程技术 Q2 BUSINESS
Juan Pineda-Jaramillo , Claudia Munoz , Rodrigo Mesa-Arango , Carlos Gonzalez-Calderon , Anne Lange
{"title":"Integrating multiple data sources for improved flight delay prediction using explainable machine learning","authors":"Juan Pineda-Jaramillo ,&nbsp;Claudia Munoz ,&nbsp;Rodrigo Mesa-Arango ,&nbsp;Carlos Gonzalez-Calderon ,&nbsp;Anne Lange","doi":"10.1016/j.rtbm.2024.101161","DOIUrl":null,"url":null,"abstract":"<div><p>Flight delays negatively impact costs, customer satisfaction, and revenue in the aviation industry. As a result, it is critical to identify the factors that cause flight delays for each airport, as they can vary depending on various attributes associated with their operations.</p><p>This study proposes an explainable artificial intelligence (xAI) methodology for identifying the features that affect airport delays by integrating data from multiple sources and implementing explainable artificial intelligence. The methodology incorporates operational data, airport information, geographic data, and weather data combined and used to train a series of machine learning models. Furthermore, the SHAP and Sobol techniques are used to thoroughly analyze the features that influence flight delays for the specific case of the airport in Santiago, Chile.</p><p>The results show that a linear discriminant analysis model is best suited for predicting flight delays in this specific case study, and the features that have the most significant impact on delays are the international flight status, average temperature at the destination airport, wind speed, and average temperature at Santiago airport.</p><p>The proposed methodology could be applied by airlines that can collect data from multiple sources and conduct similar investigations, leading to the development of a decision support system to make better-informed decisions and reduce the impact of flight delays.</p></div>","PeriodicalId":47453,"journal":{"name":"Research in Transportation Business and Management","volume":"56 ","pages":"Article 101161"},"PeriodicalIF":4.1000,"publicationDate":"2024-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2210539524000634/pdfft?md5=959075ae048a998d7e8597ff18d4e5e0&pid=1-s2.0-S2210539524000634-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in Transportation Business and Management","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210539524000634","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BUSINESS","Score":null,"Total":0}
引用次数: 0

Abstract

Flight delays negatively impact costs, customer satisfaction, and revenue in the aviation industry. As a result, it is critical to identify the factors that cause flight delays for each airport, as they can vary depending on various attributes associated with their operations.

This study proposes an explainable artificial intelligence (xAI) methodology for identifying the features that affect airport delays by integrating data from multiple sources and implementing explainable artificial intelligence. The methodology incorporates operational data, airport information, geographic data, and weather data combined and used to train a series of machine learning models. Furthermore, the SHAP and Sobol techniques are used to thoroughly analyze the features that influence flight delays for the specific case of the airport in Santiago, Chile.

The results show that a linear discriminant analysis model is best suited for predicting flight delays in this specific case study, and the features that have the most significant impact on delays are the international flight status, average temperature at the destination airport, wind speed, and average temperature at Santiago airport.

The proposed methodology could be applied by airlines that can collect data from multiple sources and conduct similar investigations, leading to the development of a decision support system to make better-informed decisions and reduce the impact of flight delays.

整合多种数据源,利用可解释机器学习改进航班延误预测
航班延误会对航空业的成本、客户满意度和收入产生负面影响。因此,识别造成每个机场航班延误的因素至关重要,因为这些因素会因机场运营相关的各种属性而有所不同。本研究提出了一种可解释人工智能(xAI)方法,通过整合来自多个来源的数据并实施可解释人工智能来识别影响机场延误的特征。该方法将运行数据、机场信息、地理数据和天气数据结合起来,用于训练一系列机器学习模型。结果表明,线性判别分析模型最适合预测该具体案例研究中的航班延误,而对延误影响最大的特征是国际航班状态、目的地机场的平均气温、风速和圣地亚哥机场的平均气温。建议的方法可应用于航空公司,它们可以从多个来源收集数据并进行类似的调查,从而开发出决策支持系统,做出更明智的决策,减少航班延误的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.10
自引率
8.30%
发文量
175
期刊介绍: Research in Transportation Business & Management (RTBM) will publish research on international aspects of transport management such as business strategy, communication, sustainability, finance, human resource management, law, logistics, marketing, franchising, privatisation and commercialisation. Research in Transportation Business & Management welcomes proposals for themed volumes from scholars in management, in relation to all modes of transport. Issues should be cross-disciplinary for one mode or single-disciplinary for all modes. We are keen to receive proposals that combine and integrate theories and concepts that are taken from or can be traced to origins in different disciplines or lessons learned from different modes and approaches to the topic. By facilitating the development of interdisciplinary or intermodal concepts, theories and ideas, and by synthesizing these for the journal''s audience, we seek to contribute to both scholarly advancement of knowledge and the state of managerial practice. Potential volume themes include: -Sustainability and Transportation Management- Transport Management and the Reduction of Transport''s Carbon Footprint- Marketing Transport/Branding Transportation- Benchmarking, Performance Measurement and Best Practices in Transport Operations- Franchising, Concessions and Alternate Governance Mechanisms for Transport Organisations- Logistics and the Integration of Transportation into Freight Supply Chains- Risk Management (or Asset Management or Transportation Finance or ...): Lessons from Multiple Modes- Engaging the Stakeholder in Transportation Governance- Reliability in the Freight Sector
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信