结合调度命令的数据驱动列车延误预测:一个xgboost -元启发式框架

IF 2.3 4区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Tianze Gao, Junhua Chen, Huizhang Xu
{"title":"结合调度命令的数据驱动列车延误预测:一个xgboost -元启发式框架","authors":"Tianze Gao,&nbsp;Junhua Chen,&nbsp;Huizhang Xu","doi":"10.1049/itr2.12461","DOIUrl":null,"url":null,"abstract":"<p>Train delays can significantly impact the punctuality and service quality of high-speed trains, which also play a crucial role in affecting dispatchers with their decision-making. In this study, a data-driven train delay prediction framework was proposed and strengthened by considering the impact of dispatching commands and the mechanisms of train delay propagation using XGBoost. Four metaheuristic algorithms were utilized to fine-tune its hyperparameters. A vast dataset comprising 1.9 million records spanning 38 months of train operation data was utilized for feature extraction and model training. The model's accuracy was evaluated using three statistical metrics, and a comparison of the four tuning frameworks was performed. To emphasize the model's interpretability and its practical guidance for train rescheduling, the relationship of dispatching commands, delay propagation and delay prediction was validated by combining the theory and practical results, and a SHAP (SHapley Additive exPlanations) analysis was used for a clearer model explanation. The results revealed that distinct XGBoost-Metaheuristic models exhibit unique effects in different criteria, yet they all demonstrated high accuracy and low prediction errors, thereby revealing the potential of using machine learning for train delay prediction, which is valuable for decision-making and rescheduling.</p>","PeriodicalId":50381,"journal":{"name":"IET Intelligent Transport Systems","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.12461","citationCount":"0","resultStr":"{\"title\":\"Data-driven train delay prediction incorporating dispatching commands: An XGBoost-metaheuristic framework\",\"authors\":\"Tianze Gao,&nbsp;Junhua Chen,&nbsp;Huizhang Xu\",\"doi\":\"10.1049/itr2.12461\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Train delays can significantly impact the punctuality and service quality of high-speed trains, which also play a crucial role in affecting dispatchers with their decision-making. In this study, a data-driven train delay prediction framework was proposed and strengthened by considering the impact of dispatching commands and the mechanisms of train delay propagation using XGBoost. Four metaheuristic algorithms were utilized to fine-tune its hyperparameters. A vast dataset comprising 1.9 million records spanning 38 months of train operation data was utilized for feature extraction and model training. The model's accuracy was evaluated using three statistical metrics, and a comparison of the four tuning frameworks was performed. To emphasize the model's interpretability and its practical guidance for train rescheduling, the relationship of dispatching commands, delay propagation and delay prediction was validated by combining the theory and practical results, and a SHAP (SHapley Additive exPlanations) analysis was used for a clearer model explanation. The results revealed that distinct XGBoost-Metaheuristic models exhibit unique effects in different criteria, yet they all demonstrated high accuracy and low prediction errors, thereby revealing the potential of using machine learning for train delay prediction, which is valuable for decision-making and rescheduling.</p>\",\"PeriodicalId\":50381,\"journal\":{\"name\":\"IET Intelligent Transport Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2023-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.12461\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Intelligent Transport Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/itr2.12461\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Intelligent Transport Systems","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/itr2.12461","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

列车延误会严重影响高铁的正点率和服务质量,这对调度员的决策也起着至关重要的作用。本文提出了一个数据驱动的列车延误预测框架,并考虑调度命令的影响和XGBoost列车延误传播机制,对该框架进行了强化。利用四种元启发式算法对其超参数进行微调。利用涵盖38个月列车运行数据的190万条记录的庞大数据集进行特征提取和模型训练。利用三种统计指标对模型的精度进行了评估,并对四种调优框架进行了比较。为了强调模型的可解释性和对列车重调度的实际指导作用,将理论与实际结果相结合,验证了调度命令、延迟传播和延迟预测之间的关系,并采用SHapley加性解释(SHapley Additive explanation)分析法对模型进行了更清晰的解释。结果表明,不同的xgboost - meta启发式模型在不同的标准下表现出独特的效果,但它们都表现出高精度和低预测误差,从而揭示了使用机器学习进行列车延误预测的潜力,这对决策和重新调度有价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Data-driven train delay prediction incorporating dispatching commands: An XGBoost-metaheuristic framework

Data-driven train delay prediction incorporating dispatching commands: An XGBoost-metaheuristic framework

Data-driven train delay prediction incorporating dispatching commands: An XGBoost-metaheuristic framework

Train delays can significantly impact the punctuality and service quality of high-speed trains, which also play a crucial role in affecting dispatchers with their decision-making. In this study, a data-driven train delay prediction framework was proposed and strengthened by considering the impact of dispatching commands and the mechanisms of train delay propagation using XGBoost. Four metaheuristic algorithms were utilized to fine-tune its hyperparameters. A vast dataset comprising 1.9 million records spanning 38 months of train operation data was utilized for feature extraction and model training. The model's accuracy was evaluated using three statistical metrics, and a comparison of the four tuning frameworks was performed. To emphasize the model's interpretability and its practical guidance for train rescheduling, the relationship of dispatching commands, delay propagation and delay prediction was validated by combining the theory and practical results, and a SHAP (SHapley Additive exPlanations) analysis was used for a clearer model explanation. The results revealed that distinct XGBoost-Metaheuristic models exhibit unique effects in different criteria, yet they all demonstrated high accuracy and low prediction errors, thereby revealing the potential of using machine learning for train delay prediction, which is valuable for decision-making and rescheduling.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IET Intelligent Transport Systems
IET Intelligent Transport Systems 工程技术-运输科技
CiteScore
6.50
自引率
7.40%
发文量
159
审稿时长
3 months
期刊介绍: IET Intelligent Transport Systems is an interdisciplinary journal devoted to research into the practical applications of ITS and infrastructures. The scope of the journal includes the following: Sustainable traffic solutions Deployments with enabling technologies Pervasive monitoring Applications; demonstrations and evaluation Economic and behavioural analyses of ITS services and scenario Data Integration and analytics Information collection and processing; image processing applications in ITS ITS aspects of electric vehicles Autonomous vehicles; connected vehicle systems; In-vehicle ITS, safety and vulnerable road user aspects Mobility as a service systems Traffic management and control Public transport systems technologies Fleet and public transport logistics Emergency and incident management Demand management and electronic payment systems Traffic related air pollution management Policy and institutional issues Interoperability, standards and architectures Funding scenarios Enforcement Human machine interaction Education, training and outreach Current Special Issue Call for papers: Intelligent Transportation Systems in Smart Cities for Sustainable Environment - https://digital-library.theiet.org/files/IET_ITS_CFP_ITSSCSE.pdf Sustainably Intelligent Mobility (SIM) - https://digital-library.theiet.org/files/IET_ITS_CFP_SIM.pdf Traffic Theory and Modelling in the Era of Artificial Intelligence and Big Data (in collaboration with World Congress for Transport Research, WCTR 2019) - https://digital-library.theiet.org/files/IET_ITS_CFP_WCTR.pdf
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信