采用梯度增强决策树(GBDT)算法建立了考虑延迟传播特征的列车延误预测模型

IF 2.8 3区 工程技术 Q2 ENGINEERING, MANUFACTURING
Y.D. Zhang, L. Liao, Q. Yu, W. Ma, K. Li
{"title":"采用梯度增强决策树(GBDT)算法建立了考虑延迟传播特征的列车延误预测模型","authors":"Y.D. Zhang, L. Liao, Q. Yu, W. Ma, K. Li","doi":"10.14743/apem2021.3.400","DOIUrl":null,"url":null,"abstract":"Accurate prediction of train delay is an important basis for the intelligent adjustment of train operation plans. This paper proposes a train delay prediction model that considers the delay propagation feature. The model consists of two parts. The first part is the extraction of delay propagation feature. The best delay classification scheme is determined through the clustering method of delay types for historical data based on the density-based spatial clustering of applications with noise algorithm (DBSCAN), and combining the best delay classification scheme and the k-nearest neighbor (KNN) algorithm to design the classification method of delay type for online data. The delay propagation factor is used to quantify the delay propagation relationship, and on this basis, the horizontal and vertical delay propagation feature are constructed. The second part is the delay prediction, which takes the train operation status feature and delay propagation feature as input feature, and use the gradient boosting decision tree (GBDT) algorithm to complete the prediction. The model was tested and simulated using the actual train operation data, and compared with random forest (RF), support vector regression (SVR) and multilayer perceptron (MLP). The results show that considering the delay propagation feature in the train delay prediction model can further improve the accuracy of train delay prediction. The delay prediction model proposed in this paper can provide a theoretical basis for the intelligentization of railway dispatching, enabling dispatchers to control delays more reasonably, and improve the quality of railway transportation services.","PeriodicalId":48763,"journal":{"name":"Advances in Production Engineering & Management","volume":null,"pages":null},"PeriodicalIF":2.8000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Using the gradient boosting decision tree (GBDT) algorithm for a train delay prediction model considering the delay propagation feature\",\"authors\":\"Y.D. Zhang, L. Liao, Q. Yu, W. Ma, K. Li\",\"doi\":\"10.14743/apem2021.3.400\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate prediction of train delay is an important basis for the intelligent adjustment of train operation plans. This paper proposes a train delay prediction model that considers the delay propagation feature. The model consists of two parts. The first part is the extraction of delay propagation feature. The best delay classification scheme is determined through the clustering method of delay types for historical data based on the density-based spatial clustering of applications with noise algorithm (DBSCAN), and combining the best delay classification scheme and the k-nearest neighbor (KNN) algorithm to design the classification method of delay type for online data. The delay propagation factor is used to quantify the delay propagation relationship, and on this basis, the horizontal and vertical delay propagation feature are constructed. The second part is the delay prediction, which takes the train operation status feature and delay propagation feature as input feature, and use the gradient boosting decision tree (GBDT) algorithm to complete the prediction. The model was tested and simulated using the actual train operation data, and compared with random forest (RF), support vector regression (SVR) and multilayer perceptron (MLP). The results show that considering the delay propagation feature in the train delay prediction model can further improve the accuracy of train delay prediction. The delay prediction model proposed in this paper can provide a theoretical basis for the intelligentization of railway dispatching, enabling dispatchers to control delays more reasonably, and improve the quality of railway transportation services.\",\"PeriodicalId\":48763,\"journal\":{\"name\":\"Advances in Production Engineering & Management\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2021-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Production Engineering & Management\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.14743/apem2021.3.400\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, MANUFACTURING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Production Engineering & Management","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.14743/apem2021.3.400","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MANUFACTURING","Score":null,"Total":0}
引用次数: 3

摘要

列车延误的准确预测是列车运行计划智能调整的重要依据。提出了一种考虑延迟传播特征的列车延误预测模型。该模型由两部分组成。第一部分是延迟传播特征的提取。基于基于密度的空间聚类应用噪声算法(DBSCAN),通过历史数据延迟类型聚类方法确定最佳延迟分类方案,并将最佳延迟分类方案与k近邻(KNN)算法相结合,设计在线数据延迟类型分类方法。用延迟传播因子来量化延迟传播关系,在此基础上构造水平和垂直延迟传播特征。第二部分是延迟预测,以列车运行状态特征和延迟传播特征作为输入特征,采用梯度增强决策树(GBDT)算法完成预测。利用实际列车运行数据对模型进行了测试和仿真,并与随机森林(RF)、支持向量回归(SVR)和多层感知器(MLP)进行了比较。结果表明,在列车延误预测模型中考虑延误传播特性可以进一步提高列车延误预测的精度。本文提出的延误预测模型可为铁路调度智能化提供理论依据,使调度员更合理地控制延误,提高铁路运输服务质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using the gradient boosting decision tree (GBDT) algorithm for a train delay prediction model considering the delay propagation feature
Accurate prediction of train delay is an important basis for the intelligent adjustment of train operation plans. This paper proposes a train delay prediction model that considers the delay propagation feature. The model consists of two parts. The first part is the extraction of delay propagation feature. The best delay classification scheme is determined through the clustering method of delay types for historical data based on the density-based spatial clustering of applications with noise algorithm (DBSCAN), and combining the best delay classification scheme and the k-nearest neighbor (KNN) algorithm to design the classification method of delay type for online data. The delay propagation factor is used to quantify the delay propagation relationship, and on this basis, the horizontal and vertical delay propagation feature are constructed. The second part is the delay prediction, which takes the train operation status feature and delay propagation feature as input feature, and use the gradient boosting decision tree (GBDT) algorithm to complete the prediction. The model was tested and simulated using the actual train operation data, and compared with random forest (RF), support vector regression (SVR) and multilayer perceptron (MLP). The results show that considering the delay propagation feature in the train delay prediction model can further improve the accuracy of train delay prediction. The delay prediction model proposed in this paper can provide a theoretical basis for the intelligentization of railway dispatching, enabling dispatchers to control delays more reasonably, and improve the quality of railway transportation services.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advances in Production Engineering & Management
Advances in Production Engineering & Management ENGINEERING, MANUFACTURINGMATERIALS SCIENC-MATERIALS SCIENCE, MULTIDISCIPLINARY
CiteScore
5.90
自引率
22.20%
发文量
19
期刊介绍: Advances in Production Engineering & Management (APEM journal) is an interdisciplinary international academic journal published quarterly. The main goal of the APEM journal is to present original, high quality, theoretical and application-oriented research developments in all areas of production engineering and production management to a broad audience of academics and practitioners. In order to bridge the gap between theory and practice, applications based on advanced theory and case studies are particularly welcome. For theoretical papers, their originality and research contributions are the main factors in the evaluation process. General approaches, formalisms, algorithms or techniques should be illustrated with significant applications that demonstrate their applicability to real-world problems. Please note the APEM journal is not intended especially for studying problems in the finance, economics, business, and bank sectors even though the methodology in the paper is quality/project management oriented. Therefore, the papers should include a substantial level of engineering issues in the field of manufacturing engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信