采用梯度增强决策树(GBDT)算法建立了考虑延迟传播特征的列车延误预测模型

IF 2.8 3区工程技术 Q2 ENGINEERING, MANUFACTURING

Advances in Production Engineering & Management Pub Date : 2021-09-30 DOI:10.14743/apem2021.3.400

Y.D. Zhang, L. Liao, Q. Yu, W. Ma, K. Li

{"title":"采用梯度增强决策树(GBDT)算法建立了考虑延迟传播特征的列车延误预测模型","authors":"Y.D. Zhang, L. Liao, Q. Yu, W. Ma, K. Li","doi":"10.14743/apem2021.3.400","DOIUrl":null,"url":null,"abstract":"Accurate prediction of train delay is an important basis for the intelligent adjustment of train operation plans. This paper proposes a train delay prediction model that considers the delay propagation feature. The model consists of two parts. The first part is the extraction of delay propagation feature. The best delay classification scheme is determined through the clustering method of delay types for historical data based on the density-based spatial clustering of applications with noise algorithm (DBSCAN), and combining the best delay classification scheme and the k-nearest neighbor (KNN) algorithm to design the classification method of delay type for online data. The delay propagation factor is used to quantify the delay propagation relationship, and on this basis, the horizontal and vertical delay propagation feature are constructed. The second part is the delay prediction, which takes the train operation status feature and delay propagation feature as input feature, and use the gradient boosting decision tree (GBDT) algorithm to complete the prediction. The model was tested and simulated using the actual train operation data, and compared with random forest (RF), support vector regression (SVR) and multilayer perceptron (MLP). The results show that considering the delay propagation feature in the train delay prediction model can further improve the accuracy of train delay prediction. The delay prediction model proposed in this paper can provide a theoretical basis for the intelligentization of railway dispatching, enabling dispatchers to control delays more reasonably, and improve the quality of railway transportation services.","PeriodicalId":48763,"journal":{"name":"Advances in Production Engineering & Management","volume":"644 1","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Using the gradient boosting decision tree (GBDT) algorithm for a train delay prediction model considering the delay propagation feature\",\"authors\":\"Y.D. Zhang, L. Liao, Q. Yu, W. Ma, K. Li\",\"doi\":\"10.14743/apem2021.3.400\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate prediction of train delay is an important basis for the intelligent adjustment of train operation plans. This paper proposes a train delay prediction model that considers the delay propagation feature. The model consists of two parts. The first part is the extraction of delay propagation feature. The best delay classification scheme is determined through the clustering method of delay types for historical data based on the density-based spatial clustering of applications with noise algorithm (DBSCAN), and combining the best delay classification scheme and the k-nearest neighbor (KNN) algorithm to design the classification method of delay type for online data. The delay propagation factor is used to quantify the delay propagation relationship, and on this basis, the horizontal and vertical delay propagation feature are constructed. The second part is the delay prediction, which takes the train operation status feature and delay propagation feature as input feature, and use the gradient boosting decision tree (GBDT) algorithm to complete the prediction. The model was tested and simulated using the actual train operation data, and compared with random forest (RF), support vector regression (SVR) and multilayer perceptron (MLP). The results show that considering the delay propagation feature in the train delay prediction model can further improve the accuracy of train delay prediction. The delay prediction model proposed in this paper can provide a theoretical basis for the intelligentization of railway dispatching, enabling dispatchers to control delays more reasonably, and improve the quality of railway transportation services.\",\"PeriodicalId\":48763,\"journal\":{\"name\":\"Advances in Production Engineering & Management\",\"volume\":\"644 1\",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2021-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Production Engineering & Management\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.14743/apem2021.3.400\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, MANUFACTURING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Production Engineering & Management","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.14743/apem2021.3.400","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MANUFACTURING","Score":null,"Total":0}

引用次数: 3

摘要

列车延误的准确预测是列车运行计划智能调整的重要依据。提出了一种考虑延迟传播特征的列车延误预测模型。该模型由两部分组成。第一部分是延迟传播特征的提取。基于基于密度的空间聚类应用噪声算法(DBSCAN)，通过历史数据延迟类型聚类方法确定最佳延迟分类方案，并将最佳延迟分类方案与k近邻(KNN)算法相结合，设计在线数据延迟类型分类方法。用延迟传播因子来量化延迟传播关系，在此基础上构造水平和垂直延迟传播特征。第二部分是延迟预测，以列车运行状态特征和延迟传播特征作为输入特征，采用梯度增强决策树(GBDT)算法完成预测。利用实际列车运行数据对模型进行了测试和仿真，并与随机森林(RF)、支持向量回归(SVR)和多层感知器(MLP)进行了比较。结果表明，在列车延误预测模型中考虑延误传播特性可以进一步提高列车延误预测的精度。本文提出的延误预测模型可为铁路调度智能化提供理论依据，使调度员更合理地控制延误，提高铁路运输服务质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Using the gradient boosting decision tree (GBDT) algorithm for a train delay prediction model considering the delay propagation feature

Accurate prediction of train delay is an important basis for the intelligent adjustment of train operation plans. This paper proposes a train delay prediction model that considers the delay propagation feature. The model consists of two parts. The first part is the extraction of delay propagation feature. The best delay classification scheme is determined through the clustering method of delay types for historical data based on the density-based spatial clustering of applications with noise algorithm (DBSCAN), and combining the best delay classification scheme and the k-nearest neighbor (KNN) algorithm to design the classification method of delay type for online data. The delay propagation factor is used to quantify the delay propagation relationship, and on this basis, the horizontal and vertical delay propagation feature are constructed. The second part is the delay prediction, which takes the train operation status feature and delay propagation feature as input feature, and use the gradient boosting decision tree (GBDT) algorithm to complete the prediction. The model was tested and simulated using the actual train operation data, and compared with random forest (RF), support vector regression (SVR) and multilayer perceptron (MLP). The results show that considering the delay propagation feature in the train delay prediction model can further improve the accuracy of train delay prediction. The delay prediction model proposed in this paper can provide a theoretical basis for the intelligentization of railway dispatching, enabling dispatchers to control delays more reasonably, and improve the quality of railway transportation services.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Advances in Production Engineering & Management ENGINEERING, MANUFACTURINGMATERIALS SCIENC-MATERIALS SCIENCE, MULTIDISCIPLINARY

CiteScore

5.90

自引率

22.20%

发文量

期刊介绍： Advances in Production Engineering & Management (APEM journal) is an interdisciplinary international academic journal published quarterly. The main goal of the APEM journal is to present original, high quality, theoretical and application-oriented research developments in all areas of production engineering and production management to a broad audience of academics and practitioners. In order to bridge the gap between theory and practice, applications based on advanced theory and case studies are particularly welcome. For theoretical papers, their originality and research contributions are the main factors in the evaluation process. General approaches, formalisms, algorithms or techniques should be illustrated with significant applications that demonstrate their applicability to real-world problems. Please note the APEM journal is not intended especially for studying problems in the finance, economics, business, and bank sectors even though the methodology in the paper is quality/project management oriented. Therefore, the papers should include a substantial level of engineering issues in the field of manufacturing engineering.