货运列车通过停车信号的机器学习分析:XGBoost和SHAP的应用

IF 2.7 Q3 TRANSPORTATION
Mona Ahmadi Rad , Lianne M. Lefsrud , Michael T. Hendry , Asdrubal Cheng Cen , Sara Soltaninejad
{"title":"货运列车通过停车信号的机器学习分析:XGBoost和SHAP的应用","authors":"Mona Ahmadi Rad ,&nbsp;Lianne M. Lefsrud ,&nbsp;Michael T. Hendry ,&nbsp;Asdrubal Cheng Cen ,&nbsp;Sara Soltaninejad","doi":"10.1016/j.jrtpm.2025.100532","DOIUrl":null,"url":null,"abstract":"<div><div>Passing a Stop Signal (PASS) is a critical safety concern in railway operations, with the potential to cause serious accidents. This study investigates non-human contributing factors to PASS events in Canadian mainline freight operations using machine learning. We analyze incident narratives from the Rail Occurrence Database System (RODS) through text mining and enrich them with geospatial and weather data. We develop a binary classification model using XGBoost and interpret feature importance and interactions with SHAP (SHapley Additive exPlanations). To address class imbalance and improve model performance, we apply a custom sampling method, combined with hyperparameter tuning and data standardization. Key contributors to PASS events include sharp track curvature near signals, downhill grades, low atmospheric pressure, high relative humidity, non-clear weather, and heavy traffic—placing Rocky Mountain subdivisions among the highest-risk areas. The model also reveals that combinations of environmental conditions, such as low temperature, low pressure, and high humidity, increase the likelihood of PASS events by reducing visibility and braking effectiveness. This study offers methodological and empirical contributions by modelling complex operational contexts, incorporating underexplored environmental factors, and producing region-specific insights. The proposed framework informs proactive safety strategies and supports risk analysis in other linear infrastructure systems.</div></div>","PeriodicalId":51821,"journal":{"name":"Journal of Rail Transport Planning & Management","volume":"35 ","pages":"Article 100532"},"PeriodicalIF":2.7000,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of freight train passing a stop signal using machine learning: Application of XGBoost and SHAP\",\"authors\":\"Mona Ahmadi Rad ,&nbsp;Lianne M. Lefsrud ,&nbsp;Michael T. Hendry ,&nbsp;Asdrubal Cheng Cen ,&nbsp;Sara Soltaninejad\",\"doi\":\"10.1016/j.jrtpm.2025.100532\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Passing a Stop Signal (PASS) is a critical safety concern in railway operations, with the potential to cause serious accidents. This study investigates non-human contributing factors to PASS events in Canadian mainline freight operations using machine learning. We analyze incident narratives from the Rail Occurrence Database System (RODS) through text mining and enrich them with geospatial and weather data. We develop a binary classification model using XGBoost and interpret feature importance and interactions with SHAP (SHapley Additive exPlanations). To address class imbalance and improve model performance, we apply a custom sampling method, combined with hyperparameter tuning and data standardization. Key contributors to PASS events include sharp track curvature near signals, downhill grades, low atmospheric pressure, high relative humidity, non-clear weather, and heavy traffic—placing Rocky Mountain subdivisions among the highest-risk areas. The model also reveals that combinations of environmental conditions, such as low temperature, low pressure, and high humidity, increase the likelihood of PASS events by reducing visibility and braking effectiveness. This study offers methodological and empirical contributions by modelling complex operational contexts, incorporating underexplored environmental factors, and producing region-specific insights. The proposed framework informs proactive safety strategies and supports risk analysis in other linear infrastructure systems.</div></div>\",\"PeriodicalId\":51821,\"journal\":{\"name\":\"Journal of Rail Transport Planning & Management\",\"volume\":\"35 \",\"pages\":\"Article 100532\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2025-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Rail Transport Planning & Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2210970625000290\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Rail Transport Planning & Management","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210970625000290","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0

摘要

通过停车信号(PASS)是铁路运营中一个关键的安全问题,有可能导致严重事故。本研究使用机器学习调查了加拿大干线货运业务中PASS事件的非人为影响因素。我们通过文本挖掘分析铁路事故数据库系统(RODS)中的事件叙述,并用地理空间和天气数据丰富它们。我们使用XGBoost开发了一个二元分类模型,并解释了特征重要性和与SHapley Additive explanation (SHapley Additive exPlanations)的相互作用。为了解决类失衡和提高模型性能,我们采用自定义采样方法,结合超参数调优和数据标准化。PASS事件的主要因素包括信号附近的急剧轨道曲率,下坡坡度,低气压,高相对湿度,不晴朗的天气和繁忙的交通,使落基山细分地区成为风险最高的地区。该模型还显示,低温、低压和高湿等环境条件的组合会降低能见度和制动效果,从而增加发生超车事件的可能性。本研究通过模拟复杂的操作背景、纳入未充分开发的环境因素和产生特定区域的见解,提供了方法和经验贡献。拟议的框架为主动安全策略提供信息,并支持其他线性基础设施系统的风险分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis of freight train passing a stop signal using machine learning: Application of XGBoost and SHAP
Passing a Stop Signal (PASS) is a critical safety concern in railway operations, with the potential to cause serious accidents. This study investigates non-human contributing factors to PASS events in Canadian mainline freight operations using machine learning. We analyze incident narratives from the Rail Occurrence Database System (RODS) through text mining and enrich them with geospatial and weather data. We develop a binary classification model using XGBoost and interpret feature importance and interactions with SHAP (SHapley Additive exPlanations). To address class imbalance and improve model performance, we apply a custom sampling method, combined with hyperparameter tuning and data standardization. Key contributors to PASS events include sharp track curvature near signals, downhill grades, low atmospheric pressure, high relative humidity, non-clear weather, and heavy traffic—placing Rocky Mountain subdivisions among the highest-risk areas. The model also reveals that combinations of environmental conditions, such as low temperature, low pressure, and high humidity, increase the likelihood of PASS events by reducing visibility and braking effectiveness. This study offers methodological and empirical contributions by modelling complex operational contexts, incorporating underexplored environmental factors, and producing region-specific insights. The proposed framework informs proactive safety strategies and supports risk analysis in other linear infrastructure systems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.10
自引率
8.10%
发文量
41
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信