Discovering Crash Severity Factors of Grade Crossing With a Machine Learning Approach

Dahye Lee, J. Warner, C. Morgan
{"title":"Discovering Crash Severity Factors of Grade Crossing With a Machine Learning Approach","authors":"Dahye Lee, J. Warner, C. Morgan","doi":"10.1115/JRC2019-1231","DOIUrl":null,"url":null,"abstract":"According to the Federal Railroad Administration (FRA) Highway-Rail Grade Crossing Accident/Incident database, more than 12,000 accidents occurred between 2012 and 2017 in the United States with casualties of around 3900. Despite repeated efforts to fully understand the risk factors that contribute to highway-rail grade crossing collisions, there still remain many uncertainties. A machine learning approach is proposed in this paper to find out significant factors, along with their individual impacts of crash severities at grade crossings. One of the most efficient and accurate machine learning algorithms, extreme gradient boosting (XGB or XGBoost), is applied to analyze 21 different accident and crossing -related characteristics per driver severities. The XGB model has been proven in previous studies across many research areas in transportation to outperform other machine learning-based methods and statistical classification methods, such as multinomial logit model, multiple additive regression trees, decision tree, and random forest, especially in prediction accuracy. Thereby, applying the algorithm is expected to provide highly reliable results to identify important factors that have impacts on injury severities at grade crossings. Such application will further aid the discovery of potential crossings with significant factors. The FRA’s Highway-Rail Grade Crossing Accident/Incident database from 2012 to 2017 is fused with the FRA Highway-Rail Crossing Inventory database for the analysis. Observations with missing information were removed from the original database. Crossing position under or over the railroad and pedestrian or other types of highway users were also not considered since they were not specifically of interest in this study. After the database cleaning process, it condensed to the total of 1,250 accidents out of the retrieved 12,630 from the combined database. The results show that adjacent highway traffic volume and train speed are the most significant factors causing accidents and injury severity. They are followed by the driver’s age and the estimated vehicle speed. It also indicated that truck-involved accidents and crossings with gates, flashing lights, and other types of warning devices combined, and highway user’s gender as a male also pertain to the higher injury rate. Through this study, it is possible to provide guidance to decision-makers in recognizing possible risks at-grade crossings that may cause driver casualties.","PeriodicalId":287025,"journal":{"name":"2019 Joint Rail Conference","volume":"4 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Joint Rail Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/JRC2019-1231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

According to the Federal Railroad Administration (FRA) Highway-Rail Grade Crossing Accident/Incident database, more than 12,000 accidents occurred between 2012 and 2017 in the United States with casualties of around 3900. Despite repeated efforts to fully understand the risk factors that contribute to highway-rail grade crossing collisions, there still remain many uncertainties. A machine learning approach is proposed in this paper to find out significant factors, along with their individual impacts of crash severities at grade crossings. One of the most efficient and accurate machine learning algorithms, extreme gradient boosting (XGB or XGBoost), is applied to analyze 21 different accident and crossing -related characteristics per driver severities. The XGB model has been proven in previous studies across many research areas in transportation to outperform other machine learning-based methods and statistical classification methods, such as multinomial logit model, multiple additive regression trees, decision tree, and random forest, especially in prediction accuracy. Thereby, applying the algorithm is expected to provide highly reliable results to identify important factors that have impacts on injury severities at grade crossings. Such application will further aid the discovery of potential crossings with significant factors. The FRA’s Highway-Rail Grade Crossing Accident/Incident database from 2012 to 2017 is fused with the FRA Highway-Rail Crossing Inventory database for the analysis. Observations with missing information were removed from the original database. Crossing position under or over the railroad and pedestrian or other types of highway users were also not considered since they were not specifically of interest in this study. After the database cleaning process, it condensed to the total of 1,250 accidents out of the retrieved 12,630 from the combined database. The results show that adjacent highway traffic volume and train speed are the most significant factors causing accidents and injury severity. They are followed by the driver’s age and the estimated vehicle speed. It also indicated that truck-involved accidents and crossings with gates, flashing lights, and other types of warning devices combined, and highway user’s gender as a male also pertain to the higher injury rate. Through this study, it is possible to provide guidance to decision-makers in recognizing possible risks at-grade crossings that may cause driver casualties.
用机器学习方法发现平交道口碰撞严重程度因素
根据联邦铁路管理局(FRA)公路-铁路平交道口事故/事件数据库,2012年至2017年期间,美国发生了12,000多起事故,造成约3900人伤亡。尽管多次努力充分了解导致公路铁路平交道口碰撞的风险因素,但仍然存在许多不确定因素。本文提出了一种机器学习方法来找出显著因素,以及它们对平交道口碰撞严重程度的个别影响。最有效和最准确的机器学习算法之一,极端梯度增强(XGB或XGBoost),被用于分析每个驾驶员严重程度的21种不同的事故和交叉相关特征。在交通运输的许多研究领域中,XGB模型已经被证明优于其他基于机器学习的方法和统计分类方法,如多项logit模型、多元加性回归树、决策树和随机森林,特别是在预测精度方面。因此,应用该算法有望提供高可靠性的结果,以识别影响平交道口伤害严重程度的重要因素。这种应用将进一步有助于发现具有重要因素的潜在交叉。联邦铁路局2012年至2017年的公路-铁路平交道口事故/事件数据库与联邦铁路局公路-铁路平交道口库存数据库融合进行分析。从原始数据库中删除了信息缺失的观测值。铁路、行人或其他类型的高速公路使用者下方或上方的交叉位置也没有被考虑在内,因为他们不是本研究特别感兴趣的对象。在数据库清理过程之后,从合并数据库中检索到的12,630个事故中浓缩为总共1,250个事故。结果表明,相邻公路交通量和列车速度是造成事故和伤害严重程度的最显著因素。紧随其后的是司机的年龄和估计的车速。该研究还表明,涉及卡车的事故和有大门、闪光灯和其他类型警告装置的交叉路口,以及高速公路使用者的性别为男性也与较高的伤害率有关。通过本研究,可以为决策者识别平交道口可能造成驾驶员伤亡的风险提供指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信