基于随机森林选择变量的实时碰撞预测贝叶斯网络模型

Mingxian Wu, Donghui Shan, Zuo Wang, Xiaoduan Sun, Jianbei Liu, Ming Sun
{"title":"基于随机森林选择变量的实时碰撞预测贝叶斯网络模型","authors":"Mingxian Wu, Donghui Shan, Zuo Wang, Xiaoduan Sun, Jianbei Liu, Ming Sun","doi":"10.1109/ICTIS.2019.8883694","DOIUrl":null,"url":null,"abstract":"To reduce crash risk through managing real-time traffic flow is an effective way to improve highway safety. Populating crash data with the high-resolution (in space and time) traffic data to investigate the relationship between crash risk and traffic conditions can help the active traffic management (ATM) to predict crash risk in real-time. To improve the accuracy of the existing real-time crash prediction methods, this paper proposes a Bayesian network (BN) model with the variables selected by the random forest (RF). The study used the traffic and crash data from I-5 segment (13.8-miles-long) in Los Angeles, where the traffic data (flow, speed, occupancy and etc.) were recorded by sensors. The RF was used to rank the explanatory variables based on the Gini index, which yields the most significant variables for the crash prediction. Different from the previous studies, the gradient change of traffic data along distance is recognized as an important variable. The developed BN-RF model is evaluated by ROC curve. The results indicate that traffic conditions at two five-minute intervals prior to a crash is very sensitive to the crash risk prediction. A sudden speed change (speed magnitude in a short time and distance), characterized as one unit, makes the distribution of crash posterior probability at least 0.045 higher compared to the marginal value. The most important finding is that the proposed BN-RF model for the real-time crash prediction can accurately predict 70.46% of crashes with a 16.07% of false alarm rate, which is better than that of previous studies.","PeriodicalId":325712,"journal":{"name":"2019 5th International Conference on Transportation Information and Safety (ICTIS)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"A Bayesian Network Model for Real-time Crash Prediction Based on Selected Variables by Random Forest\",\"authors\":\"Mingxian Wu, Donghui Shan, Zuo Wang, Xiaoduan Sun, Jianbei Liu, Ming Sun\",\"doi\":\"10.1109/ICTIS.2019.8883694\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To reduce crash risk through managing real-time traffic flow is an effective way to improve highway safety. Populating crash data with the high-resolution (in space and time) traffic data to investigate the relationship between crash risk and traffic conditions can help the active traffic management (ATM) to predict crash risk in real-time. To improve the accuracy of the existing real-time crash prediction methods, this paper proposes a Bayesian network (BN) model with the variables selected by the random forest (RF). The study used the traffic and crash data from I-5 segment (13.8-miles-long) in Los Angeles, where the traffic data (flow, speed, occupancy and etc.) were recorded by sensors. The RF was used to rank the explanatory variables based on the Gini index, which yields the most significant variables for the crash prediction. Different from the previous studies, the gradient change of traffic data along distance is recognized as an important variable. The developed BN-RF model is evaluated by ROC curve. The results indicate that traffic conditions at two five-minute intervals prior to a crash is very sensitive to the crash risk prediction. A sudden speed change (speed magnitude in a short time and distance), characterized as one unit, makes the distribution of crash posterior probability at least 0.045 higher compared to the marginal value. The most important finding is that the proposed BN-RF model for the real-time crash prediction can accurately predict 70.46% of crashes with a 16.07% of false alarm rate, which is better than that of previous studies.\",\"PeriodicalId\":325712,\"journal\":{\"name\":\"2019 5th International Conference on Transportation Information and Safety (ICTIS)\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 5th International Conference on Transportation Information and Safety (ICTIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTIS.2019.8883694\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 5th International Conference on Transportation Information and Safety (ICTIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTIS.2019.8883694","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

通过实时交通流管理来降低碰撞风险是提高公路安全的有效途径。利用高分辨率(时空)交通数据填充碰撞数据,研究碰撞风险与交通状况之间的关系,有助于主动交通管理(ATM)实时预测碰撞风险。为了提高现有实时碰撞预测方法的准确性,本文提出了一种贝叶斯网络(BN)模型,该模型的变量由随机森林(RF)选择。该研究使用了洛杉矶I-5路段(13.8英里长)的交通和碰撞数据,那里的交通数据(流量、速度、占用率等)由传感器记录。RF用于根据基尼指数对解释变量进行排名,基尼指数产生了对崩溃预测最重要的变量。与以往的研究不同,本文将交通数据沿距离的梯度变化视为一个重要变量。用ROC曲线对所建立的BN-RF模型进行评价。结果表明,碰撞前两个5分钟间隔的交通状况对碰撞风险预测非常敏感。以一个单位为特征的突然速度变化(短时间和短距离内的速度量级)使碰撞后验概率分布比边际值至少高0.045。最重要的发现是,本文提出的用于实时碰撞预测的BN-RF模型可以准确预测70.46%的碰撞,虚警率为16.07%,优于以往的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Bayesian Network Model for Real-time Crash Prediction Based on Selected Variables by Random Forest
To reduce crash risk through managing real-time traffic flow is an effective way to improve highway safety. Populating crash data with the high-resolution (in space and time) traffic data to investigate the relationship between crash risk and traffic conditions can help the active traffic management (ATM) to predict crash risk in real-time. To improve the accuracy of the existing real-time crash prediction methods, this paper proposes a Bayesian network (BN) model with the variables selected by the random forest (RF). The study used the traffic and crash data from I-5 segment (13.8-miles-long) in Los Angeles, where the traffic data (flow, speed, occupancy and etc.) were recorded by sensors. The RF was used to rank the explanatory variables based on the Gini index, which yields the most significant variables for the crash prediction. Different from the previous studies, the gradient change of traffic data along distance is recognized as an important variable. The developed BN-RF model is evaluated by ROC curve. The results indicate that traffic conditions at two five-minute intervals prior to a crash is very sensitive to the crash risk prediction. A sudden speed change (speed magnitude in a short time and distance), characterized as one unit, makes the distribution of crash posterior probability at least 0.045 higher compared to the marginal value. The most important finding is that the proposed BN-RF model for the real-time crash prediction can accurately predict 70.46% of crashes with a 16.07% of false alarm rate, which is better than that of previous studies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信