Ranking Analysis of Highway Accident Impact Factors Based on Machine Learning Methods

Fei Ma, Xu Wang, Xiaoling Liao, Di Yu, Hongji Fang, Jing Cao
{"title":"Ranking Analysis of Highway Accident Impact Factors Based on Machine Learning Methods","authors":"Fei Ma, Xu Wang, Xiaoling Liao, Di Yu, Hongji Fang, Jing Cao","doi":"10.1109/ICTIS54573.2021.9798658","DOIUrl":null,"url":null,"abstract":"Road traffic accidents occur frequently recent years. To improve the driving safety of drivers, traffic accident analysis attracts road safety management agencies to investigate what lead to accidents and how to prevent them. In this paper, the impact factors of traffic accidents were analyzed and ranked by using field data of the year of 2012 in California from the Highway Safety Information System (HSIS) dataset. Accident severity was taken as the dependent variable; and thirteen indexes, which are related to environment, roads and drivers, were selected as impact factors respectively. Firstly, the dataset of the accident causation analysis was established after data cleaning. Secondly, support vector machine-recursive feature elimination (SVC-RFE) and random forest-recursive feature elimination algorithm (RF -RFE) were used to rank the importance of indicators. After triple cross validation, the optimal number of features of the two algorithms is obtained. The results show that the rankings obtained by the two methods are similar whether the accident severity or the degree of casualties are the dependent variables. For RF -RFE with high cross-validation scores, the similarity rate can reach 66.7%. The results of this paper can lead to traffic accident analysis methods and can guide effective prevention measures.","PeriodicalId":253824,"journal":{"name":"2021 6th International Conference on Transportation Information and Safety (ICTIS)","volume":"217 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Transportation Information and Safety (ICTIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTIS54573.2021.9798658","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Road traffic accidents occur frequently recent years. To improve the driving safety of drivers, traffic accident analysis attracts road safety management agencies to investigate what lead to accidents and how to prevent them. In this paper, the impact factors of traffic accidents were analyzed and ranked by using field data of the year of 2012 in California from the Highway Safety Information System (HSIS) dataset. Accident severity was taken as the dependent variable; and thirteen indexes, which are related to environment, roads and drivers, were selected as impact factors respectively. Firstly, the dataset of the accident causation analysis was established after data cleaning. Secondly, support vector machine-recursive feature elimination (SVC-RFE) and random forest-recursive feature elimination algorithm (RF -RFE) were used to rank the importance of indicators. After triple cross validation, the optimal number of features of the two algorithms is obtained. The results show that the rankings obtained by the two methods are similar whether the accident severity or the degree of casualties are the dependent variables. For RF -RFE with high cross-validation scores, the similarity rate can reach 66.7%. The results of this paper can lead to traffic accident analysis methods and can guide effective prevention measures.
基于机器学习方法的公路事故影响因素排序分析
道路交通事故近年来频频发生。为了提高驾驶员的驾驶安全,交通事故分析吸引了道路安全管理机构研究导致事故的原因和如何预防事故。本文利用美国加州公路安全信息系统(HSIS)数据集2012年的现场数据,对交通事故的影响因素进行了分析和排序。以事故严重程度为因变量;选取与环境、道路、驾驶员相关的13个指标作为影响因子。首先,对数据进行清洗,建立事故原因分析数据集。其次,采用支持向量机递归特征消去算法(SVC-RFE)和随机森林递归特征消去算法(RF -RFE)对指标重要性进行排序。经过三次交叉验证,得到了两种算法的最优特征数。结果表明,无论以事故严重程度还是伤亡程度作为因变量,两种方法得到的排名结果相近。对于交叉验证分数较高的RF -RFE,相似率可达66.7%。本文的研究结果可以指导交通事故的分析方法和有效的预防措施。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信