Identification of rear-end crash patterns on instrumented freeways: a data mining approach

A. Pande, M. Abdel-Aty
{"title":"Identification of rear-end crash patterns on instrumented freeways: a data mining approach","authors":"A. Pande, M. Abdel-Aty","doi":"10.1109/ITSC.2005.1520155","DOIUrl":null,"url":null,"abstract":"Data mining is the analysis of large \"observational\" datasets to find unsuspected relationships that might be useful to the data owner. It typically involves analysis where objectives of the mining exercise have no bearing on the data collection strategy. Freeway traffic surveillance data collected through underground loop detectors is one such \"observational\" database maintained for various ITS (intelligent transportation systems) applications such as travel time prediction etc. In this research data mining process is used to relate this surrogate measure of traffic conditions (data from freeway loop detectors) with occurrence of rear-end crashes on freeways. The results from this analysis are envisioned to be the first step in the development of a functional proactive traffic management system. The dataset under consideration includes information on crashes and corresponding traffic data collected from detectors neighboring the crash locations just prior to the time of the crash. The problem is setup as a classification problem for a crash being rear-end vs. not. Three types of classification tree involving different splitting criterion were attempted for variable selection. It was found that the classification tree with chi sq. test as the splitting criterion resulted in the most inclusive list of variables. The variable selection was followed by two neural network architectures, namely, the RBF (radial basis function) and MLP (multi-layer perceptron) to model the binary target variable. The two neural network models were then combined based on their output to achieve any possible improvement in the classification accuracy. It was found, however, that the classification tree model with chi sq. test as splitting criterion (with more than 65% classification accuracy) was better than any of the individual or combined neural network models (54-55% classification accuracy). Since the decision tree model also provides simple interpretable rules to classify the data in a real-time application it was recommended as the final classification model.","PeriodicalId":153203,"journal":{"name":"Proceedings. 2005 IEEE Intelligent Transportation Systems, 2005.","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. 2005 IEEE Intelligent Transportation Systems, 2005.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC.2005.1520155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Data mining is the analysis of large "observational" datasets to find unsuspected relationships that might be useful to the data owner. It typically involves analysis where objectives of the mining exercise have no bearing on the data collection strategy. Freeway traffic surveillance data collected through underground loop detectors is one such "observational" database maintained for various ITS (intelligent transportation systems) applications such as travel time prediction etc. In this research data mining process is used to relate this surrogate measure of traffic conditions (data from freeway loop detectors) with occurrence of rear-end crashes on freeways. The results from this analysis are envisioned to be the first step in the development of a functional proactive traffic management system. The dataset under consideration includes information on crashes and corresponding traffic data collected from detectors neighboring the crash locations just prior to the time of the crash. The problem is setup as a classification problem for a crash being rear-end vs. not. Three types of classification tree involving different splitting criterion were attempted for variable selection. It was found that the classification tree with chi sq. test as the splitting criterion resulted in the most inclusive list of variables. The variable selection was followed by two neural network architectures, namely, the RBF (radial basis function) and MLP (multi-layer perceptron) to model the binary target variable. The two neural network models were then combined based on their output to achieve any possible improvement in the classification accuracy. It was found, however, that the classification tree model with chi sq. test as splitting criterion (with more than 65% classification accuracy) was better than any of the individual or combined neural network models (54-55% classification accuracy). Since the decision tree model also provides simple interpretable rules to classify the data in a real-time application it was recommended as the final classification model.
在仪表高速公路上识别追尾碰撞模式:一种数据挖掘方法
数据挖掘是对大型“观察”数据集进行分析,以发现可能对数据所有者有用的未预料到的关系。它通常涉及挖掘工作的目标与数据收集战略无关的分析。通过地下环路探测器收集的高速公路交通监控数据是一种“观测”数据库,用于各种智能交通系统(ITS)应用,如旅行时间预测等。在本研究中,使用数据挖掘过程将交通状况的替代度量(来自高速公路环路探测器的数据)与高速公路追尾事故的发生联系起来。从这个分析的结果被设想为第一步,在一个功能主动交通管理系统的发展。正在考虑的数据集包括坠机信息和相应的交通数据,这些数据是在坠机发生之前从坠机地点附近的探测器收集的。这个问题被设置为一个分类问题,即碰撞是追尾还是非追尾。尝试了三种不同分割准则的分类树进行变量选择。结果表明,该分类树具有chi平方。测试作为分割标准产生了最具包容性的变量列表。在变量选择之后,采用径向基函数(RBF)和多层感知器(MLP)两种神经网络架构对二值目标变量进行建模。然后根据它们的输出将两个神经网络模型组合在一起,以实现分类精度的任何可能的提高。然而,我们发现,用卡平方的分类树模型。Test作为分割标准(分类准确率大于65%)优于任何单个或组合神经网络模型(分类准确率为54-55%)。由于决策树模型还提供了简单的可解释规则来对实时应用程序中的数据进行分类,因此建议将其作为最终分类模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信