High-risk event prone driver identification considering driving behavior temporal covariate shift

IF 6.2 1区工程技术 Q1 ERGONOMICS

Accident; analysis and prevention Pub Date : 2024-03-02 DOI:10.1016/j.aap.2024.107526

Ruici Zhang , Xiang Wen , Huanqiang Cao , Pengfei Cui , Hua Chai , Runbo Hu , Rongjie Yu

{"title":"High-risk event prone driver identification considering driving behavior temporal covariate shift","authors":"Ruici Zhang , Xiang Wen , Huanqiang Cao , Pengfei Cui , Hua Chai , Runbo Hu , Rongjie Yu","doi":"10.1016/j.aap.2024.107526","DOIUrl":null,"url":null,"abstract":"<div><p>Drivers who perform frequent high-risk events (e.g., hard braking maneuvers) pose a significant threat to traffic safety. Existing studies commonly estimated high-risk event occurrence probabilities based upon the assumption that data collected from different time periods are independent and identically distributed (referred to as i.i.d. assumption). Such approach ignored the issue of driving behavior temporal covariate shift, where the distributions of driving behavior factors vary over time. To fill the gap, this study targets at obtaining time-invariant driving behavior features and establishing their relationships with high-risk event occurrence probability. Specifically, a generalized modeling framework consisting of distribution characterization (DC) and distribution matching (DM) modules was proposed. The DC module split the whole dataset into several segments with the largest distribution gaps, while the DM module identified time-invariant driving behavior features through learning common knowledge among different segments. Then, gated recurrent unit (GRU) was employed to conduct time-invariant driving behavior feature mining for high-risk event occurrence probability estimation. Moreover, modified loss functions were introduced for imbalanced data learning caused by the rarity of high-risk events. The empirical analyses were conducted utilizing online ride-hailing services data. Experiment results showed that the proposed generalized modeling framework provided a 7.2% higher average precision compared to the traditional i.i.d. assumption based approach. The modified loss functions further improved the model performance by 3.8%. Finally, benefits for the driver management program improvement have been explored by a case study, demonstrating a 33.34% enhancement in the identification precision of high-risk event prone drivers.</p></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"199 ","pages":"Article 107526"},"PeriodicalIF":6.2000,"publicationDate":"2024-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S000145752400071X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}

引用次数: 0

Abstract

Drivers who perform frequent high-risk events (e.g., hard braking maneuvers) pose a significant threat to traffic safety. Existing studies commonly estimated high-risk event occurrence probabilities based upon the assumption that data collected from different time periods are independent and identically distributed (referred to as i.i.d. assumption). Such approach ignored the issue of driving behavior temporal covariate shift, where the distributions of driving behavior factors vary over time. To fill the gap, this study targets at obtaining time-invariant driving behavior features and establishing their relationships with high-risk event occurrence probability. Specifically, a generalized modeling framework consisting of distribution characterization (DC) and distribution matching (DM) modules was proposed. The DC module split the whole dataset into several segments with the largest distribution gaps, while the DM module identified time-invariant driving behavior features through learning common knowledge among different segments. Then, gated recurrent unit (GRU) was employed to conduct time-invariant driving behavior feature mining for high-risk event occurrence probability estimation. Moreover, modified loss functions were introduced for imbalanced data learning caused by the rarity of high-risk events. The empirical analyses were conducted utilizing online ride-hailing services data. Experiment results showed that the proposed generalized modeling framework provided a 7.2% higher average precision compared to the traditional i.i.d. assumption based approach. The modified loss functions further improved the model performance by 3.8%. Finally, benefits for the driver management program improvement have been explored by a case study, demonstrating a 33.34% enhancement in the identification precision of high-risk event prone drivers.

查看原文本刊更多论文

考虑到驾驶行为时间协变量的变化，识别易发生高风险事件的驾驶员

频繁发生高风险事件（如急刹车）的驾驶员对交通安全构成了重大威胁。现有研究通常基于不同时间段收集的数据是独立且同分布的假设（称为 i.i.d. 假设）来估算高风险事件发生概率。这种方法忽略了驾驶行为时间协变量转移的问题，即驾驶行为因素的分布随时间而变化。为填补这一空白，本研究旨在获取随时间变化的驾驶行为特征，并建立其与高风险事件发生概率的关系。具体来说，研究提出了一个由分布特征描述（DC）和分布匹配（DM）模块组成的通用建模框架。分布表征模块将整个数据集分割成分布差距最大的几个片段，而分布匹配模块则通过学习不同片段之间的共同知识来识别时变驾驶行为特征。然后，利用门控递归单元（GRU）进行时变驾驶行为特征挖掘，以估计高风险事件的发生概率。此外，针对高风险事件的罕见性所导致的不平衡数据学习，引入了修正的损失函数。利用在线叫车服务数据进行了实证分析。实验结果表明，与传统的基于 i.i.d. 假设的方法相比，所提出的广义建模框架的平均精度提高了 7.2%。修改后的损失函数进一步提高了模型性能 3.8%。最后，通过案例研究探讨了改进驾驶员管理程序的益处，结果表明高风险事件易发驾驶员的识别精度提高了 33.34%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Accident; analysis and prevention Multiple-

CiteScore

11.90

自引率

16.90%

发文量

264

审稿时长

48 days

期刊介绍： Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.