Distilled Routing Transformer for Driving Behavior Prediction

IF 0.7 Q4 TRANSPORTATION SCIENCE & TECHNOLOGY
Jun Gao, Jiangang Yi, Yi Lu Murphey
{"title":"Distilled Routing Transformer for Driving Behavior Prediction","authors":"Jun Gao, Jiangang Yi, Yi Lu Murphey","doi":"10.4271/09-12-01-0003","DOIUrl":null,"url":null,"abstract":"<div>The uncertainty of a driver’s state, the variability of the traffic environment, and the complexity of road conditions have made driving behavior a critical factor affecting traffic safety. Accurate predicting of driving behavior is therefore crucial for ensuring safe driving. In this research, an efficient framework, distilled routing transformer (DRTR), is proposed for driving behavior prediction using multiple modality data, i.e., front view video frames and vehicle signals. First, a cross-modal attention distiller is introduced, which distills the cross-modal attention knowledge of a fusion-encoder transformer to guide the training of our DRTR and learn deep interactions between different modalities. Second, since the multi-modal learning usually requires information from the macro view to the micro view, a self-attention (SA)-routing module is custom-designed for SA layers in DRTR for dynamic scheduling of global and local attentions for each input instance. Finally, a Mogrifier long short-term memory (Mogrifier LSTM) network is employed for DRTR to predict driving behaviors. We applied our approach to real-world data collected during drives in both urban and freeway environments by an instrumented vehicle. The experimental results demonstrate that the DRTR can predict the imminent driving behavior effectively while enjoying a faster inference speed than other state-of-the-art (SOTA) baselines.</div>","PeriodicalId":42847,"journal":{"name":"SAE International Journal of Transportation Safety","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2023-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SAE International Journal of Transportation Safety","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4271/09-12-01-0003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The uncertainty of a driver’s state, the variability of the traffic environment, and the complexity of road conditions have made driving behavior a critical factor affecting traffic safety. Accurate predicting of driving behavior is therefore crucial for ensuring safe driving. In this research, an efficient framework, distilled routing transformer (DRTR), is proposed for driving behavior prediction using multiple modality data, i.e., front view video frames and vehicle signals. First, a cross-modal attention distiller is introduced, which distills the cross-modal attention knowledge of a fusion-encoder transformer to guide the training of our DRTR and learn deep interactions between different modalities. Second, since the multi-modal learning usually requires information from the macro view to the micro view, a self-attention (SA)-routing module is custom-designed for SA layers in DRTR for dynamic scheduling of global and local attentions for each input instance. Finally, a Mogrifier long short-term memory (Mogrifier LSTM) network is employed for DRTR to predict driving behaviors. We applied our approach to real-world data collected during drives in both urban and freeway environments by an instrumented vehicle. The experimental results demonstrate that the DRTR can predict the imminent driving behavior effectively while enjoying a faster inference speed than other state-of-the-art (SOTA) baselines.
用于驱动行为预测的蒸馏路由变压器
驾驶员状态的不确定性、交通环境的多变性以及道路条件的复杂性,使得驾驶行为成为影响交通安全的关键因素。因此,准确预测驾驶行为对于确保安全驾驶至关重要。在本研究中,提出了一种高效的框架,即提取路由变压器(DRTR),用于使用多模态数据(即前视视频帧和车辆信号)进行驾驶行为预测。首先,介绍了一个跨模态注意提取器,它提取融合编码器转换器的跨模态注意知识,以指导我们的DRTR训练并学习不同模态之间的深度交互。其次,由于多模态学习通常需要从宏观视图到微观视图的信息,因此在DRTR中为SA层定制了自关注路由模块,用于动态调度每个输入实例的全局和局部关注。最后,将Mogrifier长短期记忆(Mogrifier LSTM)网络用于DRTR预测驾驶行为。我们将我们的方法应用于一辆仪表化车辆在城市和高速公路环境中行驶时收集的真实数据。实验结果表明,DRTR可以有效地预测即将发生的驾驶行为,并且具有比其他最先进(SOTA)基线更快的推理速度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
SAE International Journal of Transportation Safety
SAE International Journal of Transportation Safety TRANSPORTATION SCIENCE & TECHNOLOGY-
CiteScore
1.10
自引率
0.00%
发文量
21
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信