Incorporating multi-path risk assessment in transformer-based pedestrian crossing action prediction

IF 5.7 1区 工程技术 Q1 ERGONOMICS
Bowen Liu , Meng Li , Ruyi Feng , Wei Zhou , Zhibin Li
{"title":"Incorporating multi-path risk assessment in transformer-based pedestrian crossing action prediction","authors":"Bowen Liu ,&nbsp;Meng Li ,&nbsp;Ruyi Feng ,&nbsp;Wei Zhou ,&nbsp;Zhibin Li","doi":"10.1016/j.aap.2025.108002","DOIUrl":null,"url":null,"abstract":"<div><div>This paper proposes a Transformer-based framework for predicting pedestrian crossing actions that uses visualized pedestrian-vehicle collision risks, which are assessed from multiple potential paths. Our framework contains two sequential steps: (1) multi-path risks of a pedestrian-vehicle interaction (PVIs) at each time point are estimated and encoded into an RGB image, which captures a high-density array of safety information. (2) a multi-modal fusion architecture that incorporates both risk images and historical sequential data (e.g., pedestrian action and vehicle velocity) is developed based on the Cross-Attention Transformer. The model outputs are also risk-informed, categorized as yielding, risky crossing, and safe crossing. Experiments are conducted on real-world data from the Euro-PVI dataset. Through two-dimensional mapping tests, risk images are validated to have significant spatiotemporal feature differences and transition associations under different PVIs. The Transformer architecture proves to be an effective method for processing multi-path risk images. Prediction accuracy reaches 87.34% for short-term forecasts (0.5 s ahead), maintains stability as the prediction time horizon progressively extends to 2 s, and improves the prediction of abrupt action switches. For further exploration and validation, the risk image data and imaging code are available at <span><span>www.github.com/Sivan0227/PVI-Risk-Image</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"215 ","pages":"Article 108002"},"PeriodicalIF":5.7000,"publicationDate":"2025-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457525000880","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0

Abstract

This paper proposes a Transformer-based framework for predicting pedestrian crossing actions that uses visualized pedestrian-vehicle collision risks, which are assessed from multiple potential paths. Our framework contains two sequential steps: (1) multi-path risks of a pedestrian-vehicle interaction (PVIs) at each time point are estimated and encoded into an RGB image, which captures a high-density array of safety information. (2) a multi-modal fusion architecture that incorporates both risk images and historical sequential data (e.g., pedestrian action and vehicle velocity) is developed based on the Cross-Attention Transformer. The model outputs are also risk-informed, categorized as yielding, risky crossing, and safe crossing. Experiments are conducted on real-world data from the Euro-PVI dataset. Through two-dimensional mapping tests, risk images are validated to have significant spatiotemporal feature differences and transition associations under different PVIs. The Transformer architecture proves to be an effective method for processing multi-path risk images. Prediction accuracy reaches 87.34% for short-term forecasts (0.5 s ahead), maintains stability as the prediction time horizon progressively extends to 2 s, and improves the prediction of abrupt action switches. For further exploration and validation, the risk image data and imaging code are available at www.github.com/Sivan0227/PVI-Risk-Image.
基于多路径风险评估的变压器行人过马路行为预测
本文提出了一种基于变压器的行人过马路预测框架,该框架使用可视化的行人与车辆碰撞风险,从多个潜在路径评估行人与车辆的碰撞风险。我们的框架包含两个连续步骤:(1)估计每个时间点行人与车辆交互(PVIs)的多路径风险,并将其编码为RGB图像,该图像捕获高密度的安全信息阵列。(2)基于交叉注意转换器,开发了一种融合风险图像和历史序列数据(如行人行为和车辆速度)的多模态融合架构。模型输出也有风险信息,分为收益、风险交叉和安全交叉。在Euro-PVI数据集的真实数据上进行了实验。通过二维映射测试,验证了风险图像在不同PVIs下具有显著的时空特征差异和过渡关联。Transformer架构被证明是处理多路径风险图像的有效方法。短期预测(提前0.5 s)的预测精度达到87.34%,随着预测时间范围逐步延长至2 s,预测精度保持稳定,并提高了对突然动作切换的预测。为了进一步探索和验证,风险图像数据和成像代码可在www.github.com/Sivan0227/PVI-Risk-Image上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
11.90
自引率
16.90%
发文量
264
审稿时长
48 days
期刊介绍: Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信