Ling Deng, Chengcheng Xu, Pan Liu, Yuxuan Wang, Kequan Chen
{"title":"Interpretable multi-variable transformer network for regional-level short-term bicycle crash risk prediction","authors":"Ling Deng, Chengcheng Xu, Pan Liu, Yuxuan Wang, Kequan Chen","doi":"10.1016/j.aap.2025.108169","DOIUrl":null,"url":null,"abstract":"<div><div>Effective short-term prediction of bicycle crashes at the urban regional level is critical for proactive infrastructure safety interventions and data-driven traffic management. However, three key challenges persist: (1) inadequate modeling of complex spatiotemporal dependencies in multi-source heterogeneous data; (2) poor handling of extreme class imbalance and lack of interpretability in deep learning-based short-term predictions; and (3) limited exploration of bicycle infrastructure’s role in regional crash risk assessment. In response to these challenges, we propose an Interpretable Multi-variable Transformer Network (IMTN) that employs four specialized Transformer encoder blocks to extract spatial and temporal dependencies from heterogeneous inputs. To mitigate the severe class imbalance, our approach uses a single, shared model to predict crash risk for one region at a time, rather than all regions simultaneously. This reformulation avoids data sparsity while retaining multi-region inputs, and a spatial weighting mechanism is used to preserve inter-regional dependencies. Meanwhile, an improved Layer-wise Relevance Propagation (LRP) framework is employed to enhance the interpretability of IMTN. We conduct our experiments on a four-year dataset from London, which includes crash records, public bicycle trips, time, weather, road networks, land use, and a rich set of 48 bicycle infrastructure features. The model comparison demonstrates that IMTN consistently outperforms competitive baselines, reducing false positive rate (FPR) by up to 9.08%, improving the area under the curve (AUC) by up to 3.49%, and increasing the G-mean by up to 5.39%. Our model achieves the best performance at the finest temporal resolution (1-hour aggregation), contrary to common expectations. This suggests that the proposed class imbalance handling method may enhance model performance in high-resolution settings. In addition, interpretability analysis identifies segregated cycle lanes, Sheffield stands, and colored path markings as high-impact infrastructure variables, providing data-driven insights that can help inform urban safety planning.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"220 ","pages":"Article 108169"},"PeriodicalIF":6.2000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0001457525002556","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Effective short-term prediction of bicycle crashes at the urban regional level is critical for proactive infrastructure safety interventions and data-driven traffic management. However, three key challenges persist: (1) inadequate modeling of complex spatiotemporal dependencies in multi-source heterogeneous data; (2) poor handling of extreme class imbalance and lack of interpretability in deep learning-based short-term predictions; and (3) limited exploration of bicycle infrastructure’s role in regional crash risk assessment. In response to these challenges, we propose an Interpretable Multi-variable Transformer Network (IMTN) that employs four specialized Transformer encoder blocks to extract spatial and temporal dependencies from heterogeneous inputs. To mitigate the severe class imbalance, our approach uses a single, shared model to predict crash risk for one region at a time, rather than all regions simultaneously. This reformulation avoids data sparsity while retaining multi-region inputs, and a spatial weighting mechanism is used to preserve inter-regional dependencies. Meanwhile, an improved Layer-wise Relevance Propagation (LRP) framework is employed to enhance the interpretability of IMTN. We conduct our experiments on a four-year dataset from London, which includes crash records, public bicycle trips, time, weather, road networks, land use, and a rich set of 48 bicycle infrastructure features. The model comparison demonstrates that IMTN consistently outperforms competitive baselines, reducing false positive rate (FPR) by up to 9.08%, improving the area under the curve (AUC) by up to 3.49%, and increasing the G-mean by up to 5.39%. Our model achieves the best performance at the finest temporal resolution (1-hour aggregation), contrary to common expectations. This suggests that the proposed class imbalance handling method may enhance model performance in high-resolution settings. In addition, interpretability analysis identifies segregated cycle lanes, Sheffield stands, and colored path markings as high-impact infrastructure variables, providing data-driven insights that can help inform urban safety planning.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.