Xiaohui Yang , Yu Qiao , Tongzhen Si , Jing Wang , Tao Xu
{"title":"Eye-SCAN: Eye-Movement-Attention-based Spatial Channel Adaptive Network for traffic accident prediction","authors":"Xiaohui Yang , Yu Qiao , Tongzhen Si , Jing Wang , Tao Xu","doi":"10.1016/j.patcog.2025.111590","DOIUrl":null,"url":null,"abstract":"<div><div>In the task of using visual cues extracted from DashCam video data to predict future accidents, understanding the dynamic spatio-temporal interactions in driving scenarios poses a major challenge. Given that the gaze attention information of experienced drivers during the driving process involves complex spatio-temporal interactions, this information can provide valuable guidance for training accident prediction models. Therefore, we propose an Eye-Movement-Attention-based Spatial Channel Adaptive Network (Eye-SCAN) for traffic accident prediction, which can efficiently learn multi-scale spatial channel information from driver gaze data. To integrate potential guidance information from driver eye movement information (EyeInfo) into Eye-SCAN, we propose two sub-modules in our model: the Spatial Adaptive Module (SAM), which helps Eye-SCAN adaptively learn low-dimensional spatial features of EyeInfo; and the Channel Adaptive Module (CAM), which aids Eye-SCAN to adaptively learning high-dimensional channel features of EyeInfo. Additionally, we introduce a novel recursive transmission strategy for temporal information to mitigate the impact of varying past results on the model’s current inferences. Experimental results demonstrate that our model outperforms state-of-the-art methods on two benchmark datasets, highlighting the contributions of each component and offering an effective solution for enhancing the safety of intelligent vehicles.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"165 ","pages":"Article 111590"},"PeriodicalIF":7.5000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003132032500250X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In the task of using visual cues extracted from DashCam video data to predict future accidents, understanding the dynamic spatio-temporal interactions in driving scenarios poses a major challenge. Given that the gaze attention information of experienced drivers during the driving process involves complex spatio-temporal interactions, this information can provide valuable guidance for training accident prediction models. Therefore, we propose an Eye-Movement-Attention-based Spatial Channel Adaptive Network (Eye-SCAN) for traffic accident prediction, which can efficiently learn multi-scale spatial channel information from driver gaze data. To integrate potential guidance information from driver eye movement information (EyeInfo) into Eye-SCAN, we propose two sub-modules in our model: the Spatial Adaptive Module (SAM), which helps Eye-SCAN adaptively learn low-dimensional spatial features of EyeInfo; and the Channel Adaptive Module (CAM), which aids Eye-SCAN to adaptively learning high-dimensional channel features of EyeInfo. Additionally, we introduce a novel recursive transmission strategy for temporal information to mitigate the impact of varying past results on the model’s current inferences. Experimental results demonstrate that our model outperforms state-of-the-art methods on two benchmark datasets, highlighting the contributions of each component and offering an effective solution for enhancing the safety of intelligent vehicles.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.