驾驶任务驱动下驾驶员视觉注意的模型研究

IF 7.5 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-06-20 DOI:10.1016/j.engappai.2025.111382

Chuan Xu , Bo Jiang , Yukun Wang , Yan Su

{"title":"驾驶任务驱动下驾驶员视觉注意的模型研究","authors":"Chuan Xu , Bo Jiang , Yukun Wang , Yan Su","doi":"10.1016/j.engappai.2025.111382","DOIUrl":null,"url":null,"abstract":"<div><div>Visual attention is an indispensable component of driving, enabling drivers to swiftly identify critical objects within complex and dynamic traffic environments. Despite its significance, existing visual attention models predominantly focus on static or idealized driving scenarios, limiting their ability to capture attention distribution patterns in real-world, dynamic environments. Furthermore, most of these models rely heavily on data-driven approaches, extracting features exclusively from visual image data, while neglecting the profound influence of “the driver, the vehicle, and the road environment”. Consequently, these models frequently fail to effectively address the intricacies of practical driving scenarios. To bridge these gaps, this study introduces a driver visual attention prediction model that comprehensively incorporates the driving task, driver experience, and the impact of dynamic visual scenes. The proposed model leverages the advanced learning capabilities of Convolutional Neural Networks (CNN) and Vision Transformer (ViT), coupled with sequence modeling mechanisms, to effectively capture the nuanced attention allocation patterns of drivers in complex driving contexts. The model is meticulously designed to adapt to dynamically evolving driving task requirements. Experimental results demonstrate that the proposed model outperforms state-of-the-art (SOTA) visual attention prediction models across multiple benchmark evaluation metrics on the DR(eye)VE dataset, particularly excelling in dynamic driving conditions. Moreover, generalization experiments were conducted on the BDD-A and TDV datasets validate the model’s robustness and applicability across varied driving tasks and dynamic conditions.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"157 ","pages":"Article 111382"},"PeriodicalIF":7.5000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A modeled study of driver visual attention driven by driving tasks\",\"authors\":\"Chuan Xu , Bo Jiang , Yukun Wang , Yan Su\",\"doi\":\"10.1016/j.engappai.2025.111382\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Visual attention is an indispensable component of driving, enabling drivers to swiftly identify critical objects within complex and dynamic traffic environments. Despite its significance, existing visual attention models predominantly focus on static or idealized driving scenarios, limiting their ability to capture attention distribution patterns in real-world, dynamic environments. Furthermore, most of these models rely heavily on data-driven approaches, extracting features exclusively from visual image data, while neglecting the profound influence of “the driver, the vehicle, and the road environment”. Consequently, these models frequently fail to effectively address the intricacies of practical driving scenarios. To bridge these gaps, this study introduces a driver visual attention prediction model that comprehensively incorporates the driving task, driver experience, and the impact of dynamic visual scenes. The proposed model leverages the advanced learning capabilities of Convolutional Neural Networks (CNN) and Vision Transformer (ViT), coupled with sequence modeling mechanisms, to effectively capture the nuanced attention allocation patterns of drivers in complex driving contexts. The model is meticulously designed to adapt to dynamically evolving driving task requirements. Experimental results demonstrate that the proposed model outperforms state-of-the-art (SOTA) visual attention prediction models across multiple benchmark evaluation metrics on the DR(eye)VE dataset, particularly excelling in dynamic driving conditions. Moreover, generalization experiments were conducted on the BDD-A and TDV datasets validate the model’s robustness and applicability across varied driving tasks and dynamic conditions.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"157 \",\"pages\":\"Article 111382\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625013843\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625013843","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

视觉注意力是驾驶不可或缺的组成部分，它使驾驶员能够在复杂和动态的交通环境中迅速识别关键物体。尽管具有重要意义，但现有的视觉注意模型主要关注静态或理想化的驾驶场景，限制了它们捕捉现实世界动态环境中注意力分布模式的能力。此外，这些模型大多依赖于数据驱动的方法，仅从视觉图像数据中提取特征，而忽略了“驾驶员、车辆和道路环境”的深刻影响。因此，这些模型经常不能有效地解决实际驾驶场景的复杂性。为了弥补这些差距，本研究引入了一种综合考虑驾驶任务、驾驶员经验和动态视觉场景影响的驾驶员视觉注意力预测模型。该模型利用卷积神经网络（CNN）和视觉变换（ViT）的先进学习能力，结合序列建模机制，有效捕捉复杂驾驶环境中驾驶员细微的注意力分配模式。该模型经过精心设计，以适应动态发展的驾驶任务要求。实验结果表明，该模型在DR(eye)VE数据集的多个基准评估指标上优于最先进的视觉注意力预测模型，特别是在动态驾驶条件下表现出色。在BDD-A和TDV数据集上进行了泛化实验，验证了该模型在不同驾驶任务和动态条件下的鲁棒性和适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A modeled study of driver visual attention driven by driving tasks

Visual attention is an indispensable component of driving, enabling drivers to swiftly identify critical objects within complex and dynamic traffic environments. Despite its significance, existing visual attention models predominantly focus on static or idealized driving scenarios, limiting their ability to capture attention distribution patterns in real-world, dynamic environments. Furthermore, most of these models rely heavily on data-driven approaches, extracting features exclusively from visual image data, while neglecting the profound influence of “the driver, the vehicle, and the road environment”. Consequently, these models frequently fail to effectively address the intricacies of practical driving scenarios. To bridge these gaps, this study introduces a driver visual attention prediction model that comprehensively incorporates the driving task, driver experience, and the impact of dynamic visual scenes. The proposed model leverages the advanced learning capabilities of Convolutional Neural Networks (CNN) and Vision Transformer (ViT), coupled with sequence modeling mechanisms, to effectively capture the nuanced attention allocation patterns of drivers in complex driving contexts. The model is meticulously designed to adapt to dynamically evolving driving task requirements. Experimental results demonstrate that the proposed model outperforms state-of-the-art (SOTA) visual attention prediction models across multiple benchmark evaluation metrics on the DR(eye)VE dataset, particularly excelling in dynamic driving conditions. Moreover, generalization experiments were conducted on the BDD-A and TDV datasets validate the model’s robustness and applicability across varied driving tasks and dynamic conditions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.