InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer
Bo Zhang; Heye Huang; Chunyang Liu; Yaqin Zhang; Zhenhua Xu
Journal of Intelligent and Connected Vehicles, vol. 8, no. 2, pp. 9210060-1-9210060-8, June 2025
DOI: 10.26599/JICV.2025.9210060
URL: https://ieeexplore.ieee.org/document/11083708/

Abstract:
End-to-end autonomous driving, with its holistic optimization capabilities, has gained increasing traction in academia and industry. Vectorized representations, which preserve instance-level topological information while reducing computational overhead, have emerged as promising paradigms. However, existing vectorized query-based frameworks often overlook the inherent spatial correlations among intra-instance points, resulting in geometrically inconsistent outputs (e.g., fragmented HD map elements or oscillatory trajectories). To address these limitations, we propose intra-instance vectorized driving transformer (InVDriver), a novel vectorized query-based system that systematically models intra-instance spatial dependencies through masked self-attention layers, thereby enhancing planning accuracy and trajectory smoothness. Across all core modules, i.e., perception, prediction, and planning, InVDriver incorporates masked self-attention mechanisms that restrict attention to intra-instance point interactions, enabling coordinated refinement of structural elements while suppressing irrelevant inter-instance noise. The experimental results on the nuScenes benchmark demonstrate that InVDriver achieves state-of-the-art performance, surpassing prior methods in both accuracy and safety, while maintaining high computational efficiency.
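The abstract describes restricting self-attention to interactions among points of the same instance via a mask. The snippet below is a minimal sketch of that general idea, not the authors' implementation: it assumes each of N instances (e.g., a map element or an agent trajectory) is represented by P point queries, and builds a boolean mask that blocks attention across instances. All shapes, the embedding size, and the helper name intra_instance_mask are illustrative assumptions.

```python
# Sketch of intra-instance masked self-attention (illustrative, not the paper's code).
import torch
import torch.nn as nn


def intra_instance_mask(num_instances: int, points_per_instance: int) -> torch.Tensor:
    """Boolean mask of shape (N*P, N*P); True marks attention positions to block."""
    total = num_instances * points_per_instance
    instance_id = torch.arange(total) // points_per_instance
    # Block attention between point queries whose instance ids differ.
    return instance_id.unsqueeze(0) != instance_id.unsqueeze(1)


# Illustrative sizes: 4 instances, 20 points each, 256-d query embeddings.
num_instances, points_per_instance, embed_dim = 4, 20, 256
queries = torch.randn(1, num_instances * points_per_instance, embed_dim)

attn = nn.MultiheadAttention(embed_dim, num_heads=8, batch_first=True)
mask = intra_instance_mask(num_instances, points_per_instance)
refined, _ = attn(queries, queries, queries, attn_mask=mask)
print(refined.shape)  # torch.Size([1, 80, 256])
```

Under this masking, each point query is refined only by other points of the same instance, which is the mechanism the abstract credits for coordinated structural refinement while suppressing inter-instance noise.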