Multimodal Large-Language Model Empowering Next-Generation Autonomous Driving Systems

IF 7.8
Zhiqiang Hu;Mingxing Xu;Qixiu Cheng
{"title":"Multimodal Large-Language Model Empowering Next-Generation Autonomous Driving Systems","authors":"Zhiqiang Hu;Mingxing Xu;Qixiu Cheng","doi":"10.26599/JICV.2025.9210059","DOIUrl":null,"url":null,"abstract":"Autonomous driving technology has made significant advancements in recent years. The evolution of autonomous driving systems from traditional modular designs to end-to-end learning paradigms has led to comprehensive improvements in driving capabilities. In modular designs, driving tasks are segmented into independent modules, such as perception, decision-making, planning, and control. This modular structure offers high explainability and safety in simple scenarios but is hindered by limited generalizability in complex traffic environments, and the sequential connection of multiple modules often leads to error accumulation. In contrast, end-to-end methods process perception data directly to produce control outputs, thereby mitigating information loss and sequential error accumulation, ultimately improving scene generalization in diverse environments. However, this approach is limited by strong data dependency, low interpretability, and inadequate handling of long-tail scenarios (Zhao et al., 2024).","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 2","pages":"9210059-1-9210059-3"},"PeriodicalIF":7.8000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11083713","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent and Connected Vehicles","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11083713/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Autonomous driving technology has made significant advancements in recent years. The evolution of autonomous driving systems from traditional modular designs to end-to-end learning paradigms has led to comprehensive improvements in driving capabilities. In modular designs, driving tasks are segmented into independent modules, such as perception, decision-making, planning, and control. This modular structure offers high explainability and safety in simple scenarios but is hindered by limited generalizability in complex traffic environments, and the sequential connection of multiple modules often leads to error accumulation. In contrast, end-to-end methods process perception data directly to produce control outputs, thereby mitigating information loss and sequential error accumulation, ultimately improving scene generalization in diverse environments. However, this approach is limited by strong data dependency, low interpretability, and inadequate handling of long-tail scenarios (Zhao et al., 2024).
支持下一代自动驾驶系统的多模态大语言模型
近年来,自动驾驶技术取得了重大进展。自动驾驶系统从传统的模块化设计向端到端学习范式的演变,导致了驾驶能力的全面提高。在模块化设计中,驾驶任务被分割成独立的模块,如感知、决策、规划和控制。这种模块化结构在简单场景下具有较高的可解释性和安全性,但在复杂交通环境下泛化能力有限,且多个模块的顺序连接往往导致错误积累。相比之下,端到端方法直接处理感知数据以产生控制输出,从而减少信息丢失和顺序误差积累,最终提高不同环境下的场景泛化。然而,这种方法受到数据依赖性强、可解释性低以及对长尾场景处理不足的限制(Zhao et al., 2024)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.10
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信