traffic - it:增强对多模态大语言模型的交通场景理解

IF 7.6 1区 工程技术 Q1 TRANSPORTATION SCIENCE & TECHNOLOGY
Senyun Kuang , Yang Liu , Xiaobo Qu , Yintao Wei
{"title":"traffic - it:增强对多模态大语言模型的交通场景理解","authors":"Senyun Kuang ,&nbsp;Yang Liu ,&nbsp;Xiaobo Qu ,&nbsp;Yintao Wei","doi":"10.1016/j.trc.2025.105325","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, the convergence of artificial intelligence and urban infrastructure has driven transformative advances in intelligent transportation systems (ITS). However, traditional models often lack the generalizability needed to adapt to diverse traffic scenarios. Multimodal large language models (MLLMs) offer a promising solution, yet they are typically trained on general datasets, limiting their effectiveness in specific transportation contexts. To address this, we introduce Traffic-IT, a dataset comprising 220,950 question-and-answer pairs from 30,000 images, designed to enhance MLLMs’ capabilities in traffic scene understanding. The dataset covers various traffic scenarios, including weather conditions, locations, and times of day, providing in-depth insights and driving strategies tailored to real-world needs. Created through expert consultation and rigorous validation, Traffic-IT significantly improves MLLMs’ performance in interpreting complex traffic scenes. We anticipate that Traffic-IT will be a crucial resource for future developments in smart city applications.</div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"180 ","pages":"Article 105325"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Traffic-IT: Enhancing traffic scene understanding for multimodal large language models\",\"authors\":\"Senyun Kuang ,&nbsp;Yang Liu ,&nbsp;Xiaobo Qu ,&nbsp;Yintao Wei\",\"doi\":\"10.1016/j.trc.2025.105325\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In recent years, the convergence of artificial intelligence and urban infrastructure has driven transformative advances in intelligent transportation systems (ITS). However, traditional models often lack the generalizability needed to adapt to diverse traffic scenarios. Multimodal large language models (MLLMs) offer a promising solution, yet they are typically trained on general datasets, limiting their effectiveness in specific transportation contexts. To address this, we introduce Traffic-IT, a dataset comprising 220,950 question-and-answer pairs from 30,000 images, designed to enhance MLLMs’ capabilities in traffic scene understanding. The dataset covers various traffic scenarios, including weather conditions, locations, and times of day, providing in-depth insights and driving strategies tailored to real-world needs. Created through expert consultation and rigorous validation, Traffic-IT significantly improves MLLMs’ performance in interpreting complex traffic scenes. We anticipate that Traffic-IT will be a crucial resource for future developments in smart city applications.</div></div>\",\"PeriodicalId\":54417,\"journal\":{\"name\":\"Transportation Research Part C-Emerging Technologies\",\"volume\":\"180 \",\"pages\":\"Article 105325\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part C-Emerging Technologies\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0968090X25003298\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X25003298","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

近年来,人工智能与城市基础设施的融合推动了智能交通系统(ITS)的变革性进步。然而,传统模型往往缺乏适应各种交通场景所需的通用性。多模式大型语言模型(mllm)提供了一个很有前途的解决方案,但它们通常是在一般数据集上训练的,限制了它们在特定运输环境中的有效性。为了解决这个问题,我们引入了traffic - it,这是一个包含来自30,000张图像的220,950对问答的数据集,旨在增强mlm在交通场景理解方面的能力。该数据集涵盖了各种交通场景,包括天气状况、地点和一天中的时间,提供了针对现实需求的深入见解和驾驶策略。通过专家咨询和严格的验证,traffic - it显著提高了mlms在解释复杂交通场景方面的表现。我们预计,交通信息技术将成为未来智慧城市应用发展的重要资源。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Traffic-IT: Enhancing traffic scene understanding for multimodal large language models
In recent years, the convergence of artificial intelligence and urban infrastructure has driven transformative advances in intelligent transportation systems (ITS). However, traditional models often lack the generalizability needed to adapt to diverse traffic scenarios. Multimodal large language models (MLLMs) offer a promising solution, yet they are typically trained on general datasets, limiting their effectiveness in specific transportation contexts. To address this, we introduce Traffic-IT, a dataset comprising 220,950 question-and-answer pairs from 30,000 images, designed to enhance MLLMs’ capabilities in traffic scene understanding. The dataset covers various traffic scenarios, including weather conditions, locations, and times of day, providing in-depth insights and driving strategies tailored to real-world needs. Created through expert consultation and rigorous validation, Traffic-IT significantly improves MLLMs’ performance in interpreting complex traffic scenes. We anticipate that Traffic-IT will be a crucial resource for future developments in smart city applications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
15.80
自引率
12.00%
发文量
332
审稿时长
64 days
期刊介绍: Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信