Zhiqi Shao , Haoning Xi , Haohui Lu , Ze Wang , Michael G.H. Bell , Junbin Gao
{"title":"A spatial–Temporal Large Language Model with Denoising Diffusion Implicit for predictions in centralized multimodal transport systems","authors":"Zhiqi Shao , Haoning Xi , Haohui Lu , Ze Wang , Michael G.H. Bell , Junbin Gao","doi":"10.1016/j.trc.2025.105249","DOIUrl":null,"url":null,"abstract":"<div><div>Centralized multimodal transport systems face significant challenges due to data isolation, missing values, and heterogeneous spatial–temporal features, which hinder accurate prediction in traffic flow and travel demand. To address these challenges, we propose Spatial–Temporal Large Language Model with Denoising Diffusion Implicit (STLLM-DF), an innovative which integrates a Spatial–Temporal Denoising Diffusion Implicit Model (ST-DDIM) with a Spatial–Temporal Large Language Model (ST-LLM) to improve the predictions in traffic flow and travel demand in multimodal transport systems. The ST-DDIM effectively learns data distributions to recover noisy and incomplete data, while the ST-LLM captures complex spatial–temporal dependencies across multimodal networks, eliminating manual feature engineering. Extensive experiments conducted on ten real-world datasets from Sydney demonstrate that STLLM-DF consistently outperforms baseline models in both single-task and multi-task predictions (e.g., ), while consistently excelling in short-term and long-term predictions. On average, STLLM-DF achieves improvements in Mean Absolute Error (MAE) by 2.40%, Root Mean Square Error (RMSE) by 4.50%, and Mean Absolute Percentage Error (MAPE) by 1.51%. Furthermore, we evaluate the noise tolerance of STLLM-DF, demonstrating its robust performance under data imperfections. This paper presents a scalable, data-driven solution for managing multimodal transport systems, offering actionable insights for transport regulators.</div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"179 ","pages":"Article 105249"},"PeriodicalIF":7.6000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X25002530","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Centralized multimodal transport systems face significant challenges due to data isolation, missing values, and heterogeneous spatial–temporal features, which hinder accurate prediction in traffic flow and travel demand. To address these challenges, we propose Spatial–Temporal Large Language Model with Denoising Diffusion Implicit (STLLM-DF), an innovative which integrates a Spatial–Temporal Denoising Diffusion Implicit Model (ST-DDIM) with a Spatial–Temporal Large Language Model (ST-LLM) to improve the predictions in traffic flow and travel demand in multimodal transport systems. The ST-DDIM effectively learns data distributions to recover noisy and incomplete data, while the ST-LLM captures complex spatial–temporal dependencies across multimodal networks, eliminating manual feature engineering. Extensive experiments conducted on ten real-world datasets from Sydney demonstrate that STLLM-DF consistently outperforms baseline models in both single-task and multi-task predictions (e.g., ), while consistently excelling in short-term and long-term predictions. On average, STLLM-DF achieves improvements in Mean Absolute Error (MAE) by 2.40%, Root Mean Square Error (RMSE) by 4.50%, and Mean Absolute Percentage Error (MAPE) by 1.51%. Furthermore, we evaluate the noise tolerance of STLLM-DF, demonstrating its robust performance under data imperfections. This paper presents a scalable, data-driven solution for managing multimodal transport systems, offering actionable insights for transport regulators.
期刊介绍:
Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.