Ruiyuan Jiang , Shangbo Wang , Wei Ma , Yuli Zhang , Pengfei Fan , Dongyao Jia
{"title":"基于知识的车道级交通流预测动态关联建模框架","authors":"Ruiyuan Jiang , Shangbo Wang , Wei Ma , Yuli Zhang , Pengfei Fan , Dongyao Jia","doi":"10.1016/j.inffus.2025.103327","DOIUrl":null,"url":null,"abstract":"<div><div>Lane-level traffic prediction forecasts near-future conditions at specific lane segments, enabling real-time traffic management and particularly aiding autonomous vehicles (AVs) in precise tasks such as car-following and lane changes. Despite substantial advancements in this field, some key challenges remain. First, the traffic state of a lane segment exhibits dynamic, nonlinear spatial correlation with other segments, making accurate modeling complex in real-world environments. Second, existing deep learning models depend heavily on specific datasets, leading to poor generalization. Third, while recent studies have shown that Large Language Models (LLMs) exhibit superior performance in generating reliable traffic prediction results, their direct application is hindered by inefficiency, high computational costs, and difficulties in capturing dynamic traffic features. To address these challenges, we propose the Knowledge-informed Dynamic Correlation Modeling (KIDCM) framework, which integrates pre-trained LLMs with traditional predictive methodologies to achieve a balance between generalization and prediction accuracy. Specifically, we introduce a General Spatial Dynamics Modeling (GSDM) method, which leverages the unbiased traffic data generated by LLM to analyze the general law dynamic spatial correlations. By integrating traditional time-series models with attention mechanisms, GSDM effectively models both linear temporal dependencies and nonlinear spatial interactions, ensuring robust generalization across varying conditions. Additionally, we develop a surrogate model that distills the traffic prediction function of LLMs. This surrogate model can be fine-tuned with small sample sizes, preserving the generalization advantages of LLMs while mitigating their typically high resource demands. Extensive evaluations demonstrate that our framework outperforms state-of-the-art models in terms of generalization, small-sample training, and computational cost.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"124 ","pages":"Article 103327"},"PeriodicalIF":14.7000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A knowledge-informed dynamic correlation modeling framework for lane-level traffic flow prediction\",\"authors\":\"Ruiyuan Jiang , Shangbo Wang , Wei Ma , Yuli Zhang , Pengfei Fan , Dongyao Jia\",\"doi\":\"10.1016/j.inffus.2025.103327\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Lane-level traffic prediction forecasts near-future conditions at specific lane segments, enabling real-time traffic management and particularly aiding autonomous vehicles (AVs) in precise tasks such as car-following and lane changes. Despite substantial advancements in this field, some key challenges remain. First, the traffic state of a lane segment exhibits dynamic, nonlinear spatial correlation with other segments, making accurate modeling complex in real-world environments. Second, existing deep learning models depend heavily on specific datasets, leading to poor generalization. Third, while recent studies have shown that Large Language Models (LLMs) exhibit superior performance in generating reliable traffic prediction results, their direct application is hindered by inefficiency, high computational costs, and difficulties in capturing dynamic traffic features. To address these challenges, we propose the Knowledge-informed Dynamic Correlation Modeling (KIDCM) framework, which integrates pre-trained LLMs with traditional predictive methodologies to achieve a balance between generalization and prediction accuracy. Specifically, we introduce a General Spatial Dynamics Modeling (GSDM) method, which leverages the unbiased traffic data generated by LLM to analyze the general law dynamic spatial correlations. By integrating traditional time-series models with attention mechanisms, GSDM effectively models both linear temporal dependencies and nonlinear spatial interactions, ensuring robust generalization across varying conditions. Additionally, we develop a surrogate model that distills the traffic prediction function of LLMs. This surrogate model can be fine-tuned with small sample sizes, preserving the generalization advantages of LLMs while mitigating their typically high resource demands. Extensive evaluations demonstrate that our framework outperforms state-of-the-art models in terms of generalization, small-sample training, and computational cost.</div></div>\",\"PeriodicalId\":50367,\"journal\":{\"name\":\"Information Fusion\",\"volume\":\"124 \",\"pages\":\"Article 103327\"},\"PeriodicalIF\":14.7000,\"publicationDate\":\"2025-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Fusion\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1566253525004002\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525004002","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
A knowledge-informed dynamic correlation modeling framework for lane-level traffic flow prediction
Lane-level traffic prediction forecasts near-future conditions at specific lane segments, enabling real-time traffic management and particularly aiding autonomous vehicles (AVs) in precise tasks such as car-following and lane changes. Despite substantial advancements in this field, some key challenges remain. First, the traffic state of a lane segment exhibits dynamic, nonlinear spatial correlation with other segments, making accurate modeling complex in real-world environments. Second, existing deep learning models depend heavily on specific datasets, leading to poor generalization. Third, while recent studies have shown that Large Language Models (LLMs) exhibit superior performance in generating reliable traffic prediction results, their direct application is hindered by inefficiency, high computational costs, and difficulties in capturing dynamic traffic features. To address these challenges, we propose the Knowledge-informed Dynamic Correlation Modeling (KIDCM) framework, which integrates pre-trained LLMs with traditional predictive methodologies to achieve a balance between generalization and prediction accuracy. Specifically, we introduce a General Spatial Dynamics Modeling (GSDM) method, which leverages the unbiased traffic data generated by LLM to analyze the general law dynamic spatial correlations. By integrating traditional time-series models with attention mechanisms, GSDM effectively models both linear temporal dependencies and nonlinear spatial interactions, ensuring robust generalization across varying conditions. Additionally, we develop a surrogate model that distills the traffic prediction function of LLMs. This surrogate model can be fine-tuned with small sample sizes, preserving the generalization advantages of LLMs while mitigating their typically high resource demands. Extensive evaluations demonstrate that our framework outperforms state-of-the-art models in terms of generalization, small-sample training, and computational cost.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.