Zhuang Zhuang , Lingbo Liu , Kan Guo , Xingtong Yu , Heng Qi , Yanming Shen , Baocai Yin
{"title":"MDPM: Modulating domain-specific prompt memory for multi-domain traffic flow prediction with transformers","authors":"Zhuang Zhuang , Lingbo Liu , Kan Guo , Xingtong Yu , Heng Qi , Yanming Shen , Baocai Yin","doi":"10.1016/j.knosys.2025.113881","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-domain traffic flow prediction aims to develop a versatile model that uses historical traffic data from various sources to forecast future traffic conditions across these individual datasets. Existing deep traffic prediction models typically focus on mining spatial–temporal relationships in a single dataset. However, there are two limitations should be considered: <strong>Lack of Model Universality</strong>, current traffic prediction research remains constrained by the absence of a universal model adaptable to multiple datasets, restricting performance improvement across diverse scenarios; <strong>Underutilized Cross-Dataset Similarities</strong>, while existing datasets exhibit both exclusive and shared spatial–temporal patterns, effectively leveraging these common patterns to enhance model performance continues to present technical challenges. To overcome the limitations mentioned above, this study introduces a straightforward yet efficient <u>M</u>odulating <u>D</u>omain-Specific <u>P</u>rompt <u>M</u>emory (MDPM) to model complex spatial–temporal interaction and better leverage similar spatial–temporal patterns across diverse datasets. Specifically, our approach is tailored with three key innovations: (1) A domain-shared encoder incorporating intra-modality Spatial–Temporal Rotary Position Encoder (ST2R) to capture universal patterns; (2) A gate fusion mechanism enhanced by contrastive learning with inter-modality ST2R to optimize spatial–temporal feature alignment; (3) Domain-specific learnable prompt vectors that dynamically guide each transformer layer in capturing unique urban traffic characteristics at node-level temporal granularity. <strong>Notably, this architecture achieves state-of-the-art performance without requiring supplementary road network data.</strong> Comprehensive experiments conducted on six real-world public traffic datasets show that our proposed method significantly surpasses existing state-of-the-art approaches.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"325 ","pages":"Article 113881"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095070512500927X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Multi-domain traffic flow prediction aims to develop a versatile model that uses historical traffic data from various sources to forecast future traffic conditions across these individual datasets. Existing deep traffic prediction models typically focus on mining spatial–temporal relationships in a single dataset. However, there are two limitations should be considered: Lack of Model Universality, current traffic prediction research remains constrained by the absence of a universal model adaptable to multiple datasets, restricting performance improvement across diverse scenarios; Underutilized Cross-Dataset Similarities, while existing datasets exhibit both exclusive and shared spatial–temporal patterns, effectively leveraging these common patterns to enhance model performance continues to present technical challenges. To overcome the limitations mentioned above, this study introduces a straightforward yet efficient Modulating Domain-Specific Prompt Memory (MDPM) to model complex spatial–temporal interaction and better leverage similar spatial–temporal patterns across diverse datasets. Specifically, our approach is tailored with three key innovations: (1) A domain-shared encoder incorporating intra-modality Spatial–Temporal Rotary Position Encoder (ST2R) to capture universal patterns; (2) A gate fusion mechanism enhanced by contrastive learning with inter-modality ST2R to optimize spatial–temporal feature alignment; (3) Domain-specific learnable prompt vectors that dynamically guide each transformer layer in capturing unique urban traffic characteristics at node-level temporal granularity. Notably, this architecture achieves state-of-the-art performance without requiring supplementary road network data. Comprehensive experiments conducted on six real-world public traffic datasets show that our proposed method significantly surpasses existing state-of-the-art approaches.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.