Junfeng Zhang;Cheng Xie;Hongming Cai;Weiming Shen;Rui Yang
{"title":"基于知识蒸馏的时空 MLP 模型用于实时交通流量预测","authors":"Junfeng Zhang;Cheng Xie;Hongming Cai;Weiming Shen;Rui Yang","doi":"10.1109/TITS.2024.3424808","DOIUrl":null,"url":null,"abstract":"Real-Time Traffic Flow Prediction (RT-TFP) is one of the critical technologies for implementing the Intelligent Transportation System (ITS), enabling rapid and accurate prediction of real-time traffic flow at intersections. RT-TFP typically needs to be deployed on-site edge devices for real-time traffic flow calculation that requires low inference latency and minimal computational resources. However, the existing Traffic Flow Prediction (TFP) models are generally based on spatiotemporal graph neural networks (STGNNs), which are complex and require high computational resources and relatively high inference times that can hardly be deployed on edge devices. To this end, this work proposes a simple RT-TFP model, SpatioTemporal-MultiLayer Perceptron (ST-MLP), which requires low computational resources and inference times. The base idea of this work is to establish a spatio-temporal MLP model to replace the STGNN model for conducting the TFP, which is much faster and simpler. Specifically, first, a TempEncoder is proposed to encode the temporal information into the MLP features. Then, a Spatiotemporal Mixer is proposed to mix spatial information into the temporal-enriched MLP features. After, MLP features are distilled from a complex STGNN model to obtain a simple MLP that inherits complete Spatial-Temporal information of the traffic graph. The experimental results on four real-world datasets show the proposed model achieves competitive prediction accuracy with STGNN models in much fewer computational resources and lower prediction time costs. It is worth noting that, the proposed method is faster than the compared STGNNs by an average of 21.62 times (~10.81s \n<inline-formula> <tex-math>$\\rightsquigarrow ~\\sim 0.50$ </tex-math></inline-formula>\ns). Interestingly, the proposed ST-MLP even has a −3.23% error rate decreasing on average compared to the corresponding STGNN model. Moreover, the error rate of the proposed ST-MLP decreases over pure MLPs by −3.92% \n<inline-formula> <tex-math>$\\sim -42.62$ </tex-math></inline-formula>\n%. The source code is available at: \n<uri>https://github.com/zhangjunfeng1234/ST-MLP</uri>","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"25 11","pages":"18122-18135"},"PeriodicalIF":7.9000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Knowledge Distillation-Based Spatio-Temporal MLP Model for Real-Time Traffic Flow Prediction\",\"authors\":\"Junfeng Zhang;Cheng Xie;Hongming Cai;Weiming Shen;Rui Yang\",\"doi\":\"10.1109/TITS.2024.3424808\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real-Time Traffic Flow Prediction (RT-TFP) is one of the critical technologies for implementing the Intelligent Transportation System (ITS), enabling rapid and accurate prediction of real-time traffic flow at intersections. RT-TFP typically needs to be deployed on-site edge devices for real-time traffic flow calculation that requires low inference latency and minimal computational resources. However, the existing Traffic Flow Prediction (TFP) models are generally based on spatiotemporal graph neural networks (STGNNs), which are complex and require high computational resources and relatively high inference times that can hardly be deployed on edge devices. To this end, this work proposes a simple RT-TFP model, SpatioTemporal-MultiLayer Perceptron (ST-MLP), which requires low computational resources and inference times. The base idea of this work is to establish a spatio-temporal MLP model to replace the STGNN model for conducting the TFP, which is much faster and simpler. Specifically, first, a TempEncoder is proposed to encode the temporal information into the MLP features. Then, a Spatiotemporal Mixer is proposed to mix spatial information into the temporal-enriched MLP features. After, MLP features are distilled from a complex STGNN model to obtain a simple MLP that inherits complete Spatial-Temporal information of the traffic graph. The experimental results on four real-world datasets show the proposed model achieves competitive prediction accuracy with STGNN models in much fewer computational resources and lower prediction time costs. It is worth noting that, the proposed method is faster than the compared STGNNs by an average of 21.62 times (~10.81s \\n<inline-formula> <tex-math>$\\\\rightsquigarrow ~\\\\sim 0.50$ </tex-math></inline-formula>\\ns). Interestingly, the proposed ST-MLP even has a −3.23% error rate decreasing on average compared to the corresponding STGNN model. Moreover, the error rate of the proposed ST-MLP decreases over pure MLPs by −3.92% \\n<inline-formula> <tex-math>$\\\\sim -42.62$ </tex-math></inline-formula>\\n%. The source code is available at: \\n<uri>https://github.com/zhangjunfeng1234/ST-MLP</uri>\",\"PeriodicalId\":13416,\"journal\":{\"name\":\"IEEE Transactions on Intelligent Transportation Systems\",\"volume\":\"25 11\",\"pages\":\"18122-18135\"},\"PeriodicalIF\":7.9000,\"publicationDate\":\"2024-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Intelligent Transportation Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10689468/\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10689468/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Knowledge Distillation-Based Spatio-Temporal MLP Model for Real-Time Traffic Flow Prediction
Real-Time Traffic Flow Prediction (RT-TFP) is one of the critical technologies for implementing the Intelligent Transportation System (ITS), enabling rapid and accurate prediction of real-time traffic flow at intersections. RT-TFP typically needs to be deployed on-site edge devices for real-time traffic flow calculation that requires low inference latency and minimal computational resources. However, the existing Traffic Flow Prediction (TFP) models are generally based on spatiotemporal graph neural networks (STGNNs), which are complex and require high computational resources and relatively high inference times that can hardly be deployed on edge devices. To this end, this work proposes a simple RT-TFP model, SpatioTemporal-MultiLayer Perceptron (ST-MLP), which requires low computational resources and inference times. The base idea of this work is to establish a spatio-temporal MLP model to replace the STGNN model for conducting the TFP, which is much faster and simpler. Specifically, first, a TempEncoder is proposed to encode the temporal information into the MLP features. Then, a Spatiotemporal Mixer is proposed to mix spatial information into the temporal-enriched MLP features. After, MLP features are distilled from a complex STGNN model to obtain a simple MLP that inherits complete Spatial-Temporal information of the traffic graph. The experimental results on four real-world datasets show the proposed model achieves competitive prediction accuracy with STGNN models in much fewer computational resources and lower prediction time costs. It is worth noting that, the proposed method is faster than the compared STGNNs by an average of 21.62 times (~10.81s
$\rightsquigarrow ~\sim 0.50$
s). Interestingly, the proposed ST-MLP even has a −3.23% error rate decreasing on average compared to the corresponding STGNN model. Moreover, the error rate of the proposed ST-MLP decreases over pure MLPs by −3.92%
$\sim -42.62$
%. The source code is available at:
https://github.com/zhangjunfeng1234/ST-MLP
期刊介绍:
The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.