Chunsheng Liu;Jincan Xie;Faliang Chang;Tuo Li;Shuang Li;Yinhai Wang
{"title":"End-Point Drive and Reverse Enhanced Decoding- Based Traffic Participants Trajectory Prediction Under Bird’s Eye View","authors":"Chunsheng Liu;Jincan Xie;Faliang Chang;Tuo Li;Shuang Li;Yinhai Wang","doi":"10.1109/TITS.2025.3554714","DOIUrl":null,"url":null,"abstract":"Trajectory prediction under Bird’s Eye View (BEV) refers to predicting the movement intention of agents based on the historical observation trajectories, which is of great significance for autonomous driving, driving safety and social navigation. The traffic prediction trajectory is multimodality with multiple reasonable trajectories and various prediction time, which often suffer complex agents interaction, cumulative errors and a large number of agents. To overcome these problems, we explore the BEV-based traffic participants trajectory prediction problem and propose the novel End-point Drive and Reverse Enhanced Decoding Network (EDRED-TPNet), based on the mechanisms of end-point driving and reverse enhanced decoding. Firstly, the Encoder based on Dynamic Spatio-temporal Graph and Multimodality Coding Fusion (ST-MC-Encoder) are constructed to effectively represent complex traffic scenario with changeable agents, encode social interactions with historical trajectories, social interaction, future trajectory and multimodality. Secondly, the End-Point Drive Module is proposed to predict the end point before predicting the complete trajectory, thus providing more accurate trajectory prediction; Lastly, to further improve the long-term prediction performance, the Reverse Enhanced Decoder (RE-Decoder) is proposed to fuse forward and reverse hidden state vectors to obtain diverse trajectories that conform to physical and social acceptability rules. We build the first AAV-captured Trajectory Prediction Dataset (UTP-Dataset) for traffic participants trajectory prediction. Experimental results show that the proposed methods can fulfill the multi-target trajectory prediction task in complex traffic scenarios and achieve high performance.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"26 6","pages":"7792-7806"},"PeriodicalIF":8.4000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10967072/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
Trajectory prediction under Bird’s Eye View (BEV) refers to predicting the movement intention of agents based on the historical observation trajectories, which is of great significance for autonomous driving, driving safety and social navigation. The traffic prediction trajectory is multimodality with multiple reasonable trajectories and various prediction time, which often suffer complex agents interaction, cumulative errors and a large number of agents. To overcome these problems, we explore the BEV-based traffic participants trajectory prediction problem and propose the novel End-point Drive and Reverse Enhanced Decoding Network (EDRED-TPNet), based on the mechanisms of end-point driving and reverse enhanced decoding. Firstly, the Encoder based on Dynamic Spatio-temporal Graph and Multimodality Coding Fusion (ST-MC-Encoder) are constructed to effectively represent complex traffic scenario with changeable agents, encode social interactions with historical trajectories, social interaction, future trajectory and multimodality. Secondly, the End-Point Drive Module is proposed to predict the end point before predicting the complete trajectory, thus providing more accurate trajectory prediction; Lastly, to further improve the long-term prediction performance, the Reverse Enhanced Decoder (RE-Decoder) is proposed to fuse forward and reverse hidden state vectors to obtain diverse trajectories that conform to physical and social acceptability rules. We build the first AAV-captured Trajectory Prediction Dataset (UTP-Dataset) for traffic participants trajectory prediction. Experimental results show that the proposed methods can fulfill the multi-target trajectory prediction task in complex traffic scenarios and achieve high performance.
期刊介绍:
The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.