{"title":"行人轨迹预测的多元交互和多模态建模","authors":"Ruiping Wang;Zhijian Hu;Junzhi Yu;Jun Cheng","doi":"10.1109/JAS.2025.125363","DOIUrl":null,"url":null,"abstract":"Pedestrian trajectory prediction can significantly enhance the perception and decision-making capabilities of autonomous driving systems and intelligent surveillance systems based on camera sensors by predicting the states and behavior intentions of surrounding pedestrians. However, existing trajectory prediction methods remain failing to effectively model the diverse and complex interactions in the real world, including pedestrian-pedestrian interactions and pedestrian-environment interactions. Besides, these methods are not effective in capturing and characterizing the multimodal property of future trajectories. To address these challenges above, we propose to devise a hand-designed graph convolution and spatial cross attention to dynamically capture the diverse spatial interactions between pedestrians. To effectively explore the impact of scenarios on pedestrian trajectory, we build a pedestrian map, which can reflect the scene constraints and pedestrian motion preferences. Meanwhile, we construct a trajectory multimodality-aware module to capture the different potential mode implicit in diverse social behaviors for pedestrian future trajectory uncertainty. Finally, we compared the proposed method with trajectory prediction baselines on commonly used public pedestrian benchmarks, demonstrating the superior performance of our approach.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"12 9","pages":"1801-1813"},"PeriodicalIF":19.2000,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Modelling Diverse Interactions and Multimodality for Pedestrian Trajectory Prediction\",\"authors\":\"Ruiping Wang;Zhijian Hu;Junzhi Yu;Jun Cheng\",\"doi\":\"10.1109/JAS.2025.125363\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Pedestrian trajectory prediction can significantly enhance the perception and decision-making capabilities of autonomous driving systems and intelligent surveillance systems based on camera sensors by predicting the states and behavior intentions of surrounding pedestrians. However, existing trajectory prediction methods remain failing to effectively model the diverse and complex interactions in the real world, including pedestrian-pedestrian interactions and pedestrian-environment interactions. Besides, these methods are not effective in capturing and characterizing the multimodal property of future trajectories. To address these challenges above, we propose to devise a hand-designed graph convolution and spatial cross attention to dynamically capture the diverse spatial interactions between pedestrians. To effectively explore the impact of scenarios on pedestrian trajectory, we build a pedestrian map, which can reflect the scene constraints and pedestrian motion preferences. Meanwhile, we construct a trajectory multimodality-aware module to capture the different potential mode implicit in diverse social behaviors for pedestrian future trajectory uncertainty. Finally, we compared the proposed method with trajectory prediction baselines on commonly used public pedestrian benchmarks, demonstrating the superior performance of our approach.\",\"PeriodicalId\":54230,\"journal\":{\"name\":\"Ieee-Caa Journal of Automatica Sinica\",\"volume\":\"12 9\",\"pages\":\"1801-1813\"},\"PeriodicalIF\":19.2000,\"publicationDate\":\"2025-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ieee-Caa Journal of Automatica Sinica\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11208772/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ieee-Caa Journal of Automatica Sinica","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11208772/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Modelling Diverse Interactions and Multimodality for Pedestrian Trajectory Prediction
Pedestrian trajectory prediction can significantly enhance the perception and decision-making capabilities of autonomous driving systems and intelligent surveillance systems based on camera sensors by predicting the states and behavior intentions of surrounding pedestrians. However, existing trajectory prediction methods remain failing to effectively model the diverse and complex interactions in the real world, including pedestrian-pedestrian interactions and pedestrian-environment interactions. Besides, these methods are not effective in capturing and characterizing the multimodal property of future trajectories. To address these challenges above, we propose to devise a hand-designed graph convolution and spatial cross attention to dynamically capture the diverse spatial interactions between pedestrians. To effectively explore the impact of scenarios on pedestrian trajectory, we build a pedestrian map, which can reflect the scene constraints and pedestrian motion preferences. Meanwhile, we construct a trajectory multimodality-aware module to capture the different potential mode implicit in diverse social behaviors for pedestrian future trajectory uncertainty. Finally, we compared the proposed method with trajectory prediction baselines on commonly used public pedestrian benchmarks, demonstrating the superior performance of our approach.
期刊介绍:
The IEEE/CAA Journal of Automatica Sinica is a reputable journal that publishes high-quality papers in English on original theoretical/experimental research and development in the field of automation. The journal covers a wide range of topics including automatic control, artificial intelligence and intelligent control, systems theory and engineering, pattern recognition and intelligent systems, automation engineering and applications, information processing and information systems, network-based automation, robotics, sensing and measurement, and navigation, guidance, and control.
Additionally, the journal is abstracted/indexed in several prominent databases including SCIE (Science Citation Index Expanded), EI (Engineering Index), Inspec, Scopus, SCImago, DBLP, CNKI (China National Knowledge Infrastructure), CSCD (Chinese Science Citation Database), and IEEE Xplore.