InfoSTGCAN: An Information-Maximizing Spatial-Temporal Graph Convolutional Attention Network for Heterogeneous Human Trajectory Prediction

Kangrui Ruan, Xuan Di
{"title":"InfoSTGCAN: An Information-Maximizing Spatial-Temporal Graph Convolutional Attention Network for Heterogeneous Human Trajectory Prediction","authors":"Kangrui Ruan, Xuan Di","doi":"10.3390/computers13060151","DOIUrl":null,"url":null,"abstract":"Predicting the future trajectories of multiple interacting pedestrians within a scene has increasingly gained importance in various fields, e.g., autonomous driving, human–robot interaction, and so on. The complexity of this problem is heightened due to the social dynamics among different pedestrians and their heterogeneous implicit preferences. In this paper, we present Information Maximizing Spatial-Temporal Graph Convolutional Attention Network (InfoSTGCAN), which takes into account both pedestrian interactions and heterogeneous behavior choice modeling. To effectively capture the complex interactions among pedestrians, we integrate spatial-temporal graph convolution and spatial-temporal graph attention. For grasping the heterogeneity in pedestrians’ behavior choices, our model goes a step further by learning to predict an individual-level latent code for each pedestrian. Each latent code represents a distinct pattern of movement choice. Finally, based on the observed historical trajectory and the learned latent code, the proposed method is trained to cover the ground-truth future trajectory of this pedestrian with a bi-variate Gaussian distribution. We evaluate the proposed method through a comprehensive list of experiments and demonstrate that our method outperforms all baseline methods on the commonly used metrics, Average Displacement Error and Final Displacement Error. Notably, visualizations of the generated trajectories reveal our method’s capacity to handle different scenarios.","PeriodicalId":503381,"journal":{"name":"Computers","volume":"37 17","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/computers13060151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Predicting the future trajectories of multiple interacting pedestrians within a scene is increasingly important in fields such as autonomous driving and human–robot interaction. The problem is complicated by the social dynamics among pedestrians and by their heterogeneous implicit preferences. In this paper, we present the Information-Maximizing Spatial-Temporal Graph Convolutional Attention Network (InfoSTGCAN), which accounts for both pedestrian interactions and heterogeneous behavior-choice modeling. To capture the complex interactions among pedestrians, we integrate spatial-temporal graph convolution with spatial-temporal graph attention. To capture the heterogeneity of pedestrians' behavior choices, the model further learns to predict an individual-level latent code for each pedestrian, where each code represents a distinct pattern of movement choice. Finally, conditioned on the observed historical trajectory and the learned latent code, the model is trained to cover each pedestrian's ground-truth future trajectory with a bivariate Gaussian distribution. We evaluate the proposed method through a comprehensive set of experiments and show that it outperforms all baseline methods on the commonly used metrics, Average Displacement Error (ADE) and Final Displacement Error (FDE). Visualizations of the generated trajectories further illustrate the method's ability to handle diverse scenarios.
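To make the latent-code idea concrete, the sketch below shows InfoGAN-style code conditioning with an auxiliary posterior head, which is the standard way to maximize mutual information between a latent code and a generator's output. The abstract does not specify the code's form, so the categorical code, the module names (decoder, q_head), and all dimensions here are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

NUM_CODES = 4    # assumed number of distinct movement-choice patterns
HIST_DIM = 32    # assumed size of the encoded observed trajectory
PRED_LEN = 12    # assumed number of predicted time steps

# Stand-in modules: a trajectory decoder conditioned on the code, and an
# auxiliary head Q that tries to recover the code from the generated trajectory.
decoder = nn.Linear(HIST_DIM + NUM_CODES, 2 * PRED_LEN)
q_head = nn.Linear(2 * PRED_LEN, NUM_CODES)

def info_loss(hist_feat: torch.Tensor) -> torch.Tensor:
    """hist_feat: (N, HIST_DIM) encoded histories for N pedestrians."""
    n = hist_feat.size(0)
    code = torch.randint(0, NUM_CODES, (n,))          # one code per pedestrian
    code_onehot = F.one_hot(code, NUM_CODES).float()
    traj = decoder(torch.cat([hist_feat, code_onehot], dim=-1))
    logits = q_head(traj)
    # Minimizing this cross-entropy maximizes a variational lower bound on
    # the mutual information I(code; trajectory), as in InfoGAN.
    return F.cross_entropy(logits, code)

# Usage with random features for 5 pedestrians:
print(info_loss(torch.randn(5, HIST_DIM)))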
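The training objective described in the abstract, covering the ground-truth future trajectory with a bivariate Gaussian, suggests the negative log-likelihood loss that is standard in this literature (e.g., Social-LSTM and Social-STGCNN). A minimal sketch follows; the (T, N, 5) tensor layout and the exp/tanh parameterization of the distribution are assumptions rather than details from the paper.

```python
import math
import torch

def bivariate_gaussian_nll(pred: torch.Tensor, target: torch.Tensor,
                           eps: float = 1e-6) -> torch.Tensor:
    """pred: (T, N, 5) per-step raw outputs (mu_x, mu_y, log_sx, log_sy, rho_raw);
    target: (T, N, 2) ground-truth (x, y) positions."""
    mu_x, mu_y = pred[..., 0], pred[..., 1]
    sx = torch.exp(pred[..., 2])      # std devs made positive via exp
    sy = torch.exp(pred[..., 3])
    rho = torch.tanh(pred[..., 4])    # correlation squashed into (-1, 1)

    dx = (target[..., 0] - mu_x) / (sx + eps)
    dy = (target[..., 1] - mu_y) / (sy + eps)
    one_minus_rho2 = (1.0 - rho ** 2).clamp_min(eps)

    # Negative log density of a correlated 2D Gaussian at the ground truth.
    z = dx ** 2 - 2.0 * rho * dx * dy + dy ** 2
    nll = (z / (2.0 * one_minus_rho2)
           + torch.log(sx + eps) + torch.log(sy + eps)
           + 0.5 * torch.log(one_minus_rho2)
           + math.log(2.0 * math.pi))
    return nll.mean()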
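For reference, the two evaluation metrics named in the abstract have standard definitions: ADE averages the L2 error over every predicted time step, while FDE measures the L2 error at the final step only. A short sketch, with an assumed (time, pedestrians, xy) layout:

```python
import numpy as np

def ade_fde(pred: np.ndarray, target: np.ndarray):
    """pred, target: (T, N, 2) predicted and ground-truth (x, y) positions."""
    dist = np.linalg.norm(pred - target, axis=-1)  # (T, N) per-step L2 error
    ade = dist.mean()       # average over all time steps and pedestrians
    fde = dist[-1].mean()   # error at the final predicted step only
    return float(ade), float(fde)

# Usage: 12 predicted steps for 3 pedestrians.
rng = np.random.default_rng(0)
print(ade_fde(rng.normal(size=(12, 3, 2)), rng.normal(size=(12, 3, 2))))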