具有社会意识扩散的级联物理约束条件变分自动编码器用于行人轨迹预测

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Haojie Chen , Zhuo Wang , Hongde Qin , Xiaokai Mu
{"title":"具有社会意识扩散的级联物理约束条件变分自动编码器用于行人轨迹预测","authors":"Haojie Chen ,&nbsp;Zhuo Wang ,&nbsp;Hongde Qin ,&nbsp;Xiaokai Mu","doi":"10.1016/j.patcog.2025.111667","DOIUrl":null,"url":null,"abstract":"<div><div>Pedestrian trajectory prediction serves as a crucial prerequisite for various tasks such as autonomous driving and human–robot interaction. The existing methods mainly leverage deep learning-based generative models to predict future multi-modal trajectories. Nevertheless, the inherent uncertainty in pedestrian movements poses a challenge for deep generative models to generate accurate and plausible future trajectories. In this paper, we propose a two-stage trajectory prediction network termed CPSD. In the first stage, a Cascaded Physical-constraint Conditional Variational Auto Encoder is proposed. It combines Differentiable Physical Constraint Conditional Variational Auto Encoders in the cascaded form to predict the trajectory coordinates with a stepwise manner, which improves the interpretability of deep generative network and alleviates the problem of prediction error accumulation over time. In the second stage, a Socially-aware Diffusion Model is proposed to refine the initial trajectory generated in the first stage. By introducing a non-local attention mechanism and constructing a social mask, we integrate pedestrian social interactions into the diffusion model, enabling the refinement of more realistic and plausible multi-modal pedestrian trajectories. Extensive experiments conducted on the public datasets SDD and ETH/UCY demonstrate that CPSD achieves more promising pedestrian trajectories compared with other state-of-the-art trajectory prediction algorithms.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"166 ","pages":"Article 111667"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cascaded Physical-constraint Conditional Variational Auto Encoder with socially-aware diffusion for pedestrian trajectory prediction\",\"authors\":\"Haojie Chen ,&nbsp;Zhuo Wang ,&nbsp;Hongde Qin ,&nbsp;Xiaokai Mu\",\"doi\":\"10.1016/j.patcog.2025.111667\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Pedestrian trajectory prediction serves as a crucial prerequisite for various tasks such as autonomous driving and human–robot interaction. The existing methods mainly leverage deep learning-based generative models to predict future multi-modal trajectories. Nevertheless, the inherent uncertainty in pedestrian movements poses a challenge for deep generative models to generate accurate and plausible future trajectories. In this paper, we propose a two-stage trajectory prediction network termed CPSD. In the first stage, a Cascaded Physical-constraint Conditional Variational Auto Encoder is proposed. It combines Differentiable Physical Constraint Conditional Variational Auto Encoders in the cascaded form to predict the trajectory coordinates with a stepwise manner, which improves the interpretability of deep generative network and alleviates the problem of prediction error accumulation over time. In the second stage, a Socially-aware Diffusion Model is proposed to refine the initial trajectory generated in the first stage. By introducing a non-local attention mechanism and constructing a social mask, we integrate pedestrian social interactions into the diffusion model, enabling the refinement of more realistic and plausible multi-modal pedestrian trajectories. Extensive experiments conducted on the public datasets SDD and ETH/UCY demonstrate that CPSD achieves more promising pedestrian trajectories compared with other state-of-the-art trajectory prediction algorithms.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"166 \",\"pages\":\"Article 111667\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325003279\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325003279","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

行人轨迹预测是自动驾驶和人机交互等各种任务的重要前提。现有的方法主要利用基于深度学习的生成模型来预测未来的多模态轨迹。然而,行人运动中固有的不确定性对深度生成模型产生准确和可信的未来轨迹提出了挑战。本文提出了一种称为CPSD的两阶段轨迹预测网络。首先,提出了一种级联物理约束条件变分自编码器。将可微物理约束条件变分自编码器以级联形式组合在一起,逐步预测轨迹坐标,提高了深度生成网络的可解释性,缓解了预测误差随时间累积的问题。在第二阶段,提出了一个社会意识扩散模型来细化第一阶段产生的初始轨迹。通过引入非局部注意机制和构建社会面具,我们将行人社会互动整合到扩散模型中,从而使多模态行人轨迹更加真实可信。在公共数据集SDD和ETH/UCY上进行的大量实验表明,与其他最先进的轨迹预测算法相比,CPSD获得了更有希望的行人轨迹。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Cascaded Physical-constraint Conditional Variational Auto Encoder with socially-aware diffusion for pedestrian trajectory prediction
Pedestrian trajectory prediction serves as a crucial prerequisite for various tasks such as autonomous driving and human–robot interaction. The existing methods mainly leverage deep learning-based generative models to predict future multi-modal trajectories. Nevertheless, the inherent uncertainty in pedestrian movements poses a challenge for deep generative models to generate accurate and plausible future trajectories. In this paper, we propose a two-stage trajectory prediction network termed CPSD. In the first stage, a Cascaded Physical-constraint Conditional Variational Auto Encoder is proposed. It combines Differentiable Physical Constraint Conditional Variational Auto Encoders in the cascaded form to predict the trajectory coordinates with a stepwise manner, which improves the interpretability of deep generative network and alleviates the problem of prediction error accumulation over time. In the second stage, a Socially-aware Diffusion Model is proposed to refine the initial trajectory generated in the first stage. By introducing a non-local attention mechanism and constructing a social mask, we integrate pedestrian social interactions into the diffusion model, enabling the refinement of more realistic and plausible multi-modal pedestrian trajectories. Extensive experiments conducted on the public datasets SDD and ETH/UCY demonstrate that CPSD achieves more promising pedestrian trajectories compared with other state-of-the-art trajectory prediction algorithms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信