3D Human Pose Estimation via Deep Learning from 2D Annotations

2016 Fourth International Conference on 3D Vision (3DV) Pub Date : 2016-10-01 DOI:10.1109/3DV.2016.84

Ernesto Brau, Hao Jiang

引用次数: 47

Abstract

We propose a deep convolutional neural network for 3D human pose and camera estimation from monocular images that learns from 2D joint annotations. The proposed network follows the typical architecture, but contains an additional output layer which projects predicted 3D joints onto 2D, and enforces constraints on body part lengths in 3D. We further enforce pose constraints using an independently trained network that learns a prior distribution over 3D poses. We evaluate our approach on several benchmark datasets and compare against state-of-the-art approaches for 3D human pose estimation, achieving comparable performance. Additionally, we show that our approach significantly outperforms other methods in cases where 3D ground truth data is unavailable, and that our network exhibits good generalization properties.

查看原文本刊更多论文

基于2D注释的深度学习三维人体姿态估计

我们提出了一种深度卷积神经网络，用于从单眼图像中学习2D联合注释的3D人体姿势和相机估计。所提出的网络遵循典型的架构，但包含一个额外的输出层，该输出层将预测的3D关节投影到2D上，并在3D中对身体部位长度进行约束。我们使用一个独立训练的网络来学习3D姿势的先验分布，进一步加强姿势约束。我们在几个基准数据集上评估了我们的方法，并与最先进的3D人体姿态估计方法进行了比较，获得了相当的性能。此外，我们表明，在三维地面真实数据不可用的情况下，我们的方法明显优于其他方法，并且我们的网络表现出良好的泛化特性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 Fourth International Conference on 3D Vision (3DV)

自引率

0.00%

发文量