Posebits for Monocular Human Pose Estimation

2014 IEEE Conference on Computer Vision and Pattern Recognition. Pub Date: 2014-06-23. DOI: 10.1109/CVPR.2014.300

Gerard Pons-Moll, David J. Fleet, B. Rosenhahn

Citations: 74

Abstract

We advocate the inference of qualitative information about 3D human pose, called posebits, from images. Posebits represent Boolean geometric relationships between body parts (e.g., left-leg in front of right-leg or hands close to each other). The advantages of posebits as a mid-level representation are 1) for many tasks of interest, such qualitative pose information may be sufficient (e.g., semantic image retrieval), 2) it is relatively easy to annotate large image corpora with posebits, as it simply requires answers to yes/no questions, and 3) they help resolve challenging pose ambiguities and therefore facilitate the difficult task of image-based 3D pose estimation. We introduce posebits, a posebit database, a method for selecting useful posebits for pose estimation, and a structural SVM model for posebit inference. Experiments show the use of posebits for semantic image retrieval and for improving 3D pose estimation.
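
To make the idea of a posebit concrete, the following minimal sketch evaluates two Boolean relationships of the kind the abstract mentions on a known 3D pose. It is an illustration only, not the paper's method: the paper infers posebits from images, whereas this sketch computes them from given joint coordinates. The joint names, the camera-facing depth axis, the thresholds, and the helper functions are all assumptions introduced here.

```python
import math

# Assumed convention (not from the paper): each joint is an (x, y, z) tuple
# in metres, and z is depth, so a smaller z means closer to the camera.

def in_front_of(pose, joint_a, joint_b, margin=0.05):
    """Return True if joint_a is closer to the camera than joint_b by more than margin."""
    return pose[joint_b][2] - pose[joint_a][2] > margin

def close_together(pose, joint_a, joint_b, threshold=0.25):
    """Return True if the Euclidean distance between the two joints is below threshold."""
    return math.dist(pose[joint_a], pose[joint_b]) < threshold

def posebits(pose):
    """Evaluate a small, hypothetical set of posebits for one 3D pose."""
    return {
        "left_foot_in_front_of_right_foot": in_front_of(pose, "left_foot", "right_foot"),
        "hands_close": close_together(pose, "left_hand", "right_hand"),
    }

# Hypothetical pose: left foot 0.3 m nearer the camera, hands 0.1 m apart.
example_pose = {
    "left_foot": (0.1, 0.0, 2.0),
    "right_foot": (-0.1, 0.0, 2.3),
    "left_hand": (0.05, 1.0, 2.1),
    "right_hand": (-0.05, 1.0, 2.1),
}

bits = posebits(example_pose)
```

Each entry in `bits` is the answer to one yes/no question about the pose, which is the same form an annotator would provide when labeling an image.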
