First-person pose recognition using egocentric workspaces

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2015-06-07 DOI:10.1109/CVPR.2015.7299061

Grégory Rogez, J. Supančič, Deva Ramanan

{"title":"First-person pose recognition using egocentric workspaces","authors":"Grégory Rogez, J. Supančič, Deva Ramanan","doi":"10.1109/CVPR.2015.7299061","DOIUrl":null,"url":null,"abstract":"We tackle the problem of estimating the 3D pose of an individual's upper limbs (arms+hands) from a chest mounted depth-camera. Importantly, we consider pose estimation during everyday interactions with objects. Past work shows that strong pose+viewpoint priors and depth-based features are crucial for robust performance. In egocentric views, hands and arms are observable within a well defined volume in front of the camera. We call this volume an egocentric workspace. A notable property is that hand appearance correlates with workspace location. To exploit this correlation, we classify arm+hand configurations in a global egocentric coordinate frame, rather than a local scanning window. This greatly simplify the architecture and improves performance. We propose an efficient pipeline which 1) generates synthetic workspace exemplars for training using a virtual chest-mounted camera whose intrinsic parameters match our physical camera, 2) computes perspective-aware depth features on this entire volume and 3) recognizes discrete arm+hand pose classes through a sparse multi-class SVM. We achieve state-of-the-art hand pose recognition performance from egocentric RGB-D images in real-time.","PeriodicalId":444472,"journal":{"name":"2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"90","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR.2015.7299061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 90

Abstract

We tackle the problem of estimating the 3D pose of an individual's upper limbs (arms+hands) from a chest mounted depth-camera. Importantly, we consider pose estimation during everyday interactions with objects. Past work shows that strong pose+viewpoint priors and depth-based features are crucial for robust performance. In egocentric views, hands and arms are observable within a well defined volume in front of the camera. We call this volume an egocentric workspace. A notable property is that hand appearance correlates with workspace location. To exploit this correlation, we classify arm+hand configurations in a global egocentric coordinate frame, rather than a local scanning window. This greatly simplify the architecture and improves performance. We propose an efficient pipeline which 1) generates synthetic workspace exemplars for training using a virtual chest-mounted camera whose intrinsic parameters match our physical camera, 2) computes perspective-aware depth features on this entire volume and 3) recognizes discrete arm+hand pose classes through a sparse multi-class SVM. We achieve state-of-the-art hand pose recognition performance from egocentric RGB-D images in real-time.

查看原文本刊更多论文

使用以自我为中心的工作空间的第一人称姿势识别

我们解决了从安装在胸部的深度相机估计个人上肢(手臂+手)的3D姿势的问题。重要的是，我们在与物体的日常交互中考虑姿势估计。过去的研究表明，强大的姿态+视点先验和基于深度的特征对于稳健的性能至关重要。在以自我为中心的视角中，手和手臂在镜头前的一个明确的体积内是可以观察到的。我们称这个体积为以自我为中心的工作空间。一个值得注意的特性是手的外观与工作空间位置相关。为了利用这种相关性，我们在全局以自我为中心的坐标框架中对手臂+手的构型进行分类，而不是局部扫描窗口。这极大地简化了体系结构并提高了性能。我们提出了一个高效的流水线，它1)生成合成的工作空间样本，用于使用虚拟的胸装相机进行训练，其内在参数与我们的物理相机相匹配，2)计算整个体积上的透视感知深度特征，3)通过稀疏的多类支持向量机识别离散的手臂和手的姿势类别。我们从以自我为中心的RGB-D图像中实时实现了最先进的手部姿势识别性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量