Interacting Two-Hand 3D Pose and Shape Reconstruction from Single Color Image

2021 IEEE/CVF International Conference on Computer Vision (ICCV) Pub Date : 2021-10-01 DOI:10.1109/ICCV48922.2021.01116

Baowen Zhang, Yangang Wang, Xiaoming Deng, Yinda Zhang, P. Tan, Cuixia Ma, Hongan Wang

{"title":"Interacting Two-Hand 3D Pose and Shape Reconstruction from Single Color Image","authors":"Baowen Zhang, Yangang Wang, Xiaoming Deng, Yinda Zhang, P. Tan, Cuixia Ma, Hongan Wang","doi":"10.1109/ICCV48922.2021.01116","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a novel deep learning framework to reconstruct 3D hand poses and shapes of two interacting hands from a single color image. Previous methods designed for single hand cannot be easily applied for the two hand scenario because of the heavy inter-hand occlusion and larger solution space. In order to address the occlusion and similar appearance between hands that may confuse the network, we design a hand pose-aware attention module to extract features associated to each individual hand respectively. We then leverage the two hand context presented in interaction to propose a context-aware cascaded refinement that improves the hand pose and shape accuracy of each hand conditioned on the context between interacting hands. Extensive experiments on the main benchmark datasets demonstrate that our method predicts accurate 3D hand pose and shape from single color image, and achieves the state-of-the-art performance. Code is available in project webpage https://baowenz.github.io/Intershape/.","PeriodicalId":6820,"journal":{"name":"2021 IEEE/CVF International Conference on Computer Vision (ICCV)","volume":"71 1","pages":"11334-11343"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"52","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/CVF International Conference on Computer Vision (ICCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV48922.2021.01116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 52

Abstract

In this paper, we propose a novel deep learning framework to reconstruct 3D hand poses and shapes of two interacting hands from a single color image. Previous methods designed for single hand cannot be easily applied for the two hand scenario because of the heavy inter-hand occlusion and larger solution space. In order to address the occlusion and similar appearance between hands that may confuse the network, we design a hand pose-aware attention module to extract features associated to each individual hand respectively. We then leverage the two hand context presented in interaction to propose a context-aware cascaded refinement that improves the hand pose and shape accuracy of each hand conditioned on the context between interacting hands. Extensive experiments on the main benchmark datasets demonstrate that our method predicts accurate 3D hand pose and shape from single color image, and achieves the state-of-the-art performance. Code is available in project webpage https://baowenz.github.io/Intershape/.

查看原文本刊更多论文

单色图像交互双手三维姿态和形状重建

在本文中，我们提出了一种新的深度学习框架，用于从单个彩色图像中重建两只相互作用的手的三维姿势和形状。以往针对单手设计的方法由于手间遮挡较大，求解空间较大，不容易应用于双手场景。为了解决手之间的遮挡和相似外观可能会混淆网络的问题，我们设计了一个手部姿势感知注意力模块，分别提取与每只手相关的特征。然后，我们利用交互中呈现的两只手上下文来提出上下文感知级联改进，该改进基于交互手之间的上下文来提高每只手的手部姿势和形状准确性。在主要的基准数据集上进行的大量实验表明，我们的方法可以从单色图像中准确地预测三维手的姿势和形状，并达到了最先进的性能。代码可在项目网页https://baowenz.github.io/Intershape/中获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE/CVF International Conference on Computer Vision (ICCV)

自引率

0.00%

发文量