SUPR: A Sparse Unified Part-Based Human Representation

Computer vision - ECCV ... : ... European Conference on Computer Vision : proceedings. European Conference on Computer Vision Pub Date : 2022-10-25 DOI:10.48550/arXiv.2210.13861

Ahmed A. A. Osman, Timo Bolkart, Dimitrios Tzionas, Michael J. Black

{"title":"SUPR: A Sparse Unified Part-Based Human Representation","authors":"Ahmed A. A. Osman, Timo Bolkart, Dimitrios Tzionas, Michael J. Black","doi":"10.48550/arXiv.2210.13861","DOIUrl":null,"url":null,"abstract":"Statistical 3D shape models of the head, hands, and fullbody are widely used in computer vision and graphics. Despite their wide use, we show that existing models of the head and hands fail to capture the full range of motion for these parts. Moreover, existing work largely ignores the feet, which are crucial for modeling human movement and have applications in biomechanics, animation, and the footwear industry. The problem is that previous body part models are trained using 3D scans that are isolated to the individual parts. Such data does not capture the full range of motion for such parts, e.g. the motion of head relative to the neck. Our observation is that full-body scans provide important information about the motion of the body parts. Consequently, we propose a new learning scheme that jointly trains a full-body model and specific part models using a federated dataset of full-body and body-part scans. Specifically, we train an expressive human body model called SUPR (Sparse Unified Part-Based Human Representation), where each joint strictly influences a sparse set of model vertices. The factorized representation enables separating SUPR into an entire suite of body part models. Note that the feet have received little attention and existing 3D body models have highly under-actuated feet. Using novel 4D scans of feet, we train a model with an extended kinematic tree that captures the range of motion of the toes. Additionally, feet deform due to ground contact. To model this, we include a novel non-linear deformation function that predicts foot deformation conditioned on the foot pose, shape, and ground contact. We train SUPR on an unprecedented number of scans: 1.2 million body, head, hand and foot scans. We quantitatively compare SUPR and the separated body parts and find that our suite of models generalizes better than existing models. SUPR is available at http://supr.is.tue.mpg.de","PeriodicalId":72676,"journal":{"name":"Computer vision - ECCV ... : ... European Conference on Computer Vision : proceedings. European Conference on Computer Vision","volume":"47 1","pages":"568-585"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer vision - ECCV ... : ... European Conference on Computer Vision : proceedings. European Conference on Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2210.13861","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

Abstract

Statistical 3D shape models of the head, hands, and fullbody are widely used in computer vision and graphics. Despite their wide use, we show that existing models of the head and hands fail to capture the full range of motion for these parts. Moreover, existing work largely ignores the feet, which are crucial for modeling human movement and have applications in biomechanics, animation, and the footwear industry. The problem is that previous body part models are trained using 3D scans that are isolated to the individual parts. Such data does not capture the full range of motion for such parts, e.g. the motion of head relative to the neck. Our observation is that full-body scans provide important information about the motion of the body parts. Consequently, we propose a new learning scheme that jointly trains a full-body model and specific part models using a federated dataset of full-body and body-part scans. Specifically, we train an expressive human body model called SUPR (Sparse Unified Part-Based Human Representation), where each joint strictly influences a sparse set of model vertices. The factorized representation enables separating SUPR into an entire suite of body part models. Note that the feet have received little attention and existing 3D body models have highly under-actuated feet. Using novel 4D scans of feet, we train a model with an extended kinematic tree that captures the range of motion of the toes. Additionally, feet deform due to ground contact. To model this, we include a novel non-linear deformation function that predicts foot deformation conditioned on the foot pose, shape, and ground contact. We train SUPR on an unprecedented number of scans: 1.2 million body, head, hand and foot scans. We quantitatively compare SUPR and the separated body parts and find that our suite of models generalizes better than existing models. SUPR is available at http://supr.is.tue.mpg.de

查看原文本刊更多论文

SUPR:稀疏统一的基于部分的人类表征

头部、手部和全身的统计三维形状模型在计算机视觉和图形学中有着广泛的应用。尽管它们被广泛使用，但我们表明，现有的头部和手部模型无法捕捉到这些部位的全部运动范围。此外，现有的工作在很大程度上忽略了脚，这是模拟人类运动的关键，在生物力学、动画和鞋类工业中都有应用。问题是，以前的身体部位模型是用3D扫描来训练的，这种扫描是孤立于单个部位的。这些数据并没有捕捉到这些部位的全部运动范围，例如头部相对于颈部的运动。我们的观察是，全身扫描提供了有关身体部位运动的重要信息。因此，我们提出了一种新的学习方案，该方案使用全身和身体部位扫描的联合数据集联合训练全身模型和特定部位模型。具体来说，我们训练了一个富有表现力的人体模型，称为SUPR(稀疏统一的基于部分的人体表征)，其中每个关节严格影响一个稀疏的模型顶点集。因式表示可以将SUPR分离成一套完整的身体部位模型。注意，足部很少受到关注，现有的3D身体模型具有高度欠驱动的足部。使用新颖的足部4D扫描，我们训练了一个扩展的运动树模型，该模型捕获了脚趾的运动范围。此外，脚会因接触地面而变形。为了模拟这一点，我们包含了一个新的非线性变形函数，该函数预测足部变形取决于足部姿势、形状和地面接触。我们对SUPR进行了前所未有的扫描训练:120万次身体、头部、手部和脚部扫描。我们定量地比较了SUPR和分离的身体部位，发现我们的模型套件比现有的模型泛化得更好。SUPR可在http://supr.is.tue.mpg.de上获得

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer vision - ECCV ... : ... European Conference on Computer Vision : proceedings. European Conference on Computer Vision

自引率

0.00%

发文量