高斯头像：高保真头像与可学习的高斯推导。

IEEE transactions on visualization and computer graphics Pub Date : 2025-04-17 DOI:10.1109/TVCG.2025.3561794

Jie Wang;Jiu-Cheng Xie;Xianyan Li;Feng Xu;Chi-Man Pun;Hao Gao

{"title":"高斯头像：高保真头像与可学习的高斯推导。","authors":"Jie Wang;Jiu-Cheng Xie;Xianyan Li;Feng Xu;Chi-Man Pun;Hao Gao","doi":"10.1109/TVCG.2025.3561794","DOIUrl":null,"url":null,"abstract":"Creating lifelike 3D head avatars and generating compelling animations for diverse subjects remain challenging in computer vision. This paper presents GaussianHead, which models the active head based on anisotropic 3D Gaussians. Our method integrates a motion deformation field and a single-resolution tri-plane to capture the head's intricate dynamics and detailed texture. Notably, we introduce a customized derivation scheme for each 3D Gaussian, facilitating the generation of multiple “doppelgangers” through learnable parameters for precise position transformation. This approach enables efficient representation of diverse Gaussian attributes and ensures their precision. Additionally, we propose an inherited derivation strategy for newly added Gaussians to expedite training. Extensive experiments demonstrate GaussianHead's efficacy, achieving high-fidelity visual results with a remarkably compact model size (<inline-formula><tex-math>$\\approx 12$</tex-math></inline-formula> MB). Our method outperforms state-of-the-art alternatives in tasks such as reconstruction, cross-identity reenactment, and novel view synthesis.","PeriodicalId":94035,"journal":{"name":"IEEE transactions on visualization and computer graphics","volume":"31 7","pages":"4141-4154"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GaussianHead: High-Fidelity Head Avatars With Learnable Gaussian Derivation\",\"authors\":\"Jie Wang;Jiu-Cheng Xie;Xianyan Li;Feng Xu;Chi-Man Pun;Hao Gao\",\"doi\":\"10.1109/TVCG.2025.3561794\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Creating lifelike 3D head avatars and generating compelling animations for diverse subjects remain challenging in computer vision. This paper presents GaussianHead, which models the active head based on anisotropic 3D Gaussians. Our method integrates a motion deformation field and a single-resolution tri-plane to capture the head's intricate dynamics and detailed texture. Notably, we introduce a customized derivation scheme for each 3D Gaussian, facilitating the generation of multiple “doppelgangers” through learnable parameters for precise position transformation. This approach enables efficient representation of diverse Gaussian attributes and ensures their precision. Additionally, we propose an inherited derivation strategy for newly added Gaussians to expedite training. Extensive experiments demonstrate GaussianHead's efficacy, achieving high-fidelity visual results with a remarkably compact model size (<inline-formula><tex-math>$\\\\approx 12$</tex-math></inline-formula> MB). Our method outperforms state-of-the-art alternatives in tasks such as reconstruction, cross-identity reenactment, and novel view synthesis.\",\"PeriodicalId\":94035,\"journal\":{\"name\":\"IEEE transactions on visualization and computer graphics\",\"volume\":\"31 7\",\"pages\":\"4141-4154\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on visualization and computer graphics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10969103/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on visualization and computer graphics","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10969103/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在计算机视觉领域，创建逼真的3D头像和为不同主题生成引人注目的动画仍然具有挑战性。本文提出了一种基于各向异性三维高斯分布的活动头部模型高斯head。我们的方法将运动变形场和单分辨率三平面相结合，以捕获头部复杂的动态和详细的纹理。值得注意的是，我们为每个三维高斯模型引入了定制的推导方案，通过可学习的参数，便于生成多个“二重身”进行精确的位置变换。这种方法能够有效地表示各种高斯属性，并保证它们的精度。此外，我们提出了一种对新增加的高斯函数的继承派生策略，以加快训练速度。大量的实验证明了GaussianHead的有效性，以非常紧凑的模型大小（约12美元MB）实现高保真的视觉结果。我们的方法在重建、跨身份再现和新视图合成等任务中优于最先进的替代方法。源代码可从https://github.com/chiehwangs/gaussian-head获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

GaussianHead: High-Fidelity Head Avatars With Learnable Gaussian Derivation

Creating lifelike 3D head avatars and generating compelling animations for diverse subjects remain challenging in computer vision. This paper presents GaussianHead, which models the active head based on anisotropic 3D Gaussians. Our method integrates a motion deformation field and a single-resolution tri-plane to capture the head's intricate dynamics and detailed texture. Notably, we introduce a customized derivation scheme for each 3D Gaussian, facilitating the generation of multiple “doppelgangers” through learnable parameters for precise position transformation. This approach enables efficient representation of diverse Gaussian attributes and ensures their precision. Additionally, we propose an inherited derivation strategy for newly added Gaussians to expedite training. Extensive experiments demonstrate GaussianHead's efficacy, achieving high-fidelity visual results with a remarkably compact model size (

$\approx 12$

MB). Our method outperforms state-of-the-art alternatives in tasks such as reconstruction, cross-identity reenactment, and novel view synthesis.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE transactions on visualization and computer graphics

自引率

0.00%

发文量