A method for synthesizing dynamic image of virtual human

2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE). Pub Date: 2023-01-06. DOI: 10.1109/ICCECE58074.2023.10135229

Siyuan Shen, W. Zhang

Citations: 0

Abstract

With the rise of the Metaverse, the need for efficient avatar modeling has become increasingly urgent. Building virtual human models from human image datasets has long been an active topic in computer vision. We use speech synthesis to convert text into a speech waveform, then use a speech-to-lip-shape generation method to produce images of a real person with synchronized audio and video. Finally, we use a thin plate spline transformation to drive the virtual human image, synthesizing a virtual human whose audio and video are synchronized. Experimental results show that this method effectively addresses both lip mismatch and audio-video asynchrony in text-driven avatars, and can synthesize high-quality, high-fidelity, low-latency avatars.
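
The abstract names a thin plate spline (TPS) transformation as the final driving step but gives no implementation details. As a rough illustration only, the sketch below fits a generic 2D TPS that maps one set of keypoints onto another and then applies it to arbitrary points. It is not the authors' method: the function names `fit_tps` and `apply_tps` are hypothetical, and how keypoints are obtained and how the image itself is warped are outside what the abstract states.

```python
import numpy as np

def _tps_kernel(r2):
    # TPS radial basis U(r) = r^2 * log(r), written in terms of r^2.
    # U(0) is defined as 0 to avoid log(0).
    out = np.zeros_like(r2)
    mask = r2 > 0
    out[mask] = 0.5 * r2[mask] * np.log(r2[mask])
    return out

def fit_tps(src, dst):
    """Fit a 2D thin plate spline that maps src control points onto dst.

    Hypothetical helper for illustration. src and dst are (n, 2) arrays
    of distinct, non-collinear points.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    n = src.shape[0]
    # Pairwise squared distances between control points.
    d2 = np.sum((src[:, None, :] - src[None, :, :]) ** 2, axis=-1)
    K = _tps_kernel(d2)
    P = np.hstack([np.ones((n, 1)), src])  # affine part: [1, x, y]
    # Standard TPS linear system: [[K, P], [P^T, 0]] [w; a] = [dst; 0]
    A = np.zeros((n + 3, n + 3))
    A[:n, :n] = K
    A[:n, n:] = P
    A[n:, :n] = P.T
    b = np.zeros((n + 3, 2))
    b[:n] = dst
    params = np.linalg.solve(A, b)
    return src, params

def apply_tps(model, pts):
    """Map arbitrary (m, 2) points through a fitted TPS model."""
    src, params = model
    pts = np.asarray(pts, dtype=float)
    n = src.shape[0]
    d2 = np.sum((pts[:, None, :] - src[None, :, :]) ** 2, axis=-1)
    U = _tps_kernel(d2)
    P = np.hstack([np.ones((pts.shape[0], 1)), pts])
    # Non-rigid part (kernel weights) plus affine part.
    return U @ params[:n] + P @ params[n:]
```

In a driving setup of this kind, `src` would typically hold keypoints on the still avatar image and `dst` the corresponding keypoints in a driving frame; a dense warp can then be obtained by passing every pixel coordinate through `apply_tps`. That pairing is an assumption made for illustration, not a detail taken from the paper.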
