Training a talking head

Proceedings. Fourth IEEE International Conference on Multimodal Interfaces Pub Date : 2002-10-14 DOI:10.1109/ICMI.2002.1167046

Michael M. Cohen, D. Massaro, R. Clark

引用次数: 39

Abstract

A Cyberware laser scan of DWM was made, Baldi's generic morphology was mapped into the form of DWM, this head was trained on real data recorded with Optotrak LED markers, and the quality of its speech was evaluated. Participants were asked to recognize auditory sentences presented alone in noise, aligned with the newly trained synthetic textured mapped target face, or the original natural face. There was a significant advantage when the noisy auditory sentence was paired with either head, with the synthetic textured mapped target face giving as much of an improvement as the original recordings of the natural face.

查看原文本刊更多论文

训练一个会说话的脑袋

对DWM进行Cyberware激光扫描，将Baldi的一般形态映射为DWM的形式，使用Optotrak LED标记记录的真实数据对该头部进行训练，并对其语音质量进行评估。参与者被要求识别在噪音中单独呈现的听觉句子，与新训练的合成纹理映射的目标脸或原始自然脸对齐。当嘈杂的听觉句子与任何一个头部配对时，有一个显著的优势，合成纹理映射的目标脸与自然脸的原始记录一样有很大的改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. Fourth IEEE International Conference on Multimodal Interfaces

自引率

0.00%

发文量