Image-based talking heads using radial basis functions

James D. Edge, Steve C. Maddock
{"title":"Image-based talking heads using radial basis functions","authors":"James D. Edge, Steve C. Maddock","doi":"10.1109/TPCG.2003.1206933","DOIUrl":null,"url":null,"abstract":"In recent years talking heads have received a great deal of interest, both in their application to natural human-computer dialogue, and their benefit to the intelligibility of synthesized speech. A model for the realistic synthesis of visual speech animation is described. Images representing the key visual speech poses (visemes) are pre-recorded and labeled. Transitions between visemes are created by using an image morphing technique based upon the use of radial basis functions. Timing information from the festival speech synthesis system is used to plan the appropriate transitions to create realistic speech animation. A model of coarticulation is included in the system to improve the realism of articulatory motion.","PeriodicalId":132138,"journal":{"name":"Proceedings of Theory and Practice of Computer Graphics, 2003.","volume":"622 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of Theory and Practice of Computer Graphics, 2003.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TPCG.2003.1206933","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

In recent years talking heads have received a great deal of interest, both in their application to natural human-computer dialogue, and their benefit to the intelligibility of synthesized speech. A model for the realistic synthesis of visual speech animation is described. Images representing the key visual speech poses (visemes) are pre-recorded and labeled. Transitions between visemes are created by using an image morphing technique based upon the use of radial basis functions. Timing information from the festival speech synthesis system is used to plan the appropriate transitions to create realistic speech animation. A model of coarticulation is included in the system to improve the realism of articulatory motion.
使用径向基函数的基于图像的谈话头
近年来,说话头在自然人机对话中的应用以及对合成语音的可理解性的好处引起了人们的极大兴趣。描述了一种视觉语音动画的逼真合成模型。代表关键视觉语音姿势(visemes)的图像是预先录制和标记的。viseme之间的过渡是通过使用基于径向基函数的图像变形技术创建的。来自节日语音合成系统的定时信息用于计划适当的过渡,以创建逼真的语音动画。在系统中加入了协同关节模型,提高了关节运动的真实感。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信