日语假名的增强分类及层次加权判别视觉语音识别

Shinsuke Okita, Y. Mitsukura, N. Hamada
{"title":"日语假名的增强分类及层次加权判别视觉语音识别","authors":"Shinsuke Okita, Y. Mitsukura, N. Hamada","doi":"10.1109/SPC.2013.6735104","DOIUrl":null,"url":null,"abstract":"For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on `viseme'. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.","PeriodicalId":198247,"journal":{"name":"2013 IEEE Conference on Systems, Process & Control (ICSPC)","volume":"3 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition\",\"authors\":\"Shinsuke Okita, Y. Mitsukura, N. Hamada\",\"doi\":\"10.1109/SPC.2013.6735104\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on `viseme'. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.\",\"PeriodicalId\":198247,\"journal\":{\"name\":\"2013 IEEE Conference on Systems, Process & Control (ICSPC)\",\"volume\":\"3 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE Conference on Systems, Process & Control (ICSPC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPC.2013.6735104\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Conference on Systems, Process & Control (ICSPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPC.2013.6735104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

为了自动语音识别、语音动画合成、说话人验证等目的,人们对“viseme”进行了研究。音素是一种视觉上可识别的话语单位,或音素在听觉域的视觉域的等价单位。粘粒的分类和鉴别方法仍然是一个重要的课题。本文重点研究了日本viseme的分类单元数量和判别方法:将viseme的数量从6个扩展到9个,利用viseme的序列扩展单词表示,然后提出了采用多重判别分析(MDA)的分层加权判别方法来提高判别能力。为了验证和讨论我们的建议的有效性,进行了视素识别和词识别实验。实验结果验证了所提方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition
For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on `viseme'. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信