日语假名的增强分类及层次加权判别视觉语音识别

2013 IEEE Conference on Systems, Process & Control (ICSPC) Pub Date : 1900-01-01 DOI:10.1109/SPC.2013.6735104

Shinsuke Okita, Y. Mitsukura, N. Hamada

{"title":"日语假名的增强分类及层次加权判别视觉语音识别","authors":"Shinsuke Okita, Y. Mitsukura, N. Hamada","doi":"10.1109/SPC.2013.6735104","DOIUrl":null,"url":null,"abstract":"For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on `viseme'. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.","PeriodicalId":198247,"journal":{"name":"2013 IEEE Conference on Systems, Process & Control (ICSPC)","volume":"3 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition\",\"authors\":\"Shinsuke Okita, Y. Mitsukura, N. Hamada\",\"doi\":\"10.1109/SPC.2013.6735104\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on `viseme'. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.\",\"PeriodicalId\":198247,\"journal\":{\"name\":\"2013 IEEE Conference on Systems, Process & Control (ICSPC)\",\"volume\":\"3 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE Conference on Systems, Process & Control (ICSPC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPC.2013.6735104\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Conference on Systems, Process & Control (ICSPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPC.2013.6735104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

为了自动语音识别、语音动画合成、说话人验证等目的，人们对“viseme”进行了研究。音素是一种视觉上可识别的话语单位，或音素在听觉域的视觉域的等价单位。粘粒的分类和鉴别方法仍然是一个重要的课题。本文重点研究了日本viseme的分类单元数量和判别方法:将viseme的数量从6个扩展到9个，利用viseme的序列扩展单词表示，然后提出了采用多重判别分析(MDA)的分层加权判别方法来提高判别能力。为了验证和讨论我们的建议的有效性，进行了视素识别和词识别实验。实验结果验证了所提方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Augmented classification of Japanese visemes and hierarchical weighted discrimination for visual speech recognition

For the purpose of automatic speech recognition and speech animation synthesis, speaker verification and so on, there have been studies on `viseme'. Viseme is a visually identifiable unit of utterance or the equivalent unit in the visual domain of the phoneme in audio domain. The classification and the discrimination method of visemes are still important topics. This paper focuses on the number of classification units and a discrimination procedure of Japanese visemes: We extend the number of visemes from 6 to 9 to expanse the word representation by their series, then propose the hierarchical weighted discrimination using multiple discriminative analysis (MDA) to enhance the discriminative ability. In order to verify and discuss the availability of our proposals, visemes discrimination and word recognition experiments were conducted. From these results, the validity of the proposed methods was confirmed.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2013 IEEE Conference on Systems, Process & Control (ICSPC)

自引率

0.00%

发文量