Latest articles from the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding

Acoustic characteristics related to the perceptual pitch in whispered vowels
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707737
H. Konno, Hideo Kanemitsu, N. Takahashi, Mineichi Kudo
Abstract: The characteristics of whispered speech are not well known. The most remarkable difference from ordinary speech is the pitch (the perceived height of the voice), since whispered speech has no fundamental frequency. In this study, we investigated the mechanism of pitch production in whispered speech through an experiment in which a male and a female subject uttered Japanese whispered vowels while tuning their pitch to a guidance tone presented at five to nine different frequencies. We applied multivariate analyses such as principal component analysis to the data to identify which frequency regions contribute most to the change in pitch. We corroborate the previous observation that formant shifts are dominant, and provide more detailed numerical evidence. In addition, we obtained some insights into the pitch mechanism of whispered speech. The main result is that two or three formants below 5 kHz shift upward and the energy increases in the high-frequency region above 5 kHz.
Citations: 4
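The paper reports only the analysis, not code. As a rough sketch of the kind of pipeline the abstract describes (synthetic frames stand in for the authors' whisper recordings; all variable names and sizes are hypothetical), one can apply PCA to log-magnitude spectra and inspect which frequency bins load most heavily on the leading component:

```python
import numpy as np
from sklearn.decomposition import PCA

fs = 16000                            # sampling rate (Hz); assumption
frames = np.random.randn(200, 1024)   # stand-in for windowed whisper frames

# Log-magnitude spectra, one row per frame
spectra = np.log(np.abs(np.fft.rfft(frames, axis=1)) + 1e-10)

pca = PCA(n_components=3)
pca.fit(spectra)

# Frequency axis for the rFFT bins
freqs = np.fft.rfftfreq(1024, d=1.0 / fs)

# Bins with the largest loading on PC1 indicate the spectral regions
# that co-vary most strongly across the pitch conditions
top = np.argsort(np.abs(pca.components_[0]))[::-1][:10]
print("dominant frequencies (Hz):", np.sort(freqs[top]))
```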
Learning state labels for sparse classification of speech with matrix deconvolution
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707724
Antti Hurmalainen, T. Virtanen
Abstract: Non-negative spectral factorisation with long temporal context has been successfully used for noise-robust recognition of speech in multi-source environments. Sparse classification from activations of speech atoms can be employed instead of conventional GMMs to determine speech state likelihoods. For accurate classification, correct linguistic state labels must be assigned to speech atoms. We propose using non-negative matrix deconvolution for learning the labels, with algorithms closely matching a framework that separates speech from additive noises. Experiments on the 1st CHiME Challenge corpus show improvement in recognition accuracy over labels acquired from the original atom sources or from previously used least squares regression. The new approach also circumvents numerical issues encountered in previous learning methods and opens up possibilities for new speech basis generation algorithms.
Citations: 6
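The abstract builds on non-negative factorisation with multiplicative updates. The sketch below shows plain KL-divergence NMF, not the convolutive (deconvolution) variant or the authors' label-learning algorithm; it only illustrates the update machinery that non-negative matrix deconvolution extends with a temporal shift over the atoms. All data here is synthetic.

```python
import numpy as np

def nmf_kl(V, rank, iters=200, eps=1e-10):
    """Plain KL-divergence NMF via multiplicative updates.
    Non-negative matrix deconvolution extends the same rule
    with a temporal shift over the spectral atoms."""
    F, T = V.shape
    rng = np.random.default_rng(0)
    W = rng.random((F, rank)) + eps   # spectral atoms
    H = rng.random((rank, T)) + eps   # activations
    for _ in range(iters):
        R = V / (W @ H + eps)
        H *= (W.T @ R) / (W.T @ np.ones_like(V) + eps)
        R = V / (W @ H + eps)
        W *= (R @ H.T) / (np.ones_like(V) @ H.T + eps)
    return W, H

V = np.abs(np.random.randn(257, 100))   # stand-in magnitude spectrogram
W, H = nmf_kl(V, rank=20)
```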
Improved cepstral mean and variance normalization using Bayesian framework
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707722
N. Prasad, S. Umesh
Abstract: Cepstral Mean and Variance Normalization (CMVN) is a computationally efficient normalization technique for noise-robust speech recognition. The performance of CMVN is known to degrade for short utterances, due to insufficient data for parameter estimation and loss of discriminable information, as all utterances are forced to have zero mean and unit variance. In this work, we propose to use posterior estimates of the mean and variance in CMVN instead of the maximum likelihood estimates. This Bayesian approach, in addition to providing a robust estimate of the parameters, is also shown to preserve discriminable information without increasing the computational cost, making it particularly relevant for Interactive Voice Response (IVR)-based applications. The relative WER reductions of this approach with respect to Cepstral Mean Normalization, CMVN and Histogram Equalization are (i) 40.1%, 27% and 4.3% on the Aurora2 database for all utterances, (ii) 25.7%, 38.6% and 30.4% on the Aurora2 database for short utterances, and (iii) 18.7%, 12.6% and 2.5% on the Aurora4 database.
Citations: 51
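The abstract does not spell out the estimator. Below is a minimal sketch of one standard conjugate (normal-inverse-chi-squared) posterior update that realizes the idea: for short utterances the estimates shrink toward prior statistics instead of trusting the ML mean and variance. Hyperparameters and data are illustrative, not taken from the paper.

```python
import numpy as np

def map_mean_var(x, mu0, var0, kappa0=16.0, nu0=16.0):
    """Posterior (MAP-style) mean/variance under a conjugate
    normal-inverse-chi-squared prior; kappa0 and nu0 act as
    pseudo-counts pulling short utterances toward the prior."""
    n = len(x)
    xbar = x.mean()
    ss = ((x - xbar) ** 2).sum()
    kappa_n = kappa0 + n
    mu_n = (kappa0 * mu0 + n * xbar) / kappa_n
    nu_n = nu0 + n
    var_n = (nu0 * var0 + ss
             + kappa0 * n * (xbar - mu0) ** 2 / kappa_n) / nu_n
    return mu_n, var_n

# Short utterance: 30 frames of one cepstral coefficient
x = np.random.randn(30) * 2.0 + 0.5
mu, var = map_mean_var(x, mu0=0.0, var0=1.0)
normalized = (x - mu) / np.sqrt(var)   # CMVN with posterior estimates
```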
ASR for electro-laryngeal speech
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707735
A. Fuchs, J. A. Morales-Cordovilla, Martin Hagmüller
Abstract: The electro-larynx (EL) device offers the possibility of regaining speech when the larynx is removed after a total laryngectomy. Speech produced with an EL suffers from inadequate sound quality, so there is a strong need to enhance EL speech. When disordered speech is fed to Automatic Speech Recognition (ASR) systems, performance decreases significantly. ASR systems are increasingly part of daily life, and the word accuracy rate on disordered speech should therefore be reasonably high to make ASR technologies accessible to patients suffering from speech disorders. Moreover, ASR provides an objective rating of the intelligibility of disordered speech. In this paper we apply disordered speech, namely speech produced by an EL, to an ASR system designed for normal, healthy speech and evaluate its performance with different types of adaptation. Furthermore, we show that two approaches to reducing the directly radiated EL (DREL) noise from the device itself are able to increase the word accuracy rate compared to the unprocessed EL speech.
Citations: 5
Automatic model complexity control for generalized variable parameter HMMs
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707721
Rongfeng Su, Xunying Liu, Lan Wang
Abstract: An important task for speech recognition systems is to handle the mismatch against a target environment introduced by acoustic factors such as variable ambient noise. To address this issue, it is possible to explicitly approximate the continuous trajectory of optimal, well-matched model parameters against the varying noise using, for example, generalized variable parameter HMMs (GVP-HMMs). To improve the generalization and computational efficiency of conventional GVP-HMMs, this paper investigates a novel model complexity control method for GVP-HMMs. The optimal polynomial degrees of Gaussian mean, variance and model space linear transform trajectories are automatically determined at the local level. Significant relative error rate reductions of 20% and 28% were obtained over the multi-style training baseline systems on Aurora 2 and a medium vocabulary Mandarin Chinese speech recognition task, respectively. Consistent performance improvements and a relative model size compression of 57% were also obtained over baseline GVP-HMM systems using a uniformly assigned polynomial degree.
Citations: 7
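GVP-HMMs represent each Gaussian parameter as a polynomial in a conditioning variable such as SNR, so complexity control amounts to picking a polynomial degree per trajectory. A toy sketch of local degree selection follows, with synthetic data and BIC as a stand-in for the paper's (unspecified here) selection criterion:

```python
import numpy as np

snr = np.linspace(0, 20, 9)   # conditioning variable (e.g., SNR in dB)
# Observed optimal means at each noise condition (synthetic)
mu = 0.5 * snr - 0.02 * snr**2 + np.random.randn(9) * 0.3

def bic(y, yhat, k):
    """Bayesian information criterion for a k-parameter fit."""
    n = len(y)
    rss = ((y - yhat) ** 2).sum()
    return n * np.log(rss / n) + k * np.log(n)

# Pick, per parameter trajectory, the polynomial degree with lowest BIC
best = min(range(1, 5),
           key=lambda d: bic(mu, np.polyval(np.polyfit(snr, mu, d), snr),
                             d + 1))
print("selected degree:", best)
```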
Dialogue management for leading the conversation in persuasive dialogue systems
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707715
Takuya Hiraoka, Yuki Yamauchi, Graham Neubig, S. Sakti, T. Toda, Satoshi Nakamura
Abstract: In this research, we propose a probabilistic dialogue modeling method for persuasive dialogue systems that interact with the user based on a specific goal and lead the user to take the actions the system intends from among candidate actions satisfying the user's needs. As a baseline system, we develop a dialogue model assuming the user makes decisions based on preference. We then improve the model by introducing methods to guide the user from topic to topic. We evaluate the system knowledge and dialogue manager in a task that tests the system's persuasive power, and find that the proposed method is effective in this respect.
Citations: 11
Speaker adaptation of neural network acoustic models using i-vectors
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707705
G. Saon, H. Soltau, D. Nahamoo, M. Picheny
Abstract: We propose to adapt deep neural network (DNN) acoustic models to a target speaker by supplying speaker identity vectors (i-vectors) as input features to the network, in parallel with the regular acoustic features for ASR. For both training and test, the i-vector for a given speaker is concatenated to every frame belonging to that speaker and changes across different speakers. Experimental results on a Switchboard 300-hour corpus show that DNNs trained on speaker-independent features and i-vectors achieve a 10% relative improvement in word error rate (WER) over networks trained on speaker-independent features only. These networks are comparable in performance to DNNs trained on speaker-adapted features (with VTLN and FMLLR), with the advantage that only one decoding pass is needed. Furthermore, networks trained on speaker-adapted features and i-vectors achieve a 5-6% relative improvement in WER after Hessian-free sequence training over networks trained on speaker-adapted features only.
Citations: 650
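A minimal sketch of the input construction the abstract describes, assuming an i-vector has already been extracted upstream by a trained total-variability model (the 100-dimensional size and 40-dimensional features are illustrative, not taken from the paper):

```python
import numpy as np

def add_ivector(features, ivector):
    """Append the speaker's i-vector to every acoustic frame.
    features: (T, D) acoustic frames; ivector: (K,) speaker vector.
    Returns (T, D + K) network input."""
    T = features.shape[0]
    return np.hstack([features, np.tile(ivector, (T, 1))])

frames = np.random.randn(300, 40)      # 300 frames of 40-dim features
ivec = np.random.randn(100)            # stand-in 100-dim i-vector
net_input = add_ivector(frames, ivec)  # (300, 140) DNN input
```

The same i-vector is repeated for every frame of a speaker and changes only between speakers, which is what lets the network learn a speaker-dependent bias without a second decoding pass.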
Deep maxout neural networks for speech recognition
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707745
Meng Cai, Yongzhe Shi, Jia Liu
Abstract: A recently introduced type of neural network called maxout has worked well in many domains. In this paper, we propose to apply maxout to acoustic models in speech recognition. The maxout neuron picks the maximum value within a group of linear pieces as its activation. This nonlinearity is a generalization of the rectified nonlinearity and has the ability to approximate any form of activation function. We apply maxout networks to the Switchboard phone-call transcription task and evaluate their performance under both a 24-hour low-resource condition and a 300-hour core condition. Experimental results demonstrate that maxout networks converge faster, generalize better and are easier to optimize than rectified linear networks and sigmoid networks. Furthermore, experiments show that maxout networks reduce underfitting and are able to achieve good results without dropout training. Under both conditions, maxout networks yield relative improvements of 1.1-5.1% over rectified linear networks and 2.6-14.5% over sigmoid networks on benchmark test sets.
Citations: 77
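The maxout activation itself is easy to state in code; a minimal sketch (group size and shapes are illustrative):

```python
import numpy as np

def maxout(z, group_size):
    """Maxout activation: split the linear outputs into groups
    and keep the maximum of each group."""
    batch, units = z.shape
    assert units % group_size == 0
    return z.reshape(batch, units // group_size, group_size).max(axis=2)

z = np.random.randn(8, 512)    # linear layer outputs
h = maxout(z, group_size=2)    # -> (8, 256) activations
```

With two pieces per group a maxout unit can represent, for example, both the rectifier max(0, x) and the absolute value |x|, which is the sense in which it generalizes the rectified nonlinearity.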
Automatic pronunciation clustering using a World English archive and pronunciation structure analysis
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707733
Han-Ping Shen, N. Minematsu, T. Makino, S. Weinberger, T. Pongkittiphan, Chung-Hsien Wu
Abstract: English is the only language available for global communication. Due to the influence of speakers' mother tongues, however, speakers from different regions inevitably have different accents in their pronunciation of English. The ultimate goal of our project is to create a global pronunciation map of World Englishes on an individual basis, which speakers can use to locate similar English pronunciations. If the speaker is a learner, he can also see how his pronunciation compares to other varieties. Creating the map mathematically requires a matrix of pronunciation distances among all the speakers considered. This paper investigates invariant pronunciation structure analysis and Support Vector Regression (SVR) to predict the inter-speaker pronunciation distances. In the experiments, the Speech Accent Archive (SAA), which contains speech data of worldwide accented English, is used for training and testing samples. IPA narrow transcriptions in the archive are used to prepare reference pronunciation distances, which are then predicted by structural analysis and SVR without using the IPA transcriptions. The correlation between the reference distances and the predicted distances is calculated. Experimental results are very promising, and our proposed method far outperforms a baseline system developed using an HMM-based phoneme recognizer.
Citations: 6
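A minimal sketch of the regression step, with random stand-ins for the structure-based pairwise features and the IPA-derived reference distances (feature dimension and train/test split are hypothetical):

```python
import numpy as np
from sklearn.svm import SVR

# One structure-based feature vector per speaker pair (synthetic)
X = np.random.randn(500, 64)       # hypothetical pairwise features
y = np.abs(np.random.randn(500))   # reference IPA-based distances

model = SVR(kernel="rbf", C=1.0, epsilon=0.1)
model.fit(X[:400], y[:400])
pred = model.predict(X[400:])

# Evaluation mirrors the paper: correlation between reference
# and predicted inter-speaker distances
r = np.corrcoef(y[400:], pred)[0, 1]
print("correlation:", r)
```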
Hybrid acoustic models for distant and multichannel large vocabulary speech recognition
2013 IEEE Workshop on Automatic Speech Recognition and Understanding | Pub Date: 2013-12-01 | DOI: 10.1109/ASRU.2013.6707744
P. Swietojanski, Arnab Ghoshal, S. Renals
Abstract: We investigate the application of deep neural network (DNN)-hidden Markov model (HMM) hybrid acoustic models for far-field speech recognition of meetings recorded using microphone arrays. We show that the hybrid models achieve significantly better accuracy than conventional systems based on Gaussian mixture models (GMMs). We observe up to 8% absolute word error rate (WER) reduction from a discriminatively trained GMM baseline when using a single distant microphone, and between 4-6% absolute WER reduction when using beamforming on various combinations of array channels. By training the networks on audio from multiple channels, we find the networks can recover a significant part of the accuracy difference between the single-distant-microphone and beamformed configurations. Finally, we show that the accuracy of a network recognising speech from a single distant microphone can approach that of a multi-microphone setup by training with data from other microphones.
Citations: 112
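The abstract treats beamforming as a given preprocessing step. For illustration only, the simplest member of that family, a delay-and-sum beamformer over time-aligned channels, can be sketched as below (signals and TDOAs are synthetic; note that np.roll wraps around, where a real implementation would zero-pad):

```python
import numpy as np

def delay_and_sum(channels, delays, fs=16000):
    """Time-align each microphone channel by its delay (seconds)
    and average: the simplest beamformer used to combine array
    channels before recognition."""
    out = np.zeros(channels.shape[1])
    for sig, d in zip(channels, delays):
        shift = int(round(d * fs))
        out += np.roll(sig, -shift)   # wraps; real systems pad instead
    return out / len(channels)

mics = np.random.randn(8, 16000)   # 8 channels, 1 s of audio
tdoa = [0.0, 1e-4, 2e-4, 0.0, -1e-4, 0.0, 1e-4, 2e-4]  # hypothetical TDOAs
beam = delay_and_sum(mics, tdoa)
```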