2010 IEEE International Conference on Acoustics, Speech and Signal Processing最新文献

Interactive tone mapping for High Dynamic Range video 高动态范围视频的交互式色调映射

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-09-14 DOI: 10.1109/ICASSP.2010.5495318

Zhe Wang, J. Zhai, Zhang Tao, J. Llach

引用次数: 0

Predicting interruptions in dyadic spoken interactions 预测二元口语互动中的中断

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-06-28 DOI: 10.1109/ICASSP.2010.5494991

Chi-Chun Lee, Shrikanth S. Narayanan

引用次数: 25

Simple methods for improving speaker-similarity of HMM-based speech synthesis 提高基于hmm的语音合成中说话人相似度的简单方法

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-06-28 DOI: 10.1109/ICASSP.2010.5495562

J. Yamagishi, Simon King

引用次数: 18

Model-based dereverberation in the logmelspec domain for robust distant-talking speech recognition 基于logmelspec域模型的鲁棒远距离语音识别去噪

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-06-28 DOI: 10.1109/ICASSP.2010.5495671

A. Sehr, R. Maas, Walter Kellermann

引用次数: 18

A hybrid physical and statistical dynamic articulatory framework incorporating analysis-by-synthesis for improved phone classification 一个混合物理和统计动态发音框架，结合综合分析改进电话分类

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-06-28 DOI: 10.1109/ICASSP.2010.5495696

Ziad Al Bawab, B. Raj, R. Stern

引用次数: 1

Search error risk minimization in Viterbi beam search for speech recognition 语音识别中维特比波束搜索误差风险最小化

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-06-28 DOI: 10.21437/Interspeech.2010-101

Takaaki Hori, Shinji Watanabe, Atsushi Nakamura

{"title":"Search error risk minimization in Viterbi beam search for speech recognition","authors":"Takaaki Hori, Shinji Watanabe, Atsushi Nakamura","doi":"10.21437/Interspeech.2010-101","DOIUrl":"https://doi.org/10.21437/Interspeech.2010-101","url":null,"abstract":"This paper proposes a method to optimize Viterbi beam search based on search error risk minimization in large vocabulary continuous speech recognition (LVCSR). Most speech recognizers employ beam search to speed up the decoding process, in which unpromising partial hypotheses are pruned during decoding. However, the pruning step involves the risk of missing the best complete hypothesis by discarding a partial hypothesis that might grow into the best. Missing the best hypothesis is called search error. Our purpose is to reduce search error by optimizing the pruning step. While conventional methods use heuristic criteria to prune each hypothesis based on its score, rank, and so on, our proposed method introduces a pruning function that makes a more precise decision using the rich features extracted from each hypothesis. The parameters of the function can be estimated efficiently to minimize the search error risk using recognition lattices at the training step. We implemented the new method in a WFST-based decoder and achieved a significant reduction of search errors in a 200K-word LVCSR task.","PeriodicalId":293333,"journal":{"name":"2010 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129897991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Convergence analysis of consensus-based distributed clustering 基于共识的分布式聚类收敛性分析

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-06-28 DOI: 10.1109/ICASSP.2010.5495344

P. Forero, A. Cano, G. Giannakis

引用次数: 3

Sparse variable noisy PCA using l0 penalty 稀疏变量噪声PCA使用10惩罚

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495788

M. Ulfarsson, V. Solo

引用次数: 6

A bounded trust region optimization for discriminative training of HMMS in speech recognition 语音识别中hmm识别训练的有界信任域优化

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5495111

Cong Liu, Yu Hu, Hui Jiang, Lirong Dai

引用次数: 1

Music dereverberation using harmonic structure source model and Wiener filter 利用谐波结构源模型和维纳滤波器实现音乐去噪

2010 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2010-03-14 DOI: 10.1109/ICASSP.2010.5496223

Naoki Yasuraoka, Takuya Yoshioka, T. Nakatani, Atsushi Nakamura, HIroshi G. Okuno

引用次数: 10