2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Latest Papers (Page 10)

Attaining fundamental bounds on timing synchronization

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6289099

P. Bidigare, Upamanyu Madhow, R. Mudumbai, D. Scherber

Citations: 30

Audio event detection from acoustic unit occurrence patterns

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6287923

Anurag Kumar, Pranay Dighe, Rita Singh, Sourish Chaudhuri, B. Raj

Citations: 58

A Bayesian framework for robust speech enhancement under varying contexts

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6288932

D. Hanumantha Rao Naidu, Sriram Srinivasan

Citations: 6

Improving Arabic broadcast transcription using automatic topic clustering

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6288907

Stephen M. Chu, L. Mangu

Citations: 2

Design and implementation of a fully integrated compressed-sensing signal acquisition system

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6289123

Juhwan Yoo, Stephen Becker, M. Monge, M. Loh, E. Candès, A. Emami-Neyestanak

Citations: 73

A model structure integration based on a Bayesian framework for speech recognition

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6288996

Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, K. Tokuda

Citations: 0

Generalized k-labelset ensemble for multi-label classification

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6288315

Hung-Yi Lo, Shou-de Lin, H. Wang

Citations: 3

On the identifiability of multi-observer hidden Markov models

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6288268

H. Nguyen, M. Roughan

Citations: 4

Adaptive parameter selection for asynchronous intrafascicular multi-electrode stimulation

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6287993

M. A. Frankel, G. Clark, S. Meek, R. Normann, V. J. Mathews

Citations: 2

Robust speech recognition through selection of speaker and environment transforms

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI: 10.1109/ICASSP.2012.6288878

Raghavendra Bilgi, Vikas Joshi, S. Umesh, Luz García, M. C. Benítez

Abstract: In this paper, we address the problem of robustness to both noise and speaker variability in automatic speech recognition (ASR). We propose the use of pre-computed noise and speaker transforms; an optimal combination of these two transforms is chosen during test using the maximum-likelihood (ML) criterion. These pre-computed transforms are obtained during training by using data from different noise conditions that are usually encountered for that particular ASR task. The environment transforms are obtained during training using the constrained-MLLR (CMLLR) framework, while for speaker transforms we use the analytically determined linear-VTLN matrices. Even though the exact noise environment may not be encountered during test, the ML-based choice of the closest environment transform provides “sufficient” cleaning, and this is corroborated by experimental results with performance comparable to histogram equalization or Vector Taylor Series approaches on the Aurora-2 task. The proposed method is simple since it involves only the choice of pre-computed environment and speaker transforms, and it can therefore be applied with very little test data, unlike many other speaker- and noise-compensation methods.

Pages: 4333-4336

Citations: 0
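
The maximum-likelihood selection step described in the abstract of the last entry above can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a diagonal-covariance GMM as a stand-in for the acoustic model, affine environment transforms applied before linear speaker transforms, and a log-determinant (Jacobian) term in the score, as is usual for feature-space transforms. The names `gmm_loglik` and `select_transforms` and all parameters are hypothetical.

```python
import itertools
import numpy as np


def gmm_loglik(x, weights, means, variances):
    """Total log-likelihood of frames x (T, D) under a diagonal-covariance GMM.

    weights: (K,), means: (K, D), variances: (K, D).
    """
    diff = x[:, None, :] - means[None, :, :]                      # (T, K, D)
    log_comp = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)                     # (T, K)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)         # (K,)
    )
    log_comp = log_comp + np.log(weights)
    # Log-sum-exp over components for numerical stability.
    m = log_comp.max(axis=1, keepdims=True)
    per_frame = m[:, 0] + np.log(np.exp(log_comp - m).sum(axis=1))
    return float(per_frame.sum())


def select_transforms(x, env_transforms, spk_transforms, gmm):
    """Pick the (environment, speaker) pair that maximizes the likelihood.

    env_transforms: list of (A, b) affine transforms (assumed CMLLR-like).
    spk_transforms: list of square matrices (assumed linear-VTLN-like).
    gmm: tuple (weights, means, variances).
    Returns (env_index, spk_index, score).
    """
    n_frames = x.shape[0]
    best = (None, None, -np.inf)
    pairs = itertools.product(enumerate(env_transforms), enumerate(spk_transforms))
    for (i, (a_env, b_env)), (j, a_spk) in pairs:
        # Assumed order: environment transform first, then speaker transform.
        y = (x @ a_env.T + b_env) @ a_spk.T
        # Jacobian term so that scores are comparable across transforms.
        _, logdet_env = np.linalg.slogdet(a_env)
        _, logdet_spk = np.linalg.slogdet(a_spk)
        score = gmm_loglik(y, *gmm) + n_frames * (logdet_env + logdet_spk)
        if score > best[2]:
            best = (i, j, score)
    return best
```

Because the search only scores a small, fixed set of pre-computed candidates, it needs no transform estimation at test time, which is consistent with the abstract's claim that very little test data is required.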