Title: Resolution enhancement for hyperspectral images: A super-resolution and fusion approach
Authors: C. Kwan, J. H. Choi, Stanley H. Chan, Jin Zhou, Bence Budavari
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7953344
Abstract: Many remote sensing applications require a high-resolution hyperspectral image. However, the resolution of most hyperspectral imagers is limited to tens of meters. Existing resolution enhancement techniques either acquire additional multispectral band images or use a pan band image; the former poses hardware challenges, whereas the latter has limited performance. In this paper, we present a new resolution enhancement method that requires only a color image. Our approach integrates two newly developed techniques: (1) a hybrid color mapping algorithm, and (2) a Plug-and-Play algorithm for single-image super-resolution. Comprehensive experiments using real hyperspectral images are conducted to validate and evaluate the proposed method.
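The core idea of mapping a color image to hyperspectral bands can be illustrated with a plain least-squares color mapping on synthetic data. This is a minimal sketch of the linear mapping step only, not the authors' full hybrid color mapping or Plug-and-Play pipeline; all array sizes and data here are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic example: N co-registered pixels, 3 color bands, B hyperspectral bands.
# A hidden linear map T_true relates color vectors to hyperspectral vectors.
N, B = 1000, 30
T_true = rng.random((B, 3))
rgb = rng.random((N, 3))
hsi = rgb @ T_true.T + 0.01 * rng.standard_normal((N, B))

# Least-squares fit of the color-to-hyperspectral mapping from training pairs.
T_hat, *_ = np.linalg.lstsq(rgb, hsi, rcond=None)   # shape (3, B)

# Apply the learned map to (high-resolution) color pixels.
hsi_pred = rgb @ T_hat
rel_err = np.linalg.norm(hsi_pred - hsi) / np.linalg.norm(hsi)
print(rel_err)  # small, since the synthetic data is nearly linear
```

In practice the mapping would be learned at the low resolution where both modalities are available, then applied to the high-resolution color image.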
Title: Statistics of natural fused image distortions
Authors: D. E. Moreno-Villamarín, H. Benítez-Restrepo, A. Bovik
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7952355
Abstract: The capability to automatically evaluate the quality of long wave infrared (LWIR) and visible light images has the potential to play an important role in determining and controlling the quality of a resulting fused LWIR-visible image. Extensive work has been conducted on studying the statistics of natural LWIR and visible light images. Nonetheless, there has been little work done on analyzing the statistics of fused images and associated distortions. In this paper, we study the natural scene statistics (NSS) of fused images and how they are affected by several common types of distortions, including blur, white noise, JPEG compression, and non-uniformity (NU). Based on the results of a separate subjective study on the quality of pristine and degraded fused images, we propose an opinion-aware (OA) fused image quality analyzer, whose relative predictions with respect to other state-of-the-art metrics correlate better with human perceptual evaluations.
Title: A multiple bandwidth objective speech intelligibility estimator based on articulation index band correlations and attention
Authors: S. Voran
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7953128
Abstract: We present ABC-MRT16—a new algorithm for objective estimation of speech intelligibility following the Modified Rhyme Test (MRT) paradigm. ABC-MRT16 is simple, effective, and robust. When compared to subjective MRT data from 367 diverse conditions that include coding, noise, frame erasures, and much more, ABC-MRT16 (containing just one optimized parameter) yields a very high Pearson correlation (above 0.95) and a remarkably low RMS estimation error (below 7% of full scale). We attribute these successes to concise modeling of core human processes in audition and forced-choice word selection. On each trial, ABC-MRT16 gathers word selection evidence in the form of articulation index band correlations and then uses a simple attention model to perform word selection using the best available evidence. Attending to best evidence allows ABC-MRT16 to work well for narrowband, wideband, superwideband, and fullband speech and noise without any bandwidth detection algorithm or side information.
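The "attend to best evidence" idea can be sketched abstractly: given per-band correlations between a received trial and each candidate word's template, pool only the strongest bands before choosing a word. This is a hypothetical toy model of the selection step, not the ABC-MRT16 algorithm itself; the function name, array layout, and numbers are invented for illustration.

```python
import numpy as np

def select_word(band_corrs, k=4):
    """Pick the candidate word whose best per-band evidence is strongest.
    band_corrs: (num_words, num_bands) array of correlations between the
    received trial and each word's template. A simple attention model keeps
    only the top-k bands per word, so unusable bands (e.g. outside the
    signal's bandwidth) are ignored."""
    top = np.sort(band_corrs, axis=1)[:, -k:]   # k strongest bands per word
    scores = top.mean(axis=1)                   # pooled best evidence
    return int(np.argmax(scores))

# Toy trial: word 2 matches well in a few bands, the others do not.
corrs = np.array([
    [0.2, 0.1, 0.3, 0.2, 0.1, 0.2],
    [0.3, 0.2, 0.1, 0.3, 0.2, 0.1],
    [0.1, 0.9, 0.8, 0.2, 0.9, 0.7],
])
print(select_word(corrs))  # -> 2
```

Pooling only the strongest bands is what lets such a scheme work across narrowband through fullband input without detecting the bandwidth explicitly.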
Title: An LSTM-CTC based verification system for proxy-word based OOV keyword search
Authors: Zhiqiang Lv, Jian Kang, Weiqiang Zhang, Jia Liu
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7953239
Abstract: Proxy-word based out-of-vocabulary (OOV) keyword search has proven to be quite effective. Each OOV keyword is assigned several proxies, and detections of the proxies are regarded as detections of the OOV keyword. However, the confidence scores of these detections are still those of the proxies, taken from lattices. To obtain a better confidence measure, we employ an LSTM-CTC verification method in this work, and the confidence scores are regenerated. OOV keyword search results on the evalpart1 dataset of the OpenKWS16 Evaluation show consistent improvement, with a maximum relative improvement of 21.06% on the MTWV metric.
Title: Sparse eigenvectors of graphs
Authors: Oguzhan Teke, P. Vaidyanathan
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7952888
Abstract: In order to analyze signals defined over graphs, many concepts from the classical signal processing theory have been extended to the graph case. One of these concepts is the uncertainty principle, which studies the concentration of a signal on a graph and its graph Fourier basis (GFB). An eigenvector of a graph is the most localized signal in the GFB by definition, whereas it may not be localized in the vertex domain. However, if the eigenvector itself is sparse, then it is concentrated in both domains simultaneously. In this regard, this paper studies the necessary and sufficient conditions for the existence of 1, 2, and 3-sparse eigenvectors of the graph Laplacian. The provided conditions are purely algebraic and only use the adjacency information of the graph. Examples of both classical and real-world graphs with sparse eigenvectors are also presented.
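A classical way that 2-sparse Laplacian eigenvectors arise is from "twin" vertices with identical neighborhoods: if non-adjacent vertices i and j both connect to exactly the same set of other vertices, then e_i - e_j is an eigenvector of the combinatorial Laplacian with eigenvalue equal to their common degree. The following numeric check is a small illustration on a hypothetical 4-vertex graph, not a reproduction of the paper's general conditions.

```python
import numpy as np

# Two non-adjacent "twin" vertices (0 and 1) share the neighborhood {2, 3};
# vertices 2 and 3 are also joined to each other.
A = np.array([[0, 0, 1, 1],
              [0, 0, 1, 1],
              [1, 1, 0, 1],
              [1, 1, 1, 0]])
L = np.diag(A.sum(axis=1)) - A          # combinatorial graph Laplacian

v = np.array([1.0, -1.0, 0.0, 0.0])    # 2-sparse candidate eigenvector
print(L @ v)                            # -> [ 2. -2.  0.  0.] = 2 * v
```

Here v is supported on only two vertices yet is an exact eigenvector (eigenvalue 2, the shared degree), so it is maximally localized in both the vertex domain and the GFB, which is exactly the situation the paper characterizes.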
Title: Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection
Authors: Yougen Yuan, C. Leung, Lei Xie, Hongjie Chen, B. Ma, Haizhou Li
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7953237
Abstract: We propose to use a feature representation obtained by pairwise learning in a low-resource language for query-by-example spoken term detection (QbE-STD). We assume that word pairs identified by humans are available in the low-resource target language. The word pairs are parameterized by a multi-lingual bottleneck feature (BNF) extractor that is trained using transcribed data in high-resource languages. The multi-lingual BNFs of the word pairs are used as an initial feature representation to train an autoencoder (AE). We extract features from an internal hidden layer of the pairwise trained AE to perform acoustic pattern matching for QbE-STD. Our experiments on the TIMIT and Switchboard corpora show that the pairwise learning brings 7.61% and 8.75% relative improvements in mean average precision (MAP) respectively over the initial feature representation.
Title: A non-intrusive Short-Time Objective Intelligibility measure
Authors: A. H. Andersen, Jan Mark de Haan, Z. Tan, J. Jensen
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7953125
Abstract: We propose a non-intrusive intelligibility measure for noisy and non-linearly processed speech, i.e. a measure which can predict intelligibility from a degraded speech signal without requiring a clean reference signal. The proposed measure is based on the Short-Time Objective Intelligibility (STOI) measure. In particular, the non-intrusive STOI measure estimates clean signal amplitude envelopes from the degraded signal. Subsequently, the STOI measure is evaluated by use of the envelopes of the degraded signal and the estimated clean envelopes. The performance of the proposed measure is evaluated on a dataset including speech in different noise types, processed with binary masks. The measure is shown to predict intelligibility well in all tested conditions, with the exception of those including a single competing speaker. While the measure does not perform as well as the original (intrusive) STOI measure, it is shown to outperform existing non-intrusive measures.
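The intrusive core that STOI-style measures build on is a short-time correlation between reference and degraded amplitude envelopes. The sketch below shows only that simplified envelope-correlation idea on toy data; it omits STOI's actual 1/3-octave filterbank, normalization, and clipping, and the paper's contribution is precisely to replace the clean reference envelopes with estimates computed from the degraded signal alone. All names and data here are hypothetical.

```python
import numpy as np

def envelope_correlation(env_ref, env_deg, win=30):
    """Simplified STOI-style score: mean correlation between reference and
    degraded amplitude envelopes over short non-overlapping windows.
    env_ref, env_deg: (bands, frames) arrays of envelope samples."""
    scores = []
    for b in range(env_ref.shape[0]):
        for t in range(0, env_ref.shape[1] - win + 1, win):
            r = env_ref[b, t:t + win]
            d = env_deg[b, t:t + win]
            r = (r - r.mean()) / (r.std() + 1e-12)   # zero-mean, unit-std
            d = (d - d.mean()) / (d.std() + 1e-12)
            scores.append(np.mean(r * d))            # per-window correlation
    return float(np.mean(scores))

rng = np.random.default_rng(1)
clean = rng.random((15, 300))                    # toy envelopes: 15 bands
noisy = clean + 0.5 * rng.random((15, 300))      # additive degradation
print(envelope_correlation(clean, clean) > envelope_correlation(clean, noisy))  # True
```

A non-intrusive variant would call the same scoring function with an estimated clean envelope in place of `clean`, which is what makes a reference-free prediction possible.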
Title: Unsupervised learning of asymmetric high-order autoregressive stochastic volatility model
Authors: I. Gorynin, E. Monfrini, W. Pieczynski
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7953064
Abstract: The object of this paper is to introduce a new estimation algorithm specifically designed for latent high-order autoregressive models. It implements the concept of filter-based maximum likelihood. Our approach is fully deterministic and is less computationally demanding than traditional Markov chain Monte Carlo techniques. Simulation experiments and real-world data processing confirm the effectiveness of our approach.
Title: Non-separable quadruple lifting structure for four-dimensional integer Wavelet Transform with reduced rounding noise
Authors: Fairoza Amira Hamzah, Taichi Yoshida, M. Iwahashi
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7952336
Abstract: The wavelet transform (WT) in JPEG 2000 uses a 'separable' lifting structure, in which a one-dimensional (1D) transform is applied to a multidimensional image signal along each of its spatial and temporal dimensions. An existing 'non-separable' three-dimensional (3D) structure minimizes the number of lifting steps, and for the (5,3) type transform used in lossless coding it has been proved to reduce the rounding noise inside the transform. In the (9,7) type transform used in lossy coding, however, the rounding noise inside the 'non-separable' 3D structure increases. This paper proposes a new 'non-separable' two-dimensional (2D) structure for integer implementation of a four-dimensional (4D) quadruple lifting WT. Since the order of the original lifting steps is preserved, the total rounding noise observed in the pixel values of the decoded image is significantly reduced, and the lossy coding performance for 4D input signals is increased.
Title: Improved cepstra minimum-mean-square-error noise reduction algorithm for robust speech recognition
Authors: Jinyu Li, Yan Huang, Y. Gong
In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2017. DOI: 10.1109/ICASSP.2017.7953081
Abstract: In the era of deep learning, although beam-forming multi-channel signal processing is still very helpful, it was reported that single-channel robust front-ends usually cannot benefit deep learning models, because the layer-by-layer structure of deep learning models provides a feature extraction strategy that automatically derives powerful noise-resistant features from primitive raw data for senone classification. In this study, we show that a single-channel robust front-end is still very beneficial to deep learning modeling as long as it is well designed. We improve a robust front-end, cepstra minimum mean square error (CMMSE), by using a more reliable voice activity detector, refined prior SNR estimation, better gain smoothing, and two-stage processing. This new front-end, improved CMMSE (ICMMSE), is evaluated on the standard Aurora 2 and CHiME-3 tasks, and on a 3400-hour Microsoft Cortana digital assistant task, using Gaussian mixture models, feed-forward deep neural networks, and long short-term memory recurrent neural networks, respectively. It is shown that ICMMSE is superior regardless of the underlying acoustic models and the scale of evaluation tasks, with a 25.46% relative WER reduction on Aurora 2, up to 11.98% relative WER reduction on CHiME-3, and up to 11.01% relative WER reduction on the Cortana digital assistant task, respectively.
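Two of the ingredients named in the abstract, prior SNR estimation and gain smoothing, can be illustrated with a classic decision-directed gain track. This is a hedged toy sketch of that standard MMSE-family ingredient, not the paper's actual ICMMSE recipe; the function, constants, and spectra below are invented for illustration.

```python
import numpy as np

def smoothed_gain(noisy_psd, noise_psd, alpha=0.98, g_min=0.1):
    """Toy Wiener-style suppression gain with decision-directed prior-SNR
    smoothing. noisy_psd: (frames, bins) per-frame power spectra;
    noise_psd: (bins,) noise power estimate."""
    gains = []
    prev = np.zeros_like(noise_psd)                    # G^2 * posterior SNR
    for frame in noisy_psd:
        post = frame / noise_psd                       # a posteriori SNR
        prior = alpha * prev + (1 - alpha) * np.maximum(post - 1.0, 0.0)
        g = np.maximum(prior / (1.0 + prior), g_min)   # floored Wiener gain
        gains.append(g)
        prev = g * g * post                            # decision-directed update
    return np.array(gains)

# Toy spectrum: bin 0 carries strong speech energy, the rest are noise-only.
frames = np.ones((20, 4))
frames[:, 0] = 50.0
g = smoothed_gain(frames, np.ones(4))
print(g[-1])   # bin 0 gain near 1, noise-only bins held at the floor
```

The smoothing constant `alpha` and the gain floor `g_min` trade off musical-noise suppression against responsiveness, which is the kind of tuning the paper's refined SNR estimation and gain smoothing address.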