A noise-robust ASR front-end using Wiener filter constructed from MMSE estimation of clean speech and noise

2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721) Pub Date : 2003-11-30 DOI:10.1109/ASRU.2003.1318461

Jian Wu, J. Droppo, L. Deng, A. Acero

引用次数: 25

Abstract

In this paper, we present a novel two-stage framework for designing a noise-robust front-end for automatic speech recognition. In the first stage, a parametric model of acoustic distortion is used to estimate the clean speech and noise spectra in a principled way so that no heuristic parameters need to be set manually. To reduce possible flaws caused by the simplifying assumptions in the parametric model, a second-stage Wiener filtering is applied to further reduce the noise while preserving speech spectra unharmed. This front-end is evaluated on the Aurora2 task. For the multi-condition training scenario, a relative error reduction of 28.4% is achieved.

查看原文本刊更多论文

基于噪声和干净语音的MMSE估计构建了一种基于维纳滤波器的抗噪ASR前端

在本文中，我们提出了一种新的两阶段框架来设计自动语音识别的噪声鲁棒前端。在第一阶段，采用参数化的声失真模型，有原则地估计干净的语音和噪声频谱，从而不需要手动设置启发式参数。为了减少参数模型中简化假设可能造成的缺陷，在保持语音频谱不受损害的情况下，应用第二阶段维纳滤波进一步降低噪声。该前端在Aurora2任务上进行评估。对于多条件训练场景，实现了28.4%的相对误差降低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)

自引率

0.00%

发文量