Advanced speech enhancement with partial speech reconstruction

21st European Signal Processing Conference (EUSIPCO 2013) Pub Date : 2013-09-09 DOI:10.5281/ZENODO.43592

P. Hannon, M. Krini, Ingo Schalk-Schupp

引用次数: 2

Abstract

An advanced speech enhancement algorithm is proposed, which employs partial speech reconstruction of highly disturbed speech. The speech reconstruction algorithms assume the source-filter model of speech production and construct estimates of clean speech source and filter signals using features extracted from noisy input. A nonlinear harmonic regeneration scheme for source signals is presented followed by two methods for the estimation of the vocal tract filter characteristics. The quantization method applies a priori trained codebooks using clean speech training data and the parametric estimation method assumes a parabolic continuation of low frequency envelope values. The predicted speech quality of the enhanced speech output is assessed with composite objective measures, while the accuracy of the spectral envelope estimations is analyzed with the log-spectral distance over four manually generated signal-to-noise ratio scenarios.

查看原文本刊更多论文

部分语音重建的高级语音增强

提出了一种对高干扰语音进行部分重构的语音增强算法。语音重建算法假设语音产生的源-滤波器模型，并使用从噪声输入中提取的特征构造干净语音源和滤波器信号的估计。提出了一种源信号的非线性谐波再生方案，并给出了两种估计声道滤波器特性的方法。量化方法采用使用干净语音训练数据的先验训练码本，参数估计方法假设低频包络值的抛物线延连续。利用复合客观度量对增强语音输出的预测语音质量进行评估，同时利用对数谱距离对四种人工生成的信噪比场景下的频谱包络估计的准确性进行分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

21st European Signal Processing Conference (EUSIPCO 2013)

自引率

0.00%

发文量