Advanced speech enhancement with partial speech reconstruction

P. Hannon, M. Krini, Ingo Schalk-Schupp
{"title":"Advanced speech enhancement with partial speech reconstruction","authors":"P. Hannon, M. Krini, Ingo Schalk-Schupp","doi":"10.5281/ZENODO.43592","DOIUrl":null,"url":null,"abstract":"An advanced speech enhancement algorithm is proposed, which employs partial speech reconstruction of highly disturbed speech. The speech reconstruction algorithms assume the source-filter model of speech production and construct estimates of clean speech source and filter signals using features extracted from noisy input. A nonlinear harmonic regeneration scheme for source signals is presented followed by two methods for the estimation of the vocal tract filter characteristics. The quantization method applies a priori trained codebooks using clean speech training data and the parametric estimation method assumes a parabolic continuation of low frequency envelope values. The predicted speech quality of the enhanced speech output is assessed with composite objective measures, while the accuracy of the spectral envelope estimations is analyzed with the log-spectral distance over four manually generated signal-to-noise ratio scenarios.","PeriodicalId":400766,"journal":{"name":"21st European Signal Processing Conference (EUSIPCO 2013)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"21st European Signal Processing Conference (EUSIPCO 2013)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.43592","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

An advanced speech enhancement algorithm is proposed, which employs partial speech reconstruction of highly disturbed speech. The speech reconstruction algorithms assume the source-filter model of speech production and construct estimates of clean speech source and filter signals using features extracted from noisy input. A nonlinear harmonic regeneration scheme for source signals is presented followed by two methods for the estimation of the vocal tract filter characteristics. The quantization method applies a priori trained codebooks using clean speech training data and the parametric estimation method assumes a parabolic continuation of low frequency envelope values. The predicted speech quality of the enhanced speech output is assessed with composite objective measures, while the accuracy of the spectral envelope estimations is analyzed with the log-spectral distance over four manually generated signal-to-noise ratio scenarios.
部分语音重建的高级语音增强
提出了一种对高干扰语音进行部分重构的语音增强算法。语音重建算法假设语音产生的源-滤波器模型,并使用从噪声输入中提取的特征构造干净语音源和滤波器信号的估计。提出了一种源信号的非线性谐波再生方案,并给出了两种估计声道滤波器特性的方法。量化方法采用使用干净语音训练数据的先验训练码本,参数估计方法假设低频包络值的抛物线延连续。利用复合客观度量对增强语音输出的预测语音质量进行评估,同时利用对数谱距离对四种人工生成的信噪比场景下的频谱包络估计的准确性进行分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信