An estimation theory-based approach for speech enhancement

Mirishkar Sai Ganesh, M. Karthik, B. Patnaik
2017 Fourth International Conference on Signal Processing, Communication and Networking (ICSCN)
Published: 2017-03-01
DOI: 10.1109/ICSCN.2017.8085702 (https://doi.org/10.1109/ICSCN.2017.8085702)

Abstract

This contribution presents an efficient speech enhancement technique that uses statistical estimators based on the squared-magnitude spectrum. Any speech enhancement system requires an estimate of the power spectral density. Because conventional noise-elimination methods fail owing to the non-stationary properties of the speech signal, minimum mean square error (MMSE) and maximum a posteriori (MAP) estimators are derived under a Gaussian statistical model. The gain function obtained for the MAP estimator is the same as the gain function used in ideal binary masking: the mask depends on the signal-to-noise ratio (SNR), taking the value 1 if the SNR exceeds 0 dB and 0 otherwise. The results show that the proposed estimator enhances the speech signal better than the standard MMSE spectral power estimator, with low residual noise and low speech distortion.
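The binary masking rule described in the abstract (1 if the SNR exceeds 0 dB, otherwise 0) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `binary_mask_gain` and `apply_mask`, the use of NumPy, and the assumption that per-bin speech and noise power estimates are already available are all hypothetical choices made for illustration.

```python
import numpy as np


def binary_mask_gain(speech_power, noise_power, threshold_db=0.0):
    """Return 1.0 where the per-bin SNR exceeds threshold_db, else 0.0.

    speech_power and noise_power are assumed to be per-bin power
    estimates of equal shape; how they are estimated is outside the
    scope of this sketch.
    """
    speech_power = np.asarray(speech_power, dtype=float)
    noise_power = np.asarray(noise_power, dtype=float)
    # A tiny constant guards against division by zero and log of zero.
    eps = np.finfo(float).tiny
    snr_db = 10.0 * np.log10((speech_power + eps) / (noise_power + eps))
    # Strict comparison: an SNR of exactly 0 dB maps to 0.
    return (snr_db > threshold_db).astype(float)


def apply_mask(noisy_power, gain):
    """Keep the bins where gain is 1 and zero out the rest."""
    return np.asarray(noisy_power, dtype=float) * gain


if __name__ == "__main__":
    # Three example bins with SNRs of about +6 dB, 0 dB and -3 dB.
    gain = binary_mask_gain([4.0, 1.0, 0.5], [1.0, 1.0, 1.0])
    print(gain)
    print(apply_mask([2.0, 2.0, 2.0], gain))
```

The threshold is left as a parameter so the 0 dB rule stated in the abstract is the default rather than a hard-coded constant. In practice the true speech power is unknown, so the SNR would have to be estimated from the noisy observation.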