基于Unscented变换的双通道噪声估计:在智能手机语音增强中的应用

I. López-Espejo, J. M. Martín-Doñas, A. Gómez, A. Peinado
{"title":"基于Unscented变换的双通道噪声估计:在智能手机语音增强中的应用","authors":"I. López-Espejo, J. M. Martín-Doñas, A. Gómez, A. Peinado","doi":"10.1109/TSP.2018.8441269","DOIUrl":null,"url":null,"abstract":"Some speech processing approaches rely on a noise estimation stage and their performance depends on the accuracy of the noise estimates. In this paper we propose a novel minimum mean square error (MMSE) noise estimator that takes advantage of dual-channel noisy observations to avoid the use of a clean speech model while keeping a simple formulation. The parameters of this estimator are obtained through the unscented transform (UT), which is able to compute better quality statistics in a more efficient way than through classical vector Taylor series (VTS) linearization. For evaluation, speech enhancement on a dual-microphone smartphone in close-talk conditions is considered, which is a particular application of interest. Results show the superiority of our proposal with respect to other single-and dual-channel noise estimation methods in terms of different measures such as estimation accuracy as well as on the enhanced speech signal, i.e., speech quality and intelligibility.","PeriodicalId":383018,"journal":{"name":"2018 41st International Conference on Telecommunications and Signal Processing (TSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Unscented Transform-Based Dual-Channel Noise Estimation: Application to Speech Enhancement on Smartphones\",\"authors\":\"I. López-Espejo, J. M. Martín-Doñas, A. Gómez, A. Peinado\",\"doi\":\"10.1109/TSP.2018.8441269\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Some speech processing approaches rely on a noise estimation stage and their performance depends on the accuracy of the noise estimates. In this paper we propose a novel minimum mean square error (MMSE) noise estimator that takes advantage of dual-channel noisy observations to avoid the use of a clean speech model while keeping a simple formulation. The parameters of this estimator are obtained through the unscented transform (UT), which is able to compute better quality statistics in a more efficient way than through classical vector Taylor series (VTS) linearization. For evaluation, speech enhancement on a dual-microphone smartphone in close-talk conditions is considered, which is a particular application of interest. Results show the superiority of our proposal with respect to other single-and dual-channel noise estimation methods in terms of different measures such as estimation accuracy as well as on the enhanced speech signal, i.e., speech quality and intelligibility.\",\"PeriodicalId\":383018,\"journal\":{\"name\":\"2018 41st International Conference on Telecommunications and Signal Processing (TSP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 41st International Conference on Telecommunications and Signal Processing (TSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TSP.2018.8441269\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 41st International Conference on Telecommunications and Signal Processing (TSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSP.2018.8441269","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

一些语音处理方法依赖于噪声估计阶段,其性能取决于噪声估计的准确性。在本文中,我们提出了一种新的最小均方误差(MMSE)噪声估计器,它利用双通道噪声观测来避免使用干净的语音模型,同时保持简单的公式。该估计器的参数是通过unscented变换(UT)获得的,与经典的向量泰勒级数(VTS)线性化相比,该方法能够更有效地计算出质量更好的统计量。为了进行评估,考虑了双麦克风智能手机在近距离通话条件下的语音增强,这是一个特别感兴趣的应用。结果表明,我们的方法在估计精度等不同度量以及增强语音信号(即语音质量和可理解性)方面优于其他单通道和双通道噪声估计方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Unscented Transform-Based Dual-Channel Noise Estimation: Application to Speech Enhancement on Smartphones
Some speech processing approaches rely on a noise estimation stage and their performance depends on the accuracy of the noise estimates. In this paper we propose a novel minimum mean square error (MMSE) noise estimator that takes advantage of dual-channel noisy observations to avoid the use of a clean speech model while keeping a simple formulation. The parameters of this estimator are obtained through the unscented transform (UT), which is able to compute better quality statistics in a more efficient way than through classical vector Taylor series (VTS) linearization. For evaluation, speech enhancement on a dual-microphone smartphone in close-talk conditions is considered, which is a particular application of interest. Results show the superiority of our proposal with respect to other single-and dual-channel noise estimation methods in terms of different measures such as estimation accuracy as well as on the enhanced speech signal, i.e., speech quality and intelligibility.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信