基于Unscented变换的双通道噪声估计:在智能手机语音增强中的应用

2018 41st International Conference on Telecommunications and Signal Processing (TSP) Pub Date : 2018-07-01 DOI:10.1109/TSP.2018.8441269

I. López-Espejo, J. M. Martín-Doñas, A. Gómez, A. Peinado

{"title":"基于Unscented变换的双通道噪声估计:在智能手机语音增强中的应用","authors":"I. López-Espejo, J. M. Martín-Doñas, A. Gómez, A. Peinado","doi":"10.1109/TSP.2018.8441269","DOIUrl":null,"url":null,"abstract":"Some speech processing approaches rely on a noise estimation stage and their performance depends on the accuracy of the noise estimates. In this paper we propose a novel minimum mean square error (MMSE) noise estimator that takes advantage of dual-channel noisy observations to avoid the use of a clean speech model while keeping a simple formulation. The parameters of this estimator are obtained through the unscented transform (UT), which is able to compute better quality statistics in a more efficient way than through classical vector Taylor series (VTS) linearization. For evaluation, speech enhancement on a dual-microphone smartphone in close-talk conditions is considered, which is a particular application of interest. Results show the superiority of our proposal with respect to other single-and dual-channel noise estimation methods in terms of different measures such as estimation accuracy as well as on the enhanced speech signal, i.e., speech quality and intelligibility.","PeriodicalId":383018,"journal":{"name":"2018 41st International Conference on Telecommunications and Signal Processing (TSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Unscented Transform-Based Dual-Channel Noise Estimation: Application to Speech Enhancement on Smartphones\",\"authors\":\"I. López-Espejo, J. M. Martín-Doñas, A. Gómez, A. Peinado\",\"doi\":\"10.1109/TSP.2018.8441269\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Some speech processing approaches rely on a noise estimation stage and their performance depends on the accuracy of the noise estimates. In this paper we propose a novel minimum mean square error (MMSE) noise estimator that takes advantage of dual-channel noisy observations to avoid the use of a clean speech model while keeping a simple formulation. The parameters of this estimator are obtained through the unscented transform (UT), which is able to compute better quality statistics in a more efficient way than through classical vector Taylor series (VTS) linearization. For evaluation, speech enhancement on a dual-microphone smartphone in close-talk conditions is considered, which is a particular application of interest. Results show the superiority of our proposal with respect to other single-and dual-channel noise estimation methods in terms of different measures such as estimation accuracy as well as on the enhanced speech signal, i.e., speech quality and intelligibility.\",\"PeriodicalId\":383018,\"journal\":{\"name\":\"2018 41st International Conference on Telecommunications and Signal Processing (TSP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 41st International Conference on Telecommunications and Signal Processing (TSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TSP.2018.8441269\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 41st International Conference on Telecommunications and Signal Processing (TSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSP.2018.8441269","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

一些语音处理方法依赖于噪声估计阶段，其性能取决于噪声估计的准确性。在本文中，我们提出了一种新的最小均方误差(MMSE)噪声估计器，它利用双通道噪声观测来避免使用干净的语音模型，同时保持简单的公式。该估计器的参数是通过unscented变换(UT)获得的，与经典的向量泰勒级数(VTS)线性化相比，该方法能够更有效地计算出质量更好的统计量。为了进行评估，考虑了双麦克风智能手机在近距离通话条件下的语音增强，这是一个特别感兴趣的应用。结果表明，我们的方法在估计精度等不同度量以及增强语音信号(即语音质量和可理解性)方面优于其他单通道和双通道噪声估计方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Unscented Transform-Based Dual-Channel Noise Estimation: Application to Speech Enhancement on Smartphones

Some speech processing approaches rely on a noise estimation stage and their performance depends on the accuracy of the noise estimates. In this paper we propose a novel minimum mean square error (MMSE) noise estimator that takes advantage of dual-channel noisy observations to avoid the use of a clean speech model while keeping a simple formulation. The parameters of this estimator are obtained through the unscented transform (UT), which is able to compute better quality statistics in a more efficient way than through classical vector Taylor series (VTS) linearization. For evaluation, speech enhancement on a dual-microphone smartphone in close-talk conditions is considered, which is a particular application of interest. Results show the superiority of our proposal with respect to other single-and dual-channel noise estimation methods in terms of different measures such as estimation accuracy as well as on the enhanced speech signal, i.e., speech quality and intelligibility.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 41st International Conference on Telecommunications and Signal Processing (TSP)

自引率

0.00%

发文量