Noise Reduction Based Random Matrix Theory

2008 6th International Symposium on Chinese Spoken Language Processing Pub Date : 2008-12-30 DOI:10.1109/CHINSL.2008.ECP.83

Xugang Lu, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura

{"title":"Noise Reduction Based Random Matrix Theory","authors":"Xugang Lu, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura","doi":"10.1109/CHINSL.2008.ECP.83","DOIUrl":null,"url":null,"abstract":"In speech enhancement literature, the signal subspace based method gains a lot of attention because of its simplicity in analytical formulations. The original idea in this method is based on the assumption that clean speech signal occupies a certain low dimensional space, while the noise signal which is a white additive noise spread the whole observation space. In this method, accurate estimation of the noise power (or variance) is required. However, in real applications, the noise power can only be estimated with some degree of uncertainty. This uncertainty will degrade the signal subspace based speech enhancement algorithms, especially in heavy noisy situations since it does not take this uncertainty into consideration. In this study, we took the uncertainty of the estimation of noise power into consideration by using the statistical property of noise based on random matrix theory. The noise statistical property (eigenvalue distribution) was analytically formulated based on the maximum and minimum eigenvalues of the noise random matrix. Based on the statistical property of the eigenvalues of noise, we reduced the part contributed by noise from the covariance matrix of noisy speech. We tested our method for speech enhancement using AURORA-2J speech corpus. Our initial experiments showed that the proposed method performed better than the traditional signal subspace based speech enhancement method.","PeriodicalId":291958,"journal":{"name":"2008 6th International Symposium on Chinese Spoken Language Processing","volume":"103 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 6th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CHINSL.2008.ECP.83","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In speech enhancement literature, the signal subspace based method gains a lot of attention because of its simplicity in analytical formulations. The original idea in this method is based on the assumption that clean speech signal occupies a certain low dimensional space, while the noise signal which is a white additive noise spread the whole observation space. In this method, accurate estimation of the noise power (or variance) is required. However, in real applications, the noise power can only be estimated with some degree of uncertainty. This uncertainty will degrade the signal subspace based speech enhancement algorithms, especially in heavy noisy situations since it does not take this uncertainty into consideration. In this study, we took the uncertainty of the estimation of noise power into consideration by using the statistical property of noise based on random matrix theory. The noise statistical property (eigenvalue distribution) was analytically formulated based on the maximum and minimum eigenvalues of the noise random matrix. Based on the statistical property of the eigenvalues of noise, we reduced the part contributed by noise from the covariance matrix of noisy speech. We tested our method for speech enhancement using AURORA-2J speech corpus. Our initial experiments showed that the proposed method performed better than the traditional signal subspace based speech enhancement method.

查看原文本刊更多论文

基于随机矩阵理论的降噪

在语音增强文献中，基于信号子空间的语音增强方法因其解析公式简单而受到广泛关注。该方法的原始思想是假设干净的语音信号占据一定的低维空间，而噪声信号是白加性噪声，分布在整个观测空间。在这种方法中，需要准确估计噪声功率(或方差)。然而，在实际应用中，噪声功率只能有一定程度的不确定性来估计。这种不确定性会降低基于信号子空间的语音增强算法，特别是在重噪声情况下，因为它没有考虑到这种不确定性。在本研究中，我们基于随机矩阵理论，利用噪声的统计特性，考虑了噪声功率估计的不确定性。基于噪声随机矩阵的最大和最小特征值解析表达了噪声统计特性(特征值分布)。基于噪声特征值的统计特性，从含噪语音的协方差矩阵中剔除噪声所占的分量。我们使用AURORA-2J语音语料库测试了我们的语音增强方法。初步实验表明，该方法优于传统的基于信号子空间的语音增强方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 6th International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量