Noise Reduction Based Random Matrix Theory

Xugang Lu, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura
{"title":"Noise Reduction Based Random Matrix Theory","authors":"Xugang Lu, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura","doi":"10.1109/CHINSL.2008.ECP.83","DOIUrl":null,"url":null,"abstract":"In speech enhancement literature, the signal subspace based method gains a lot of attention because of its simplicity in analytical formulations. The original idea in this method is based on the assumption that clean speech signal occupies a certain low dimensional space, while the noise signal which is a white additive noise spread the whole observation space. In this method, accurate estimation of the noise power (or variance) is required. However, in real applications, the noise power can only be estimated with some degree of uncertainty. This uncertainty will degrade the signal subspace based speech enhancement algorithms, especially in heavy noisy situations since it does not take this uncertainty into consideration. In this study, we took the uncertainty of the estimation of noise power into consideration by using the statistical property of noise based on random matrix theory. The noise statistical property (eigenvalue distribution) was analytically formulated based on the maximum and minimum eigenvalues of the noise random matrix. Based on the statistical property of the eigenvalues of noise, we reduced the part contributed by noise from the covariance matrix of noisy speech. We tested our method for speech enhancement using AURORA-2J speech corpus. Our initial experiments showed that the proposed method performed better than the traditional signal subspace based speech enhancement method.","PeriodicalId":291958,"journal":{"name":"2008 6th International Symposium on Chinese Spoken Language Processing","volume":"103 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 6th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CHINSL.2008.ECP.83","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In speech enhancement literature, the signal subspace based method gains a lot of attention because of its simplicity in analytical formulations. The original idea in this method is based on the assumption that clean speech signal occupies a certain low dimensional space, while the noise signal which is a white additive noise spread the whole observation space. In this method, accurate estimation of the noise power (or variance) is required. However, in real applications, the noise power can only be estimated with some degree of uncertainty. This uncertainty will degrade the signal subspace based speech enhancement algorithms, especially in heavy noisy situations since it does not take this uncertainty into consideration. In this study, we took the uncertainty of the estimation of noise power into consideration by using the statistical property of noise based on random matrix theory. The noise statistical property (eigenvalue distribution) was analytically formulated based on the maximum and minimum eigenvalues of the noise random matrix. Based on the statistical property of the eigenvalues of noise, we reduced the part contributed by noise from the covariance matrix of noisy speech. We tested our method for speech enhancement using AURORA-2J speech corpus. Our initial experiments showed that the proposed method performed better than the traditional signal subspace based speech enhancement method.
基于随机矩阵理论的降噪
在语音增强文献中,基于信号子空间的语音增强方法因其解析公式简单而受到广泛关注。该方法的原始思想是假设干净的语音信号占据一定的低维空间,而噪声信号是白加性噪声,分布在整个观测空间。在这种方法中,需要准确估计噪声功率(或方差)。然而,在实际应用中,噪声功率只能有一定程度的不确定性来估计。这种不确定性会降低基于信号子空间的语音增强算法,特别是在重噪声情况下,因为它没有考虑到这种不确定性。在本研究中,我们基于随机矩阵理论,利用噪声的统计特性,考虑了噪声功率估计的不确定性。基于噪声随机矩阵的最大和最小特征值解析表达了噪声统计特性(特征值分布)。基于噪声特征值的统计特性,从含噪语音的协方差矩阵中剔除噪声所占的分量。我们使用AURORA-2J语音语料库测试了我们的语音增强方法。初步实验表明,该方法优于传统的基于信号子空间的语音增强方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信