语音增强的正交匹配追踪算法

Hongwu Yang, Dongliang Hao, Hongying Sun, Yitong Liu
{"title":"语音增强的正交匹配追踪算法","authors":"Hongwu Yang, Dongliang Hao, Hongying Sun, Yitong Liu","doi":"10.1109/ICOT.2014.6956609","DOIUrl":null,"url":null,"abstract":"This paper proposes a novel approach for speech enhancement based on compressed sensing (CS) theory. Each frame of noisy speech signal is sparsified firstly by using discrete cosine transform (DCT). Then we divide each frame into the noisy sub-frame and the clear sub-frame with a soft thresholding method to obtain the threholded DCT coefficients of the noisy sub-frames. After that, the partial Hadamard ensemble is used as a sensing matrix to achieve compressive measurement of the DCT coefficients of noisy sub-frame. Finally, We use the orthogonal matching pursuit in order to recover the de-noised speech signal from noisy sub-frame. Both objective and subjective experiments are employed to compare the proposed approach with the subspace method and the spectral subtraction method. Experimental results shows that proposed method outperforms other methods with the highest PESQ, ABX and MOS score for Gaussian white noise and most kinds of colour noise.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Speech enhancement using orthogonal matching pursuit algorithm\",\"authors\":\"Hongwu Yang, Dongliang Hao, Hongying Sun, Yitong Liu\",\"doi\":\"10.1109/ICOT.2014.6956609\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a novel approach for speech enhancement based on compressed sensing (CS) theory. Each frame of noisy speech signal is sparsified firstly by using discrete cosine transform (DCT). Then we divide each frame into the noisy sub-frame and the clear sub-frame with a soft thresholding method to obtain the threholded DCT coefficients of the noisy sub-frames. After that, the partial Hadamard ensemble is used as a sensing matrix to achieve compressive measurement of the DCT coefficients of noisy sub-frame. Finally, We use the orthogonal matching pursuit in order to recover the de-noised speech signal from noisy sub-frame. Both objective and subjective experiments are employed to compare the proposed approach with the subspace method and the spectral subtraction method. Experimental results shows that proposed method outperforms other methods with the highest PESQ, ABX and MOS score for Gaussian white noise and most kinds of colour noise.\",\"PeriodicalId\":343641,\"journal\":{\"name\":\"2014 International Conference on Orange Technologies\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Orange Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOT.2014.6956609\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Orange Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOT.2014.6956609","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

提出了一种基于压缩感知(CS)理论的语音增强方法。首先利用离散余弦变换(DCT)对含噪语音信号进行稀疏化处理。然后用软阈值法将每帧图像分成有噪子帧和无噪子帧,得到有噪子帧的阈值DCT系数。然后,利用部分Hadamard系综作为感知矩阵,实现对噪声子帧DCT系数的压缩测量。最后,利用正交匹配追踪从噪声子帧中恢复去噪语音信号。通过客观实验和主观实验,将该方法与子空间法和谱减法进行了比较。实验结果表明,该方法对高斯白噪声和大多数彩色噪声具有最高的PESQ、ABX和MOS分数,优于其他方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Speech enhancement using orthogonal matching pursuit algorithm
This paper proposes a novel approach for speech enhancement based on compressed sensing (CS) theory. Each frame of noisy speech signal is sparsified firstly by using discrete cosine transform (DCT). Then we divide each frame into the noisy sub-frame and the clear sub-frame with a soft thresholding method to obtain the threholded DCT coefficients of the noisy sub-frames. After that, the partial Hadamard ensemble is used as a sensing matrix to achieve compressive measurement of the DCT coefficients of noisy sub-frame. Finally, We use the orthogonal matching pursuit in order to recover the de-noised speech signal from noisy sub-frame. Both objective and subjective experiments are employed to compare the proposed approach with the subspace method and the spectral subtraction method. Experimental results shows that proposed method outperforms other methods with the highest PESQ, ABX and MOS score for Gaussian white noise and most kinds of colour noise.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信