基于多字典卷积非负矩阵分解的语音增强

Shan He, Jiawen Wu, Lin Li
{"title":"基于多字典卷积非负矩阵分解的语音增强","authors":"Shan He, Jiawen Wu, Lin Li","doi":"10.1109/ICASID.2015.7405671","DOIUrl":null,"url":null,"abstract":"We introduce an effective framework for speech enhancement using convolutive nonnegative matrix factorization (CNMF) with no a prior knowledge of the clean speech or noise types. On the assumption that the speech signal is irrelevant to noisy signal, CNMF would attenuate the background noise from noisy speech with a mix dictionary which consists of speech bases and noise bases. To be practical, in training phrase, a universal speech dictionary is trained using clean speech utterances irrelevant to noisy speech, and noise dictionary is well-prepared on different kinds of noise signal. Then in the enhancement process only the speech bases are updated according to noisy utterances using Kullback-Leibler divergence (KLD). The experiments were implemented on the NOIZUES library with babble, pink and white noisy speech utterances in a wide range of input SNR from 0dB to 15dB. The results show that the proposed framework achieves better perceptual evaluation of speech quality (PESQ) scores and output SNR compared to other conventional methods.","PeriodicalId":403184,"journal":{"name":"2015 IEEE 9th International Conference on Anti-counterfeiting, Security, and Identification (ASID)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speech enhancement using convolutive non-negative matrix factorization with multiple dictionaries\",\"authors\":\"Shan He, Jiawen Wu, Lin Li\",\"doi\":\"10.1109/ICASID.2015.7405671\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We introduce an effective framework for speech enhancement using convolutive nonnegative matrix factorization (CNMF) with no a prior knowledge of the clean speech or noise types. On the assumption that the speech signal is irrelevant to noisy signal, CNMF would attenuate the background noise from noisy speech with a mix dictionary which consists of speech bases and noise bases. To be practical, in training phrase, a universal speech dictionary is trained using clean speech utterances irrelevant to noisy speech, and noise dictionary is well-prepared on different kinds of noise signal. Then in the enhancement process only the speech bases are updated according to noisy utterances using Kullback-Leibler divergence (KLD). The experiments were implemented on the NOIZUES library with babble, pink and white noisy speech utterances in a wide range of input SNR from 0dB to 15dB. The results show that the proposed framework achieves better perceptual evaluation of speech quality (PESQ) scores and output SNR compared to other conventional methods.\",\"PeriodicalId\":403184,\"journal\":{\"name\":\"2015 IEEE 9th International Conference on Anti-counterfeiting, Security, and Identification (ASID)\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE 9th International Conference on Anti-counterfeiting, Security, and Identification (ASID)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASID.2015.7405671\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 9th International Conference on Anti-counterfeiting, Security, and Identification (ASID)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASID.2015.7405671","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

我们引入了一个有效的框架,使用卷积非负矩阵分解(CNMF)来增强语音,而不需要事先了解干净的语音或噪声类型。在假设语音信号与噪声信号无关的前提下,CNMF利用由语音基和噪声基组成的混合字典来衰减噪声语音的背景噪声。在训练短语中,使用与噪声无关的干净语音进行通用语音词典的训练,并针对不同类型的噪声信号编制噪声词典。然后在增强过程中,仅根据有噪声的话语使用Kullback-Leibler散度(KLD)更新语音基。实验在NOIZUES库上进行,在0 ~ 15dB的宽输入信噪比范围内,含杂音、粉噪和白噪语音。结果表明,与其他传统方法相比,该框架可以更好地感知语音质量(PESQ)分数和输出信噪比。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Speech enhancement using convolutive non-negative matrix factorization with multiple dictionaries
We introduce an effective framework for speech enhancement using convolutive nonnegative matrix factorization (CNMF) with no a prior knowledge of the clean speech or noise types. On the assumption that the speech signal is irrelevant to noisy signal, CNMF would attenuate the background noise from noisy speech with a mix dictionary which consists of speech bases and noise bases. To be practical, in training phrase, a universal speech dictionary is trained using clean speech utterances irrelevant to noisy speech, and noise dictionary is well-prepared on different kinds of noise signal. Then in the enhancement process only the speech bases are updated according to noisy utterances using Kullback-Leibler divergence (KLD). The experiments were implemented on the NOIZUES library with babble, pink and white noisy speech utterances in a wide range of input SNR from 0dB to 15dB. The results show that the proposed framework achieves better perceptual evaluation of speech quality (PESQ) scores and output SNR compared to other conventional methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信