{"title":"Supervised speech enhancement using compressed sensing","authors":"Pulkit Sharma, V. Abrol, A. Sao","doi":"10.1109/NCC.2015.7084919","DOIUrl":null,"url":null,"abstract":"Supervised approaches for speech enhancement require models to be learned for different noisy environments, which is a difficult criterion to meet in practical scenarios. In this paper, compressed sensing (CS) based supervised speech enhancement approach is proposed, where model (dictionary) for noise is derived from the noisy speech signal. It exploits the observation that unvoiced/silence regions of noisy speech signal will be predominantly noise and a method is proposed to measure the same, thus eliminating pre-training of noise model. The proposed method is particularly effective in scenarios where noise type is not known a priori. Experimental results validate that the proposed approach can be an alternative to the existing approaches for speech enhancement.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Twenty First National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2015.7084919","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Supervised approaches for speech enhancement require models to be learned for different noisy environments, which is a difficult criterion to meet in practical scenarios. In this paper, compressed sensing (CS) based supervised speech enhancement approach is proposed, where model (dictionary) for noise is derived from the noisy speech signal. It exploits the observation that unvoiced/silence regions of noisy speech signal will be predominantly noise and a method is proposed to measure the same, thus eliminating pre-training of noise model. The proposed method is particularly effective in scenarios where noise type is not known a priori. Experimental results validate that the proposed approach can be an alternative to the existing approaches for speech enhancement.