{"title":"Kernel fitting for speech detection and enhancement","authors":"Benyong Liu, Jing Zhang, Xiang Liao","doi":"10.1109/ICOSP.2010.5656090","DOIUrl":null,"url":null,"abstract":"A kernel fitting algorithm is proposed for speech denoising to improve the precision of voice activity detection (VAD) and the performance of speech enhancement, of some popular algorithms. In the algorithm, a noisy speech frame is filtered by kernel fitting, and then its power spectral density is estimated and weighted by a gain factor constructed from frame energy and zero-crossing rate, so that a speech signal is obviously discriminated from a nonspeech one. By incorporation of the VAD outputs and the noise effect into the kernel fitting process, a speech frame is enhanced with better performance than the spectra subtraction algorithm. Experiments are taken on a real life speech signal plus simulated noises, and the results show the potentiality of the proposed algorithms in speech detection and enhancement.","PeriodicalId":281876,"journal":{"name":"IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.2010.5656090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
A kernel fitting algorithm is proposed for speech denoising to improve the precision of voice activity detection (VAD) and the performance of speech enhancement, of some popular algorithms. In the algorithm, a noisy speech frame is filtered by kernel fitting, and then its power spectral density is estimated and weighted by a gain factor constructed from frame energy and zero-crossing rate, so that a speech signal is obviously discriminated from a nonspeech one. By incorporation of the VAD outputs and the noise effect into the kernel fitting process, a speech frame is enhanced with better performance than the spectra subtraction algorithm. Experiments are taken on a real life speech signal plus simulated noises, and the results show the potentiality of the proposed algorithms in speech detection and enhancement.