Kernel fitting for speech detection and enhancement

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS. Pub Date: 2010-12-03. DOI: 10.1109/ICOSP.2010.5656090

Benyong Liu, Jing Zhang, Xiang Liao


Citations: 2

Abstract


Kernel fitting for speech detection and enhancement

A kernel fitting algorithm is proposed for speech denoising, with the aim of improving the precision of voice activity detection (VAD) and the speech enhancement performance of some popular algorithms. A noisy speech frame is first filtered by kernel fitting; its power spectral density is then estimated and weighted by a gain factor constructed from the frame energy and zero-crossing rate, so that speech frames are clearly discriminated from nonspeech frames. By incorporating the VAD outputs and the noise effect into the kernel fitting process, a speech frame is enhanced with better performance than the spectral subtraction algorithm achieves. Experiments are conducted on a real-life speech signal with simulated noise added, and the results show the potential of the proposed algorithms for speech detection and enhancement.
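The abstract names frame energy and zero-crossing rate as the two ingredients of the gain factor but does not give the formula. The sketch below only illustrates how those two features are commonly computed and why they separate speech-like from noise-like frames. The function `hypothetical_gain`, the 8 kHz sampling rate, the 256-sample frame length, and the synthetic test frames are all assumptions made for illustration, not the authors' method.

```python
import math


def frame_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def hypothetical_gain(frame):
    """Illustrative weight only (NOT the paper's formula):
    large for energetic frames with few zero crossings."""
    return frame_energy(frame) * (1.0 - zero_crossing_rate(frame))


# Synthetic frames (assumed values): 8 kHz sampling, 256 samples per frame.
FS, N = 8000, 256

# A 200 Hz sinusoid stands in for a voiced speech frame.
voiced_like = [0.5 * math.sin(2 * math.pi * 200 * n / FS) for n in range(N)]

# A low-level signal that flips sign every sample stands in for noise.
noise_like = [0.05 * (1 if n % 2 == 0 else -1) for n in range(N)]

for name, frame in (("voiced-like", voiced_like), ("noise-like", noise_like)):
    print(name, frame_energy(frame), zero_crossing_rate(frame),
          hypothetical_gain(frame))
```

In this toy setup the voiced-like frame has high energy and a low zero-crossing rate, while the noise-like frame has low energy and a high rate, so any weight that grows with energy and shrinks with zero-crossing rate pushes the two apart. The actual gain factor in the paper is applied to the estimated power spectral density and may take a different form.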
