Speech Enhancement in Noisy Environments for Video Retrieval

2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services Pub Date : 2008-05-07 DOI:10.1109/WIAMIS.2008.38

Huiyu Zhou, A. Sadka, Richard M. Jiang

Citations: 2

Abstract

In this paper, we propose a novel spectral subtraction approach for speech enhancement via maximum likelihood estimation (MLE). This scheme attempts to simulate the probability distribution of useful speech signals and hence maximally reduce the noise. To evaluate the quality of speech enhancement, we extract cepstral features from the enhanced signals and then apply them to a dynamic time warping framework to check the similarity between the clean and filtered signals. The performance of the proposed enhancement method is compared with that of other classical techniques. The entire framework does not assume any model for the background noise and does not require any noise training data.
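
The evaluation pipeline described in the abstract (enhance, extract cepstral features, compare by dynamic time warping) can be illustrated with a minimal sketch. This is not the authors' MLE-based method: it uses classical magnitude spectral subtraction as a stand-in, and, unlike the proposed framework, it assumes that the first few frames of the recording contain noise only. All function names, the frame size, the hop length, the spectral floor, and the number of cepstral coefficients are illustrative assumptions.

```python
import numpy as np

def frames(x, size=256, hop=128):
    """Split a 1-D signal into overlapping Hann-windowed frames."""
    n = 1 + (len(x) - size) // hop
    idx = np.arange(size)[None, :] + hop * np.arange(n)[:, None]
    return x[idx] * np.hanning(size)

def spectral_subtract(noisy, noise_frames=5, floor=0.01):
    """Classical magnitude spectral subtraction (NOT the paper's MLE variant).

    Assumption: the first `noise_frames` frames contain noise only.
    Returns the enhanced magnitude spectrogram, shape (n_frames, size // 2 + 1).
    """
    spec = np.abs(np.fft.rfft(frames(noisy), axis=1))
    noise_mag = spec[:noise_frames].mean(axis=0)
    # Subtract the noise estimate and keep a small spectral floor
    # so that magnitudes never become negative.
    return np.maximum(spec - noise_mag, floor * spec)

def cepstra(mag, n_coef=13):
    """Real cepstrum per frame: inverse FFT of the log magnitude spectrum."""
    return np.fft.irfft(np.log(mag + 1e-10), axis=1)[:, :n_coef]

def dtw_distance(a, b):
    """Dynamic time warping cost between two feature sequences (frames x dims)."""
    cost = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)
    acc = np.full((len(a) + 1, len(b) + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            acc[i, j] = cost[i - 1, j - 1] + min(
                acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1]
            )
    return acc[-1, -1]
```

Given a clean reference signal `clean` and its noisy version `noisy` (both hypothetical 1-D arrays), a similarity check along the lines of the abstract would compute `dtw_distance(cepstra(np.abs(np.fft.rfft(frames(clean), axis=1))), cepstra(spectral_subtract(noisy)))`; a smaller value indicates that the enhanced signal is closer to the clean one in the cepstral domain.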
