Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-04 DOI:10.1109/ISCSLP.2012.6423522

Chen-Yu Yang, Georgina Brown, Liang Lu, J. Yamagishi, Simon King

引用次数: 28

Abstract

In this paper, we introduce a newly-created corpus of whispered speech simultaneously recorded via a close-talking microphone and a non-audible murmur (NAM) microphone in both clean and noisy conditions. To benchmark the corpus, which has been freely released recently, experiments on automatic recognition of continuous whispered speech were conducted. When training and test conditions are matched, the NAM microphone is found to be more robust against background noise than the close-talking microphone. In mismatched conditions (noisy data, models trained on clean speech), we found that Vector Taylor Series (VTS) compensation is particularly effective for the NAM signal.

查看原文本刊更多论文

噪声鲁棒低声语音识别使用一个无听杂音麦克风与VTS补偿

在本文中，我们介绍了一个新创建的耳语语料库，该语料库通过近距离交谈麦克风和非可听杂音(NAM)麦克风在清洁和嘈杂条件下同时录制。为了对最近免费发布的语料库进行基准测试，对连续低语语音进行了自动识别实验。当训练和测试条件相匹配时，发现NAM麦克风比近距离说话麦克风对背景噪声的鲁棒性更强。在不匹配的条件下(有噪声的数据，在干净语音上训练的模型)，我们发现向量泰勒级数(VTS)补偿对NAM信号特别有效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 8th International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量