Phone recognition experiments with 2D-DCT spectro-temporal features

2011 6th IEEE International Symposium on Applied Computational Intelligence and Informatics (SACI) Pub Date : 2011-05-19 DOI:10.1109/SACI.2011.5872988

György Kovács, L. Tóth

引用次数: 10

Abstract

Localized spectro-temporal analysis is a novel feature extraction strategy in speech recognition, which was inspired by neurophysiological findings. Here we perform phone recognition experiments on features that are extracted from the patches of the critical-band log-energy spectrum by applying the two-dimensional cosine trans-form. We find that in phone recognition experiments the proposed feature set yields results similar to the standard MFCC features under clean conditions, while it provides a significantly smaller performance degradation in noisy conditions. Moreover, we show that the new and the standard features can be readily combined to improve the recognition accuracy still further.

查看原文本刊更多论文

基于2D-DCT光谱-时间特征的手机识别实验

局部光谱时间分析是一种新的语音识别特征提取策略，受到神经生理学研究成果的启发。在这里，我们通过应用二维余弦变换对从关键波段对数能谱斑块中提取的特征进行手机识别实验。我们发现，在手机识别实验中，所提出的特征集在清洁条件下产生的结果与标准MFCC特征相似，而在噪声条件下提供的性能下降明显较小。此外，我们还表明，新特征和标准特征可以很容易地结合起来，进一步提高识别精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 6th IEEE International Symposium on Applied Computational Intelligence and Informatics (SACI)

自引率

0.00%

发文量