Nonlinear resampling transformation for automatic speech recognition

Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop Pub Date : 1991-09-30 DOI:10.1109/NNSP.1991.239510

Y.D. Liu, Y. Lee, H. Chen, G. Sun

引用次数: 4

Abstract

A new technique for speech signal processing called nonlinear resampling transformation (NRT) is proposed. The representation of a speech pattern derived from this technique has two important features: first, it reduces redundancy; second, it effectively removes the nonlinear variations of speech signals in time. The authors have applied NRT to the TI isolated-word database achieving a 99.66% recognition rate on a 10 digits multi-speaker task for a linear predictive neural net classifier. In their experiment, the authors have also found that discriminative training is superior to nondiscriminative training for linear predictive neural network classifiers.<>

查看原文本刊更多论文

自动语音识别的非线性重采样变换

提出了一种新的语音信号处理技术——非线性重采样变换。基于该技术的语音模式表示有两个重要特点:第一，它减少了冗余;其次，有效地消除了语音信号在时间上的非线性变化。作者将NRT应用于TI隔离词数据库，实现了线性预测神经网络分类器对10位多说话人任务的99.66%的识别率。在他们的实验中，作者还发现判别训练优于线性预测神经网络分类器的非判别训练。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop

自引率

0.00%

发文量