Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition

2008 IEEE Region 10 and the Third international Conference on Industrial and Information Systems Pub Date : 2008-12-01 DOI:10.1109/ICIINFS.2008.4798363

N. S. Nehe, R. S. Holambe

引用次数: 3

Abstract

This paper presents the use of auditory perception based admissible wavelet packet tree (WPT) for partitioning of speech frequencies into different bands based on the Mel scale or the Bark Scale. The proposed WPTs selected using root mean square error (RMSE) criterion mimic the Mel scale or the bark scale more accurately and hence the human auditory system. Performance of the features obtained from the proposed WPTs is compared with Mel frequency cepstral coefficients (MFCC). The algorithms are evaluated using NIST TI-46 isolated-word database using hidden Markov model (HMM) as a classifier. Experimental results show that the performance of proposed features is better than MFCC and other wavelet features for isolated word recognition (IWR).

查看原文本刊更多论文

基于听觉感知的可接受小波包树语音识别

本文提出了基于听觉感知的可接受小波包树(WPT)在Mel尺度或Bark尺度的基础上将语音频率划分为不同的频带。使用均方根误差(RMSE)标准选择的WPTs更准确地模拟了Mel量表或bark量表，从而更准确地模拟了人类听觉系统。从所提出的WPTs中获得的特征性能与Mel频率倒谱系数(MFCC)进行了比较。使用NIST TI-46孤立词数据库，使用隐马尔可夫模型(HMM)作为分类器对算法进行评估。实验结果表明，所提特征在孤立词识别方面的性能优于MFCC和其他小波特征。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 IEEE Region 10 and the Third international Conference on Industrial and Information Systems

自引率

0.00%

发文量