{"title":"Robust feature extraction from spectrum estimated using bispectrum for Isolated Word Recognition","authors":"N. S. Nehe, P. Ajmera, D. Jadhav, R. S. Holambe","doi":"10.1109/INDCON.2011.6139389","DOIUrl":null,"url":null,"abstract":"Extraction of robust features from noisy speech signals is one of the challenging problems in Automatic Speech Recognition (ASR). For Gaussian process, its bispectrum and all higher order spectra are identically zero, which means that bispectrum removes the additive white Gaussian noise while preserving the magnitude and phase information of original signal. Using this bispectrum property, spectrum of original signal can be recovered from its noisy version. Robust Mel Frequency Cepstral Coefficients (MFCC) are extracted from the estimated spectral magnitude (denoted as Bispectral-MFCC (BMFCC)). The effectiveness of BMFCC has been tested on TI-46 isolated word database in noisy (additive white Gaussian) environment. The experimental results show the superiority of the proposed technique over conventional methods for Isolated Word Recognition (IWR).","PeriodicalId":425080,"journal":{"name":"2011 Annual IEEE India Conference","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Annual IEEE India Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDCON.2011.6139389","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Extraction of robust features from noisy speech signals is one of the challenging problems in Automatic Speech Recognition (ASR). For Gaussian process, its bispectrum and all higher order spectra are identically zero, which means that bispectrum removes the additive white Gaussian noise while preserving the magnitude and phase information of original signal. Using this bispectrum property, spectrum of original signal can be recovered from its noisy version. Robust Mel Frequency Cepstral Coefficients (MFCC) are extracted from the estimated spectral magnitude (denoted as Bispectral-MFCC (BMFCC)). The effectiveness of BMFCC has been tested on TI-46 isolated word database in noisy (additive white Gaussian) environment. The experimental results show the superiority of the proposed technique over conventional methods for Isolated Word Recognition (IWR).