An improved parallel model combination method for noisy speech recognition

2009 IEEE Workshop on Automatic Speech Recognition & Understanding Pub Date : 2009-12-01 DOI:10.1109/ASRU.2009.5373332

H. Veisi, H. Sameti

{"title":"An improved parallel model combination method for noisy speech recognition","authors":"H. Veisi, H. Sameti","doi":"10.1109/ASRU.2009.5373332","DOIUrl":null,"url":null,"abstract":"In this paper a novel method, called PC-PMC, is proposed to improve the performance of automatic speech recognition systems in noisy environments. This method is based on the parallel model combination (PMC) technique and uses the Cepstral Mean Subtraction (CMS) normalization ability and Principal Component Analysis (PCA) compression and de-correlation capabilities. It takes the advantages of both additive noise compensation of PMC and convolutive noise removal ability of CMS and PCA. The first problem to be solved in the realizing of PC-PMC is that PMC algorithm requires invertible modules in the front-end of the system while CMS normalization is not an invertible process. Also, it is required to design a framework for adaptation of the PCA transform in the presence of noise. The method proposed in this paper provides solutions to the both problems. Our evaluations are done on four different real noisy tasks using Nevisa Persian continuous speech recognition system. Experimental results demonstrate significant reduction in word error rate using PC-PMC in comparison with the standard robustness methods.","PeriodicalId":292194,"journal":{"name":"2009 IEEE Workshop on Automatic Speech Recognition & Understanding","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE Workshop on Automatic Speech Recognition & Understanding","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASRU.2009.5373332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

In this paper a novel method, called PC-PMC, is proposed to improve the performance of automatic speech recognition systems in noisy environments. This method is based on the parallel model combination (PMC) technique and uses the Cepstral Mean Subtraction (CMS) normalization ability and Principal Component Analysis (PCA) compression and de-correlation capabilities. It takes the advantages of both additive noise compensation of PMC and convolutive noise removal ability of CMS and PCA. The first problem to be solved in the realizing of PC-PMC is that PMC algorithm requires invertible modules in the front-end of the system while CMS normalization is not an invertible process. Also, it is required to design a framework for adaptation of the PCA transform in the presence of noise. The method proposed in this paper provides solutions to the both problems. Our evaluations are done on four different real noisy tasks using Nevisa Persian continuous speech recognition system. Experimental results demonstrate significant reduction in word error rate using PC-PMC in comparison with the standard robustness methods.

查看原文本刊更多论文

一种改进的并行模型组合方法用于噪声语音识别

为了提高语音识别系统在噪声环境下的性能，本文提出了一种新的方法——PC-PMC。该方法基于并行模型组合(PMC)技术，利用倒谱均值减法(CMS)的归一化能力和主成分分析(PCA)的压缩和去相关能力。它既具有PMC的加性噪声补偿能力，又具有CMS和PCA的卷积去噪能力。PC-PMC的实现首先要解决的问题是PMC算法需要系统前端的可逆模块，而CMS归一化并不是一个可逆过程。此外，还需要设计一个框架来适应存在噪声的PCA变换。本文提出的方法解决了这两个问题。我们使用Nevisa波斯语连续语音识别系统对四种不同的真实噪声任务进行了评估。实验结果表明，与标准鲁棒性方法相比，PC-PMC方法显著降低了单词错误率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE Workshop on Automatic Speech Recognition & Understanding

自引率

0.00%

发文量