Using Phase Space based processing to extract proper features for ASR systems

Y. Shekofteh, F. Almasganj
{"title":"Using Phase Space based processing to extract proper features for ASR systems","authors":"Y. Shekofteh, F. Almasganj","doi":"10.1109/ISTEL.2010.5734094","DOIUrl":null,"url":null,"abstract":"In this paper a feature extraction technique using Reconstructed Phase Spaces (RPS) is presented, which improves the overall performances of typical speech recognition systems. Unlike conventional feature extraction methods that use FFT based algorithm as power spectrum estimation (PSE) of speech signal, the proposed method is based on the trajectory and flow matrix of signal's RPS. In this manner, a new representation of power spectrum is obtained using two dimensional DFT algorithm by which, we can gain modify versions of common feature extraction methods such as MFCC. We conducted some speech recognition experiments using HTK, the known HMM-based toolkit, over FARSDAT, a known Persian speech corpus. Through this modified version of feature extraction method, we gained 1.35% word error rate improvement in comparison to the baseline system which exploits the typical MFCC feature extraction method.","PeriodicalId":306663,"journal":{"name":"2010 5th International Symposium on Telecommunications","volume":"1949 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 5th International Symposium on Telecommunications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISTEL.2010.5734094","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

In this paper a feature extraction technique using Reconstructed Phase Spaces (RPS) is presented, which improves the overall performances of typical speech recognition systems. Unlike conventional feature extraction methods that use FFT based algorithm as power spectrum estimation (PSE) of speech signal, the proposed method is based on the trajectory and flow matrix of signal's RPS. In this manner, a new representation of power spectrum is obtained using two dimensional DFT algorithm by which, we can gain modify versions of common feature extraction methods such as MFCC. We conducted some speech recognition experiments using HTK, the known HMM-based toolkit, over FARSDAT, a known Persian speech corpus. Through this modified version of feature extraction method, we gained 1.35% word error rate improvement in comparison to the baseline system which exploits the typical MFCC feature extraction method.
采用基于相空间的处理方法提取ASR系统的特征
本文提出了一种基于重构相空间(RPS)的特征提取技术,提高了典型语音识别系统的整体性能。与传统的基于FFT算法的语音信号功率谱估计(PSE)特征提取方法不同,该方法基于信号RPS的轨迹和流矩阵。在此基础上,利用二维DFT算法得到了一种新的功率谱表示形式,从而得到了MFCC等常用特征提取方法的改进版本。我们使用HTK(已知的基于hmm的工具包)在FARSDAT(已知的波斯语语音语料库)上进行了一些语音识别实验。通过改进的特征提取方法,与利用典型的MFCC特征提取方法的基线系统相比,我们的错误率提高了1.35%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信