Multi-resolution Speech Recognition Based on Hyper-rectangular Fuzzy System

Chih-Hsu Hsu
{"title":"Multi-resolution Speech Recognition Based on Hyper-rectangular Fuzzy System","authors":"Chih-Hsu Hsu","doi":"10.51505/ijaemr.2022.7604","DOIUrl":null,"url":null,"abstract":"This paper presents a multi-resolution feature extraction technique to speech recognition. The proposed multi-resolution feature extraction technique uses wavelet transform and wavelet packet to calculate features of each sub-band in order not to spread noise distortions over the entire feature space. In our previous works, we had developed a method for speech classification. For speech classification, the universe of discourse is divided into many types, and each type is treated as a class. The hyper-rectangular fuzzy system is used to classify frames and integrate the rule-based approach. The variances of each sub-band are utilized to extract both crisp and fuzzy classification rules. In our experiments, the Texas Instruments/Massachusetts Institute of Technology database is used and extracts features of phonemes. The results demonstrate the superior performance to Mel frequency cepstral coefficients. The effectiveness of the proposed system is encouraging.","PeriodicalId":354718,"journal":{"name":"International Journal of Advanced Engineering and Management Research","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Advanced Engineering and Management Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.51505/ijaemr.2022.7604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper presents a multi-resolution feature extraction technique to speech recognition. The proposed multi-resolution feature extraction technique uses wavelet transform and wavelet packet to calculate features of each sub-band in order not to spread noise distortions over the entire feature space. In our previous works, we had developed a method for speech classification. For speech classification, the universe of discourse is divided into many types, and each type is treated as a class. The hyper-rectangular fuzzy system is used to classify frames and integrate the rule-based approach. The variances of each sub-band are utilized to extract both crisp and fuzzy classification rules. In our experiments, the Texas Instruments/Massachusetts Institute of Technology database is used and extracts features of phonemes. The results demonstrate the superior performance to Mel frequency cepstral coefficients. The effectiveness of the proposed system is encouraging.
基于超矩形模糊系统的多分辨率语音识别
提出了一种用于语音识别的多分辨率特征提取技术。本文提出的多分辨率特征提取技术利用小波变换和小波包计算每个子带的特征,避免了噪声在整个特征空间上的传播。在我们之前的工作中,我们已经开发了一种语音分类方法。在言语分类中,话语世界被划分为许多类型,每种类型被视为一个类。采用超矩形模糊系统对框架进行分类,并与基于规则的方法相结合。利用每个子带的方差提取清晰和模糊的分类规则。在我们的实验中,使用了德州仪器/麻省理工学院的数据库,提取音素特征。结果表明,该方法比Mel倒谱系数具有更好的性能。拟议制度的有效性令人鼓舞。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信