Recognition of speaker-independent isolated Persian digits using an enhanced vector quantization algorithm

M. Jamali, Vahid Ghafarinia, M. A. Montazeri
{"title":"Recognition of speaker-independent isolated Persian digits using an enhanced vector quantization algorithm","authors":"M. Jamali, Vahid Ghafarinia, M. A. Montazeri","doi":"10.1109/SPIS.2015.7422333","DOIUrl":null,"url":null,"abstract":"Vector quantization (VQ) is a fast and simple classification algorithm that has been widely employed for the recognition of isolated spoken words. However, this algorithm and most of its improved versions fail to accurately distinguish words with similar vowels. The spoken pattern of digits/dow/ and/noh/ (2 and 9 respectively) in Persian is a good example for this type of similarity. In this paper we have proposed an enhanced vector quantization algorithm in which the deterministic role of the short consonants at the beginning of the words is taken into account. In this algorithm an unknown vector is judged based on the classification results of two set of codebooks. The first set of codebooks is constructed by the initial portions of the words while the other set is constructed by the whole words. The performance of the proposed algorithm was experimentally verified against other VQ-based algorithms. While the overall performance of the proposed algorithm was above the others, in the case of similar words it could remarkably decrease the number of misclassification. This improvement was achieved by only a small increase in the computational load.","PeriodicalId":424434,"journal":{"name":"2015 Signal Processing and Intelligent Systems Conference (SPIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Signal Processing and Intelligent Systems Conference (SPIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPIS.2015.7422333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Vector quantization (VQ) is a fast and simple classification algorithm that has been widely employed for the recognition of isolated spoken words. However, this algorithm and most of its improved versions fail to accurately distinguish words with similar vowels. The spoken pattern of digits/dow/ and/noh/ (2 and 9 respectively) in Persian is a good example for this type of similarity. In this paper we have proposed an enhanced vector quantization algorithm in which the deterministic role of the short consonants at the beginning of the words is taken into account. In this algorithm an unknown vector is judged based on the classification results of two set of codebooks. The first set of codebooks is constructed by the initial portions of the words while the other set is constructed by the whole words. The performance of the proposed algorithm was experimentally verified against other VQ-based algorithms. While the overall performance of the proposed algorithm was above the others, in the case of similar words it could remarkably decrease the number of misclassification. This improvement was achieved by only a small increase in the computational load.
使用增强的矢量量化算法识别与说话人无关的孤立波斯语数字
矢量量化(VQ)是一种快速、简单的分类算法,已被广泛应用于孤立口语单词的识别。然而,该算法及其大多数改进版本都无法准确区分元音相似的单词。波斯语中数字的发音模式/dow/和/noh/(分别为2和9)就是这种相似性的一个很好的例子。在本文中,我们提出了一种增强的矢量量化算法,其中考虑了单词开头短辅音的确定性作用。该算法根据两组码本的分类结果判断未知向量。第一组码本由单词的初始部分组成,而另一组则由整个单词组成。通过实验验证了该算法与其他基于vq的算法的性能。虽然该算法的整体性能优于其他算法,但在相似词的情况下,它可以显著减少错误分类的数量。这种改进是通过计算负载的小幅增加来实现的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信