Recognition of speaker-independent isolated Persian digits using an enhanced vector quantization algorithm

2015 Signal Processing and Intelligent Systems Conference (SPIS) Pub Date : 2015-12-01 DOI:10.1109/SPIS.2015.7422333

M. Jamali, Vahid Ghafarinia, M. A. Montazeri

{"title":"Recognition of speaker-independent isolated Persian digits using an enhanced vector quantization algorithm","authors":"M. Jamali, Vahid Ghafarinia, M. A. Montazeri","doi":"10.1109/SPIS.2015.7422333","DOIUrl":null,"url":null,"abstract":"Vector quantization (VQ) is a fast and simple classification algorithm that has been widely employed for the recognition of isolated spoken words. However, this algorithm and most of its improved versions fail to accurately distinguish words with similar vowels. The spoken pattern of digits/dow/ and/noh/ (2 and 9 respectively) in Persian is a good example for this type of similarity. In this paper we have proposed an enhanced vector quantization algorithm in which the deterministic role of the short consonants at the beginning of the words is taken into account. In this algorithm an unknown vector is judged based on the classification results of two set of codebooks. The first set of codebooks is constructed by the initial portions of the words while the other set is constructed by the whole words. The performance of the proposed algorithm was experimentally verified against other VQ-based algorithms. While the overall performance of the proposed algorithm was above the others, in the case of similar words it could remarkably decrease the number of misclassification. This improvement was achieved by only a small increase in the computational load.","PeriodicalId":424434,"journal":{"name":"2015 Signal Processing and Intelligent Systems Conference (SPIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Signal Processing and Intelligent Systems Conference (SPIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPIS.2015.7422333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Vector quantization (VQ) is a fast and simple classification algorithm that has been widely employed for the recognition of isolated spoken words. However, this algorithm and most of its improved versions fail to accurately distinguish words with similar vowels. The spoken pattern of digits/dow/ and/noh/ (2 and 9 respectively) in Persian is a good example for this type of similarity. In this paper we have proposed an enhanced vector quantization algorithm in which the deterministic role of the short consonants at the beginning of the words is taken into account. In this algorithm an unknown vector is judged based on the classification results of two set of codebooks. The first set of codebooks is constructed by the initial portions of the words while the other set is constructed by the whole words. The performance of the proposed algorithm was experimentally verified against other VQ-based algorithms. While the overall performance of the proposed algorithm was above the others, in the case of similar words it could remarkably decrease the number of misclassification. This improvement was achieved by only a small increase in the computational load.

查看原文本刊更多论文

使用增强的矢量量化算法识别与说话人无关的孤立波斯语数字

矢量量化(VQ)是一种快速、简单的分类算法，已被广泛应用于孤立口语单词的识别。然而，该算法及其大多数改进版本都无法准确区分元音相似的单词。波斯语中数字的发音模式/dow/和/noh/(分别为2和9)就是这种相似性的一个很好的例子。在本文中，我们提出了一种增强的矢量量化算法，其中考虑了单词开头短辅音的确定性作用。该算法根据两组码本的分类结果判断未知向量。第一组码本由单词的初始部分组成，而另一组则由整个单词组成。通过实验验证了该算法与其他基于vq的算法的性能。虽然该算法的整体性能优于其他算法，但在相似词的情况下，它可以显著减少错误分类的数量。这种改进是通过计算负载的小幅增加来实现的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 Signal Processing and Intelligent Systems Conference (SPIS)

自引率

0.00%

发文量