{"title":"Recognition of speaker-independent isolated Persian digits using an enhanced vector quantization algorithm","authors":"M. Jamali, Vahid Ghafarinia, M. A. Montazeri","doi":"10.1109/SPIS.2015.7422333","DOIUrl":null,"url":null,"abstract":"Vector quantization (VQ) is a fast and simple classification algorithm that has been widely employed for the recognition of isolated spoken words. However, this algorithm and most of its improved versions fail to accurately distinguish words with similar vowels. The spoken pattern of digits/dow/ and/noh/ (2 and 9 respectively) in Persian is a good example for this type of similarity. In this paper we have proposed an enhanced vector quantization algorithm in which the deterministic role of the short consonants at the beginning of the words is taken into account. In this algorithm an unknown vector is judged based on the classification results of two set of codebooks. The first set of codebooks is constructed by the initial portions of the words while the other set is constructed by the whole words. The performance of the proposed algorithm was experimentally verified against other VQ-based algorithms. While the overall performance of the proposed algorithm was above the others, in the case of similar words it could remarkably decrease the number of misclassification. This improvement was achieved by only a small increase in the computational load.","PeriodicalId":424434,"journal":{"name":"2015 Signal Processing and Intelligent Systems Conference (SPIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Signal Processing and Intelligent Systems Conference (SPIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPIS.2015.7422333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Vector quantization (VQ) is a fast and simple classification algorithm that has been widely employed for the recognition of isolated spoken words. However, this algorithm and most of its improved versions fail to accurately distinguish words with similar vowels. The spoken pattern of digits/dow/ and/noh/ (2 and 9 respectively) in Persian is a good example for this type of similarity. In this paper we have proposed an enhanced vector quantization algorithm in which the deterministic role of the short consonants at the beginning of the words is taken into account. In this algorithm an unknown vector is judged based on the classification results of two set of codebooks. The first set of codebooks is constructed by the initial portions of the words while the other set is constructed by the whole words. The performance of the proposed algorithm was experimentally verified against other VQ-based algorithms. While the overall performance of the proposed algorithm was above the others, in the case of similar words it could remarkably decrease the number of misclassification. This improvement was achieved by only a small increase in the computational load.