{"title":"Voice waveform vector quantization using a competitive algorithm","authors":"R.M.V. Franca, B. Neto","doi":"10.1109/GLOCOM.1994.512719","DOIUrl":null,"url":null,"abstract":"A competitive algorithm is used to train dictionaries for voice waveform vector quantization with a phonetically balanced group of sentences as training sequence. The algorithm follows the standard unsupervised competitive rule used in training neural networks and it is suited to most distortion measures and to any practical dimension. An investigation is carried out to find the best range for the algorithm's parameters and its performance is compared to the results obtained when using the LBG algorithm with the same input data. The testing sequence is another phonetically balanced group of sentences uttered by different speakers.","PeriodicalId":323626,"journal":{"name":"1994 IEEE GLOBECOM. Communications: The Global Bridge","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"1994 IEEE GLOBECOM. Communications: The Global Bridge","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GLOCOM.1994.512719","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
A competitive algorithm is used to train dictionaries for voice waveform vector quantization with a phonetically balanced group of sentences as training sequence. The algorithm follows the standard unsupervised competitive rule used in training neural networks and it is suited to most distortion measures and to any practical dimension. An investigation is carried out to find the best range for the algorithm's parameters and its performance is compared to the results obtained when using the LBG algorithm with the same input data. The testing sequence is another phonetically balanced group of sentences uttered by different speakers.