{"title":"Tied mixture modeling optimization for Korean-digit in the embedded ASR system","authors":"Kihyeon Kim, Hanseok Ko","doi":"10.1109/ISCE.2004.1376017","DOIUrl":null,"url":null,"abstract":"In the embedded Automatic Speech Recognition (ASR) system, Semi-Contimrorrs Hidrlen Markov Model (SCHMM) or Tierf-Mi.rtirre (TM) model is one of tlie most promisirig acoustic modeling metliods that solve the size problem of the existing Continirons Hirldcri Markov Model (CHMM) while minimizirig the recognition peifiinnancr rlegra(iation. Moreover. f o r a geiierul isolated n,ord task, coiite.rt rlepenrlent nior1el.v sirch us tri-phones are nsed to guarantee high recognition performance of the embedded sy tem. However. tu nse the models constructed only in this way alone cannot be siifJicienr to render improved recognition rate in Korean-digit speech task 4 w r e a lurge niirtrral similarin e.rists. Hence. w e consfrnct new deilicated HMM ' S f o r all or parts of Korean-digit that has exclusive srafes using the same Gaussian pool of previoirs tri-phone mode1.s. This remedial actiori allows rlie strncture qf entire HMM.s maintained while minimizing the occupied memory space. Representative esperiments are rrpecred to reduce worderror-rate on the Korean-digit task by about 56% in enniporison with using only general rr-plione models. ' Mixture Model, Embedded ASR System. Index Terms Exclusive HMM's, Korean Digits, Tied","PeriodicalId":169376,"journal":{"name":"IEEE International Symposium on Consumer Electronics, 2004","volume":"79 11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International Symposium on Consumer Electronics, 2004","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCE.2004.1376017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In the embedded Automatic Speech Recognition (ASR) system, Semi-Contimrorrs Hidrlen Markov Model (SCHMM) or Tierf-Mi.rtirre (TM) model is one of tlie most promisirig acoustic modeling metliods that solve the size problem of the existing Continirons Hirldcri Markov Model (CHMM) while minimizirig the recognition peifiinnancr rlegra(iation. Moreover. f o r a geiierul isolated n,ord task, coiite.rt rlepenrlent nior1el.v sirch us tri-phones are nsed to guarantee high recognition performance of the embedded sy tem. However. tu nse the models constructed only in this way alone cannot be siifJicienr to render improved recognition rate in Korean-digit speech task 4 w r e a lurge niirtrral similarin e.rists. Hence. w e consfrnct new deilicated HMM ' S f o r all or parts of Korean-digit that has exclusive srafes using the same Gaussian pool of previoirs tri-phone mode1.s. This remedial actiori allows rlie strncture qf entire HMM.s maintained while minimizing the occupied memory space. Representative esperiments are rrpecred to reduce worderror-rate on the Korean-digit task by about 56% in enniporison with using only general rr-plione models. ' Mixture Model, Embedded ASR System. Index Terms Exclusive HMM's, Korean Digits, Tied