{"title":"Towards non-uniform unit HMMs for speech recognition","authors":"T. Matsumura, S. Matsunaga","doi":"10.1109/IVTTA.1994.341538","DOIUrl":null,"url":null,"abstract":"A novel acoustic modeling algorithm that generates non-uniform unit HMMs to effectively cope with spectral variations in fluent speech is proposed. The algorithm is devised for the automatic iterative generation of long-span units for the non-uniform modeling. This generation algorithm is based on an entropy reduction criterion using text data and a maximum likelihood criterion using speech data. The effectiveness of the non-uniform models was confirmed by comparing likelihood values between the long-span unit HMMs and the conventional phoneme-unit HMMs. Preliminary results suggest that non-uniform unit HMMs achieve higher performance than phoneme-unit HMMs.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IVTTA.1994.341538","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
A novel acoustic modeling algorithm that generates non-uniform unit HMMs to effectively cope with spectral variations in fluent speech is proposed. The algorithm is devised for the automatic iterative generation of long-span units for the non-uniform modeling. This generation algorithm is based on an entropy reduction criterion using text data and a maximum likelihood criterion using speech data. The effectiveness of the non-uniform models was confirmed by comparing likelihood values between the long-span unit HMMs and the conventional phoneme-unit HMMs. Preliminary results suggest that non-uniform unit HMMs achieve higher performance than phoneme-unit HMMs.<>