{"title":"Non-uniform unit parsing for SSS-LR continuous speech recognition","authors":"H. Singer, J. Takami, S. Matsunaga","doi":"10.1109/ICASSP.1994.389697","DOIUrl":null,"url":null,"abstract":"We describe recent improvements in ATR's experimental speech recognition system ATREUS, which serves as a recognition font end for the speech translation system ASURA. Our next goal is spontaneous speech translation. To constrain the potentially huge search space, better prosodic control, better probabilistic language models and better acoustic models are proposed. The SSS-LR parser was modified to work with non-uniform unit type acoustic and duration models. Experimental results showed, that, for example, use of mora trigram probabilities improved the phrase error rate from 17% to 14%.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"126 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1994.389697","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
We describe recent improvements in ATR's experimental speech recognition system ATREUS, which serves as a recognition font end for the speech translation system ASURA. Our next goal is spontaneous speech translation. To constrain the potentially huge search space, better prosodic control, better probabilistic language models and better acoustic models are proposed. The SSS-LR parser was modified to work with non-uniform unit type acoustic and duration models. Experimental results showed, that, for example, use of mora trigram probabilities improved the phrase error rate from 17% to 14%.<>