Daria Vazhenina, I. Kipyatkova, K. Markov, Alexey Karpov
{"title":"State-of-the-art speech recognition technologies for Russian language","authors":"Daria Vazhenina, I. Kipyatkova, K. Markov, Alexey Karpov","doi":"10.1145/2160749.2160763","DOIUrl":null,"url":null,"abstract":"In this paper, we present a review of the latest developments in the Russian speech recognition research. Although the underlying speech technology is mostly language-independent, differences between languages with respect to their structure and grammar have substantial effect on the recognition systems performance. The Russian language has a complicated word formation system, which is characterized by a high degree of inflection and unrigidness of the word order. This greatly reduces the predictive power of the conventional language models and consequently increases the error rate. Current statistical approach to speech recognition requires large amount of both speech and text data. There exist several Russian speech databases and their descriptions are given in this paper. In addition, we describe and compare several speech recognition systems developed in Russia as well as in some other countries. Finally we suggest some promising directions for further research in Russian speech technology.","PeriodicalId":407345,"journal":{"name":"Joint International Conference on Human-Centered Computer Environments","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Joint International Conference on Human-Centered Computer Environments","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2160749.2160763","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
In this paper, we present a review of the latest developments in the Russian speech recognition research. Although the underlying speech technology is mostly language-independent, differences between languages with respect to their structure and grammar have substantial effect on the recognition systems performance. The Russian language has a complicated word formation system, which is characterized by a high degree of inflection and unrigidness of the word order. This greatly reduces the predictive power of the conventional language models and consequently increases the error rate. Current statistical approach to speech recognition requires large amount of both speech and text data. There exist several Russian speech databases and their descriptions are given in this paper. In addition, we describe and compare several speech recognition systems developed in Russia as well as in some other countries. Finally we suggest some promising directions for further research in Russian speech technology.