H. T. Diep, Thi-My-Thanh Nguyen, Ngoc-Bich Le, Xuan-Quy Dao
{"title":"Evaluation of Vietnamese Speech Recognition Platforms","authors":"H. T. Diep, Thi-My-Thanh Nguyen, Ngoc-Bich Le, Xuan-Quy Dao","doi":"10.1145/3453800.3453826","DOIUrl":null,"url":null,"abstract":"∗The purpose of this paper is to evaluate the performance of Vietnamese speech recognition systems provided by top Vietnamese companies such as Vais, Vtcc, Fpt, and Google. This paper presents the results in applying Vietnamese automatic speech recognition systems in news, interview, and music domains. We use recorded audios as inputs to compare the performance of Vietnamese automatic speech recognition systems by calculating Word Error Rate. Vais and Viettel obtain good results in news and interview domains while Google has good results in the music domain. The results demonstrated that all the providers Vais, Viettel, Google, and Fpt achieve good results but Vais is more dominant.","PeriodicalId":109559,"journal":{"name":"International Conference on Machine Learning and Soft Computing","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Machine Learning and Soft Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3453800.3453826","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
∗The purpose of this paper is to evaluate the performance of Vietnamese speech recognition systems provided by top Vietnamese companies such as Vais, Vtcc, Fpt, and Google. This paper presents the results in applying Vietnamese automatic speech recognition systems in news, interview, and music domains. We use recorded audios as inputs to compare the performance of Vietnamese automatic speech recognition systems by calculating Word Error Rate. Vais and Viettel obtain good results in news and interview domains while Google has good results in the music domain. The results demonstrated that all the providers Vais, Viettel, Google, and Fpt achieve good results but Vais is more dominant.