H. T. Diep, Thi-My-Thanh Nguyen, Ngoc-Bich Le, Xuan-Quy Dao
{"title":"越南语语音识别平台的评价","authors":"H. T. Diep, Thi-My-Thanh Nguyen, Ngoc-Bich Le, Xuan-Quy Dao","doi":"10.1145/3453800.3453826","DOIUrl":null,"url":null,"abstract":"∗The purpose of this paper is to evaluate the performance of Vietnamese speech recognition systems provided by top Vietnamese companies such as Vais, Vtcc, Fpt, and Google. This paper presents the results in applying Vietnamese automatic speech recognition systems in news, interview, and music domains. We use recorded audios as inputs to compare the performance of Vietnamese automatic speech recognition systems by calculating Word Error Rate. Vais and Viettel obtain good results in news and interview domains while Google has good results in the music domain. The results demonstrated that all the providers Vais, Viettel, Google, and Fpt achieve good results but Vais is more dominant.","PeriodicalId":109559,"journal":{"name":"International Conference on Machine Learning and Soft Computing","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Evaluation of Vietnamese Speech Recognition Platforms\",\"authors\":\"H. T. Diep, Thi-My-Thanh Nguyen, Ngoc-Bich Le, Xuan-Quy Dao\",\"doi\":\"10.1145/3453800.3453826\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"∗The purpose of this paper is to evaluate the performance of Vietnamese speech recognition systems provided by top Vietnamese companies such as Vais, Vtcc, Fpt, and Google. This paper presents the results in applying Vietnamese automatic speech recognition systems in news, interview, and music domains. We use recorded audios as inputs to compare the performance of Vietnamese automatic speech recognition systems by calculating Word Error Rate. Vais and Viettel obtain good results in news and interview domains while Google has good results in the music domain. The results demonstrated that all the providers Vais, Viettel, Google, and Fpt achieve good results but Vais is more dominant.\",\"PeriodicalId\":109559,\"journal\":{\"name\":\"International Conference on Machine Learning and Soft Computing\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Machine Learning and Soft Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3453800.3453826\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Machine Learning and Soft Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3453800.3453826","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Evaluation of Vietnamese Speech Recognition Platforms
∗The purpose of this paper is to evaluate the performance of Vietnamese speech recognition systems provided by top Vietnamese companies such as Vais, Vtcc, Fpt, and Google. This paper presents the results in applying Vietnamese automatic speech recognition systems in news, interview, and music domains. We use recorded audios as inputs to compare the performance of Vietnamese automatic speech recognition systems by calculating Word Error Rate. Vais and Viettel obtain good results in news and interview domains while Google has good results in the music domain. The results demonstrated that all the providers Vais, Viettel, Google, and Fpt achieve good results but Vais is more dominant.