Nattapong Kurpukdee, Surasak Boonkla, P. Sertsi, Vataya Chunwijitra
{"title":"用于自动语音识别服务的异步并行解码","authors":"Nattapong Kurpukdee, Surasak Boonkla, P. Sertsi, Vataya Chunwijitra","doi":"10.1109/JCSSE53117.2021.9493832","DOIUrl":null,"url":null,"abstract":"We proposed a new automatic speech recognition (ASR) service architecture that is extendable to medium-scale ASR service and more flexible than the previous architecture. Improvement aims to substitute the distributed processing approach with an asynchronous parallel thread for decoding multiple voice streams. We replace our TCP-based communication protocol with a remote procedure call developed by Google (gRPC) that makes our ASR service become a developer-friendly, less overhead connection. Besides, the API gateway is employed to reinforce the ASR services by multiple servers so that we can increase our new ASR service to a larger scale. The experimental result shows that our new architecture performs faster than the previous architecture in terms of real-time factor.","PeriodicalId":437534,"journal":{"name":"2021 18th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Asynchronously Parallel Decoding For Automatic Speech Recognition Services\",\"authors\":\"Nattapong Kurpukdee, Surasak Boonkla, P. Sertsi, Vataya Chunwijitra\",\"doi\":\"10.1109/JCSSE53117.2021.9493832\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We proposed a new automatic speech recognition (ASR) service architecture that is extendable to medium-scale ASR service and more flexible than the previous architecture. Improvement aims to substitute the distributed processing approach with an asynchronous parallel thread for decoding multiple voice streams. We replace our TCP-based communication protocol with a remote procedure call developed by Google (gRPC) that makes our ASR service become a developer-friendly, less overhead connection. Besides, the API gateway is employed to reinforce the ASR services by multiple servers so that we can increase our new ASR service to a larger scale. The experimental result shows that our new architecture performs faster than the previous architecture in terms of real-time factor.\",\"PeriodicalId\":437534,\"journal\":{\"name\":\"2021 18th International Joint Conference on Computer Science and Software Engineering (JCSSE)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 18th International Joint Conference on Computer Science and Software Engineering (JCSSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/JCSSE53117.2021.9493832\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 18th International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCSSE53117.2021.9493832","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Asynchronously Parallel Decoding For Automatic Speech Recognition Services
We proposed a new automatic speech recognition (ASR) service architecture that is extendable to medium-scale ASR service and more flexible than the previous architecture. Improvement aims to substitute the distributed processing approach with an asynchronous parallel thread for decoding multiple voice streams. We replace our TCP-based communication protocol with a remote procedure call developed by Google (gRPC) that makes our ASR service become a developer-friendly, less overhead connection. Besides, the API gateway is employed to reinforce the ASR services by multiple servers so that we can increase our new ASR service to a larger scale. The experimental result shows that our new architecture performs faster than the previous architecture in terms of real-time factor.