Sila Chunwijitra, Surasak Boonkla, Vataya Chunwijitra, Nattapong Kurpukdee, P. Sertsi, S. Kasuriya
2019 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), published 2019-10-01. DOI: 10.1109/O-COCOSDA46868.2019.9041201 (https://doi.org/10.1109/O-COCOSDA46868.2019.9041201)
Distributing and Sharing Resources for Automatic Speech Recognition Applications
Deploying automatic speech recognition (ASR) systems in real-world scenarios raises difficulties in two main areas: processing time and resource demands. Both are major obstacles to ASR deployment. This paper proposes three approaches to address them: applying multithreaded processing to separate sub-processes, applying multiplexing and demultiplexing techniques to the network socket, and improving the distribution of speech recognition engines in audio streaming. In the experiments, we evaluated our approaches with two types of speech input (audio files and audio streams). The results show that our approaches use fewer resources (by sharing working memory) and also reduce processing time: the real-time factor (RTF) is reduced by approximately 15% compared with the baseline system.
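For readers unfamiliar with the metric, the real-time factor is conventionally the time spent processing divided by the duration of the audio processed, so values below 1 mean faster than real time. The sketch below only illustrates that arithmetic and how a relative reduction is computed. The function names and the timings are hypothetical, chosen for illustration; they are not taken from the paper's implementation or measurements.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Return RTF = processing time / duration of the audio processed."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def relative_reduction(baseline_rtf: float, new_rtf: float) -> float:
    """Return the fractional RTF reduction relative to the baseline."""
    if baseline_rtf <= 0:
        raise ValueError("baseline_rtf must be positive")
    return (baseline_rtf - new_rtf) / baseline_rtf


# Hypothetical timings for a 10-second utterance (illustration only).
baseline = real_time_factor(processing_seconds=8.0, audio_seconds=10.0)
improved = real_time_factor(processing_seconds=6.8, audio_seconds=10.0)

print(f"baseline RTF: {baseline:.2f}")
print(f"improved RTF: {improved:.2f}")
print(f"relative reduction: {relative_reduction(baseline, improved):.0%}")
```

With these made-up timings the relative reduction works out to about 15%, which is the order of improvement the abstract reports.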