{"title":"Transfer learning for domain and environment adaptation in Serbian ASR","authors":"B. Popović, E. Pakoci, D. Pekar","doi":"10.5937/TELFOR2002110P","DOIUrl":null,"url":null,"abstract":"In automatic speech recognition systems, the training data used for system development and the data actually obtained from the users of the system sometimes significantly differ in practice. However, other, more similar data may be available. Transfer learning can help to exploit such similar data for training in order to boost the automatic speech recognizer's performance for a certain domain. This paper presents a few applications of transfer learning in the context of speech recognition, specifically for the Serbian language. Several methods are proposed, with the goal of optimizing system performance on a specific part of the existing speech database for Serbian, or in a noisy environment. The experimental results evaluated on a test set from the desired domain show significant improvement in both word error rate and character error rate.","PeriodicalId":37719,"journal":{"name":"Telfor Journal","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Telfor Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5937/TELFOR2002110P","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
In automatic speech recognition systems, the training data used for system development and the data actually obtained from the users of the system sometimes significantly differ in practice. However, other, more similar data may be available. Transfer learning can help to exploit such similar data for training in order to boost the automatic speech recognizer's performance for a certain domain. This paper presents a few applications of transfer learning in the context of speech recognition, specifically for the Serbian language. Several methods are proposed, with the goal of optimizing system performance on a specific part of the existing speech database for Serbian, or in a noisy environment. The experimental results evaluated on a test set from the desired domain show significant improvement in both word error rate and character error rate.
期刊介绍:
The TELFOR Journal is an open access international scientific journal publishing improved and extended versions of the selected best papers initially reported at the annual TELFOR Conference (www.telfor.rs), papers invited by the Editorial Board, and papers submitted by authors themselves for publishing. All papers are subject to reviewing. The TELFOR Journal is published in the English language, with both electronic and printed versions. Being an IEEE co-supported publication, it will follow all the IEEE rules and procedures. The TELFOR Journal covers all the essential branches of modern telecommunications and information technology: Telecommunications Policy and Services, Telecommunications Networks, Radio Communications, Communications Systems, Signal Processing, Optical Communications, Applied Electromagnetics, Applied Electronics, Multimedia, Software Tools and Applications, as well as other fields related to ICT. This large spectrum of topics accounts for the rapid convergence through telecommunications of the underlying technologies towards the information and knowledge society. The Journal provides a medium for exchanging research results and technological achievements accomplished by the scientific community from academia and industry.