Dynamic Improvements in a Cloud-Based Speech Recognition Engine by Incorporating Trending Data

Milind Bhavsar, Prudhvi Kosaraju, G. Ananthakrishnan, Gurudas Subray Shet, Saurav Anand
{"title":"Dynamic Improvements in a Cloud-Based Speech Recognition Engine by Incorporating Trending Data","authors":"Milind Bhavsar, Prudhvi Kosaraju, G. Ananthakrishnan, Gurudas Subray Shet, Saurav Anand","doi":"10.1109/MobileCloud.2016.12","DOIUrl":null,"url":null,"abstract":"With the advancement of speech recognition technologies, there is an increase in the adoption of voice interfaces on mobile-based platforms. While, developing a general purpose Automatic Speech Recognition (ASR) which can understand voice commands is important, the contexts of how people interact with their mobile device change very rapidly. Due to the high processing complexity of the ASR engine, much of the processing of trending data is being carried out on cloudplatforms. Changed content regarding news, music, movies and TV series change the focus of interaction with voice based interfaces. Hence ASR engines trained on a static vocabulary may not be able to adapt to the changing contexts. The focus of this paper is to first describe the problems faced in incorporating dynamically changing vocabulary and contexts into an ASR engine. We then propose a novel solution which shows a relative improvement of 38 percent utterance accuracy on newly added content without compromising on the overall accuracy and stability of the system.","PeriodicalId":176270,"journal":{"name":"2016 4th IEEE International Conference on Mobile Cloud Computing, Services, and Engineering (MobileCloud)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 4th IEEE International Conference on Mobile Cloud Computing, Services, and Engineering (MobileCloud)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MobileCloud.2016.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

With the advancement of speech recognition technologies, there is an increase in the adoption of voice interfaces on mobile-based platforms. While, developing a general purpose Automatic Speech Recognition (ASR) which can understand voice commands is important, the contexts of how people interact with their mobile device change very rapidly. Due to the high processing complexity of the ASR engine, much of the processing of trending data is being carried out on cloudplatforms. Changed content regarding news, music, movies and TV series change the focus of interaction with voice based interfaces. Hence ASR engines trained on a static vocabulary may not be able to adapt to the changing contexts. The focus of this paper is to first describe the problems faced in incorporating dynamically changing vocabulary and contexts into an ASR engine. We then propose a novel solution which shows a relative improvement of 38 percent utterance accuracy on newly added content without compromising on the overall accuracy and stability of the system.
结合趋势数据的基于云的语音识别引擎的动态改进
随着语音识别技术的进步,在移动平台上采用语音接口的人越来越多。虽然开发一种能够理解语音命令的通用自动语音识别(ASR)很重要,但人们与移动设备交互的环境变化非常迅速。由于ASR引擎的高处理复杂性,趋势数据的大部分处理都在云平台上进行。新闻、音乐、电影和电视剧等内容的改变改变了基于语音界面的交互重点。因此,在静态词汇表上训练的ASR引擎可能无法适应不断变化的上下文。本文的重点是首先描述将动态变化的词汇和上下文合并到ASR引擎中所面临的问题。然后,我们提出了一种新的解决方案,在不影响系统整体准确性和稳定性的情况下,在新添加的内容上相对提高了38%的话语准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信