Speech recognition results for voice-controlled assistive applications

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD). Pub Date: 2017-07-06. DOI: 10.1109/SPED.2017.7990438

Alexandru Caranica, H. Cucu, C. Burileanu, François Portet, Michel Vacher


Citations: 19

Abstract

Until recently, controlling a “smart home” consisted of setting up a series of applications and automation tools: scheduling when the air conditioning system could cool the room, turning on the lighting system at sunset, or just using one's phone to control several TV appliances or the garage door. Recent advances in speech recognition technology have made voice-controlled smart homes attainable, and many companies and communities are providing interfaces or home boxes to make this voice control available. However, these offerings lack customization ability, and interoperability with appliances or applications is not guaranteed. Moreover, most of these systems do not focus on supporting specific voice recognition scenarios, such as assistive applications for elderly or disabled people, or they consider a triggered close-talking voice interaction. Although state-of-the-art speech processing has achieved great performance for the most widely used languages, little to no effort has been made for under-resourced languages, such as Romanian. This paper focuses on a set of experiments in building a series of acoustic and grammar models for the Romanian language, to be used in distant speech recognition scenarios for voice-driven applications in intelligent homes or buildings. The models are built using Romanian speech databases previously acquired by our research group in real-life conditions.

