Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition

Marc Delcroix, Takuya Yoshioka, A. Ogawa, Yotaro Kubo, M. Fujimoto, N. Ito, K. Kinoshita, Miquel Espi, S. Araki, Takaaki Hori, T. Nakatani
{"title":"Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition","authors":"Marc Delcroix, Takuya Yoshioka, A. Ogawa, Yotaro Kubo, M. Fujimoto, N. Ito, K. Kinoshita, Miquel Espi, S. Araki, Takaaki Hori, T. Nakatani","doi":"10.1109/GlobalSIP.2014.7032172","DOIUrl":null,"url":null,"abstract":"Automatic speech recognition is being used successfully in more and more products. However, current recognition systems usually require the use of close-talking microphones. This constraint limits the deployment of speech recognition for new applications. In hands-free situations, noise and reverberation cause a severe degradation of the recognition performance. The problem of noise robustness has attracted a great deal of attention and practical solutions have been proposed and evaluated with common benchmarks. In contrast, reverberation has long been considered an unsolvable problem. Recently, significant progress has been made in the field of reverberant speech recognition and this progress has been evaluated with the REVERB challenge 2014. In this paper, we describe the reverberant speech recognition system we proposed for the REVERB challenge that exhibited high recognition performance even under severe reverberation conditions. We compare our system with other proposed approaches to suggest potential future research directions in the field.","PeriodicalId":362306,"journal":{"name":"2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"195 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GlobalSIP.2014.7032172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Automatic speech recognition is being used successfully in more and more products. However, current recognition systems usually require the use of close-talking microphones. This constraint limits the deployment of speech recognition for new applications. In hands-free situations, noise and reverberation cause a severe degradation of the recognition performance. The problem of noise robustness has attracted a great deal of attention and practical solutions have been proposed and evaluated with common benchmarks. In contrast, reverberation has long been considered an unsolvable problem. Recently, significant progress has been made in the field of reverberant speech recognition and this progress has been evaluated with the REVERB challenge 2014. In this paper, we describe the reverberant speech recognition system we proposed for the REVERB challenge that exhibited high recognition performance even under severe reverberation conditions. We compare our system with other proposed approaches to suggest potential future research directions in the field.
击败混响:先进的去混响和识别技术,免提语音识别
自动语音识别在越来越多的产品中得到了成功的应用。然而,目前的识别系统通常需要使用近距离通话麦克风。这种约束限制了语音识别在新应用程序中的部署。在免提情况下,噪声和混响会严重降低识别性能。噪声鲁棒性问题引起了广泛的关注,并提出了实用的解决方案,并使用通用基准进行了评估。相比之下,混响一直被认为是一个无法解决的问题。最近,混响语音识别领域取得了重大进展,这一进展已经通过2014年的REVERB挑战进行了评估。在本文中,我们描述了我们针对REVERB挑战提出的混响语音识别系统,即使在严重混响条件下也能表现出很高的识别性能。我们将我们的系统与其他提出的方法进行比较,以提出该领域潜在的未来研究方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信