Marc Delcroix, Takuya Yoshioka, A. Ogawa, Yotaro Kubo, M. Fujimoto, N. Ito, K. Kinoshita, Miquel Espi, S. Araki, Takaaki Hori, T. Nakatani
{"title":"Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition","authors":"Marc Delcroix, Takuya Yoshioka, A. Ogawa, Yotaro Kubo, M. Fujimoto, N. Ito, K. Kinoshita, Miquel Espi, S. Araki, Takaaki Hori, T. Nakatani","doi":"10.1109/GlobalSIP.2014.7032172","DOIUrl":null,"url":null,"abstract":"Automatic speech recognition is being used successfully in more and more products. However, current recognition systems usually require the use of close-talking microphones. This constraint limits the deployment of speech recognition for new applications. In hands-free situations, noise and reverberation cause a severe degradation of the recognition performance. The problem of noise robustness has attracted a great deal of attention and practical solutions have been proposed and evaluated with common benchmarks. In contrast, reverberation has long been considered an unsolvable problem. Recently, significant progress has been made in the field of reverberant speech recognition and this progress has been evaluated with the REVERB challenge 2014. In this paper, we describe the reverberant speech recognition system we proposed for the REVERB challenge that exhibited high recognition performance even under severe reverberation conditions. We compare our system with other proposed approaches to suggest potential future research directions in the field.","PeriodicalId":362306,"journal":{"name":"2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"195 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GlobalSIP.2014.7032172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Automatic speech recognition is being used successfully in more and more products. However, current recognition systems usually require the use of close-talking microphones. This constraint limits the deployment of speech recognition for new applications. In hands-free situations, noise and reverberation cause a severe degradation of the recognition performance. The problem of noise robustness has attracted a great deal of attention and practical solutions have been proposed and evaluated with common benchmarks. In contrast, reverberation has long been considered an unsolvable problem. Recently, significant progress has been made in the field of reverberant speech recognition and this progress has been evaluated with the REVERB challenge 2014. In this paper, we describe the reverberant speech recognition system we proposed for the REVERB challenge that exhibited high recognition performance even under severe reverberation conditions. We compare our system with other proposed approaches to suggest potential future research directions in the field.