T. Nakatania, T. Yoshiokaa, K. Kinoshita, M. Miyoshi, B. Juang
{"title":"Speech Dereverberation in Short Time Fourier Transform Domain with Crossband Effect Compensation","authors":"T. Nakatania, T. Yoshiokaa, K. Kinoshita, M. Miyoshi, B. Juang","doi":"10.1109/HSCMA.2008.4538726","DOIUrl":null,"url":null,"abstract":"It has recently been shown that the maximum likelihood estimation approach with a time-varying source model is very effective in achieving speech dereverberation based only on a short observation. In addition, STFT domain processing has been shown to be promising for implementing this dereverberation approach in a computationally efficient way. This paper presents a way of further improving the STFT domain speech dereverberation in terms of both computational cost and accuracy. One important issue here is how to calculate time-domain convolution with a long filter precisely using STFT. We introduce an STFT domain filtering method with crossband effect compensation for this purpose. Experimental results show that the proposed method allows us to implement the dereverberation algorithm in the STFT domain more precisely with less computational cost than the existing method.","PeriodicalId":129827,"journal":{"name":"2008 Hands-Free Speech Communication and Microphone Arrays","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Hands-Free Speech Communication and Microphone Arrays","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HSCMA.2008.4538726","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
It has recently been shown that the maximum likelihood estimation approach with a time-varying source model is very effective in achieving speech dereverberation based only on a short observation. In addition, STFT domain processing has been shown to be promising for implementing this dereverberation approach in a computationally efficient way. This paper presents a way of further improving the STFT domain speech dereverberation in terms of both computational cost and accuracy. One important issue here is how to calculate time-domain convolution with a long filter precisely using STFT. We introduce an STFT domain filtering method with crossband effect compensation for this purpose. Experimental results show that the proposed method allows us to implement the dereverberation algorithm in the STFT domain more precisely with less computational cost than the existing method.