{"title":"Phase-Domain Statistical Analysis for Audio Source Localization","authors":"A. Said, T. Kalker, R. Schafer","doi":"10.1109/MMSP.2007.4412826","DOIUrl":null,"url":null,"abstract":"We consider the problem of estimating the time-difference-of-arrival (TDOA) for audio source localization in noisy environments, defining a framework for statistical analysis in the phase domain which enables more reliable estimates. This is motivated by the fact that with complex sources, noise, and interfering signals, different frequency bands have significantly different signal-to-noise ratios, creating non-uniform distributions of errors in the phase measurements. Through a new method for analysis of the variations of the phase in frequency windows, we first estimate the signal-to-noise ratio for frequency, and then use it in a maximum-likelihood estimation of the time difference of arrival. We show that this corresponds to a generalization of the Phase Transform method (PHAT), and provides a theoretical justification of why it works so well. Numerical results show how the proposed technique compares favorably with PHAT.","PeriodicalId":225295,"journal":{"name":"2007 IEEE 9th Workshop on Multimedia Signal Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE 9th Workshop on Multimedia Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2007.4412826","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
We consider the problem of estimating the time-difference-of-arrival (TDOA) for audio source localization in noisy environments, defining a framework for statistical analysis in the phase domain which enables more reliable estimates. This is motivated by the fact that with complex sources, noise, and interfering signals, different frequency bands have significantly different signal-to-noise ratios, creating non-uniform distributions of errors in the phase measurements. Through a new method for analysis of the variations of the phase in frequency windows, we first estimate the signal-to-noise ratio for frequency, and then use it in a maximum-likelihood estimation of the time difference of arrival. We show that this corresponds to a generalization of the Phase Transform method (PHAT), and provides a theoretical justification of why it works so well. Numerical results show how the proposed technique compares favorably with PHAT.