Ami Berger, Vladimir Tourbabin, Jacob Donley, Zamir Ben-Hur, Boaz Rafaely
{"title":"Insights into the Incorporation of Signal Information in Binaural Signal Matching with Wearable Microphone Arrays","authors":"Ami Berger, Vladimir Tourbabin, Jacob Donley, Zamir Ben-Hur, Boaz Rafaely","doi":"arxiv-2409.11731","DOIUrl":null,"url":null,"abstract":"The increasing popularity of spatial audio in applications such as\nteleconferencing, entertainment, and virtual reality has led to the recent\ndevelopments of binaural reproduction methods. However, only a few of these\nmethods are well-suited for wearable and mobile arrays, which typically consist\nof a small number of microphones. One such method is binaural signal matching\n(BSM), which has been shown to produce high-quality binaural signals for\nwearable arrays. However, BSM may be suboptimal in cases of high\ndirect-to-reverberant ratio (DRR) as it is based on the diffuse sound field\nassumption. To overcome this limitation, previous studies incorporated\nsound-field models other than diffuse. However, this approach was not studied\ncomprehensively. This paper extensively investigates two BSM-based methods\ndesigned for high DRR scenarios. The methods incorporate a sound field model\ncomposed of direct and reverberant components.The methods are investigated both\nmathematically and using simulations, finally validated by a listening test.\nThe results show that the proposed methods can significantly improve the\nperformance of BSM , in particular in the direction of the source, while\npresenting only a negligible degradation in other directions. Furthermore, when\nsource direction estimation is inaccurate, performance of these methods degrade\nto equal that of the BSM, presenting a desired robustness quality.","PeriodicalId":501284,"journal":{"name":"arXiv - EE - Audio and Speech Processing","volume":"2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Audio and Speech Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11731","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The increasing popularity of spatial audio in applications such as
teleconferencing, entertainment, and virtual reality has led to the recent
developments of binaural reproduction methods. However, only a few of these
methods are well-suited for wearable and mobile arrays, which typically consist
of a small number of microphones. One such method is binaural signal matching
(BSM), which has been shown to produce high-quality binaural signals for
wearable arrays. However, BSM may be suboptimal in cases of high
direct-to-reverberant ratio (DRR) as it is based on the diffuse sound field
assumption. To overcome this limitation, previous studies incorporated
sound-field models other than diffuse. However, this approach was not studied
comprehensively. This paper extensively investigates two BSM-based methods
designed for high DRR scenarios. The methods incorporate a sound field model
composed of direct and reverberant components.The methods are investigated both
mathematically and using simulations, finally validated by a listening test.
The results show that the proposed methods can significantly improve the
performance of BSM , in particular in the direction of the source, while
presenting only a negligible degradation in other directions. Furthermore, when
source direction estimation is inaccurate, performance of these methods degrade
to equal that of the BSM, presenting a desired robustness quality.