Head-Related Transfer Function Modeling Based on Finite-Impulse Response

Jian Zhang, Risheng Xia, Chundong Xu, Junfeng Li, Yonghong Yan, S. Sakamoto
{"title":"Head-Related Transfer Function Modeling Based on Finite-Impulse Response","authors":"Jian Zhang, Risheng Xia, Chundong Xu, Junfeng Li, Yonghong Yan, S. Sakamoto","doi":"10.1109/IIH-MSP.2013.87","DOIUrl":null,"url":null,"abstract":"Head-related transfer functions (HRTFs) or its time-domain counterpart, head-related impulse responses (HRIRs) play a crucial role in 3-D audio rendering. The HRIRs with a couple of hundred-sample lengths result in the high computation cost for the real-time 3-D audio applications especially when multiple sound sources are rendered simultaneously. To overcome this problem, various modeling approaches have been reported to shorten HRIRs. In this research, an efficient finite-impulse-response (FIR) model is studied which is essentially based on the concept of the minimum-phase modeling technique. In this method, the measured HRIRs are represented by the interaural time delay (ITD) and the magnitude spectra that are approximated by two FIR filters. To investigate the accuracy dependence of this modeling approach on the order of FIR filter, two psychoacoustic listening experiments were conducted. In the examinations, two HRIR databases (MIT and CIPIC) are exploited and evaluated in terms of sound source localization and sound quality by comparing the synthesized stimuli with the measured HRIRs and those with the FIR models of different orders. Experimental results indicated that the measured hundred-sample-length HRIRs can be sufficiently modeled by the low-order (a dozen of coefficients) FIR model from the perceptual point of view, which represents an important reduction in the computational cost for implementing real-time 3-D audio applications.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIH-MSP.2013.87","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Head-related transfer functions (HRTFs) or its time-domain counterpart, head-related impulse responses (HRIRs) play a crucial role in 3-D audio rendering. The HRIRs with a couple of hundred-sample lengths result in the high computation cost for the real-time 3-D audio applications especially when multiple sound sources are rendered simultaneously. To overcome this problem, various modeling approaches have been reported to shorten HRIRs. In this research, an efficient finite-impulse-response (FIR) model is studied which is essentially based on the concept of the minimum-phase modeling technique. In this method, the measured HRIRs are represented by the interaural time delay (ITD) and the magnitude spectra that are approximated by two FIR filters. To investigate the accuracy dependence of this modeling approach on the order of FIR filter, two psychoacoustic listening experiments were conducted. In the examinations, two HRIR databases (MIT and CIPIC) are exploited and evaluated in terms of sound source localization and sound quality by comparing the synthesized stimuli with the measured HRIRs and those with the FIR models of different orders. Experimental results indicated that the measured hundred-sample-length HRIRs can be sufficiently modeled by the low-order (a dozen of coefficients) FIR model from the perceptual point of view, which represents an important reduction in the computational cost for implementing real-time 3-D audio applications.
基于有限脉冲响应的头部相关传递函数建模
头部相关传递函数(hrtf)或其时域对应的头部相关脉冲响应(HRIRs)在三维音频渲染中起着至关重要的作用。数百个样本长度的hrir导致了实时三维音频应用的高计算成本,特别是当多个声源同时呈现时。为了克服这个问题,已经报道了各种建模方法来缩短hrir。本研究主要基于最小相位建模技术的概念,研究了一种有效的有限脉冲响应(FIR)模型。在该方法中,测量的hrir由两个FIR滤波器近似的声间时间延迟(ITD)和幅度谱表示。为了研究这种建模方法对FIR滤波器阶数的精度依赖性,进行了两次心理声学聆听实验。在实验中,利用MIT和CIPIC两个HRIR数据库,通过将合成刺激与测量的HRIR和不同阶次的FIR模型进行比较,从声源定位和音质方面进行了评估。实验结果表明,从感知的角度来看,低阶(十几个系数)FIR模型可以充分地模拟测量到的百样本长度的hrir,这对于实现实时三维音频应用来说是一个重要的计算成本降低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信