Audio personalization using head related transfer function in 3DTV

2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON) Pub Date : 2011-05-16 DOI:10.1109/3DTV.2011.5877191

Yongqing Tang, Yong Fang, Qinghua Huang

引用次数: 4

Abstract

In 3DTV, head related transfer function (HRTFs) can promote immersive feeling of listeners because it contains spatial information on sound source. Audio can be customized through using personalized HRTF. So, listening distortions are caused if HRTFs do not match anthropometric parameters concerning different listeners. In this paper, personalized method is proposed to customize individual HRTF based on non-negative matrix factorization (NMF) and support vector regression (SVR). The anthropometric parameters are selected and high dimensional HRTFs are decomposed into low dimensional matrix using NMF. Nonlinear regression model is derived between the selected anthropometric parameters and low dimensional matrix by SVR. Experimental results demonstrated that personalized HRTF has better performance than using the same HRTF for different listeners in 3DTV audio.

查看原文本刊更多论文

3DTV中使用头部相关传输功能的音频个性化

在3DTV中，头部相关传递函数(head related transfer function, HRTFs)由于包含了声源的空间信息，可以促进听者的沉浸感。音频可以通过使用个性化的HRTF来定制。因此，如果hrtf与不同听众的人体测量参数不匹配，就会导致听力失真。本文提出了基于非负矩阵分解(NMF)和支持向量回归(SVR)的个性化HRTF定制方法。选择人体测量参数，利用NMF将高维hrtf分解为低维矩阵。采用支持向量回归法建立了所选人体参数与低维矩阵之间的非线性回归模型。实验结果表明，在3DTV音频中，个性化HRTF比针对不同听者使用相同的HRTF具有更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)

自引率

0.00%

发文量