Audio personalization using head related transfer function in 3DTV
Yongqing Tang, Yong Fang, Qinghua Huang
2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON), 2011-05-16
DOI: 10.1109/3DTV.2011.5877191 (https://doi.org/10.1109/3DTV.2011.5877191)
Citations: 4
Abstract
In 3DTV, head-related transfer functions (HRTFs) can enhance the listener's sense of immersion because they carry spatial information about the sound source. Audio can be customized by using a personalized HRTF; conversely, listening distortions arise when the HRTFs do not match a listener's anthropometric parameters. This paper proposes a method for personalizing individual HRTFs based on non-negative matrix factorization (NMF) and support vector regression (SVR). Anthropometric parameters are selected, and the high-dimensional HRTFs are decomposed into a low-dimensional matrix using NMF. SVR is then used to derive a nonlinear regression model between the selected anthropometric parameters and the low-dimensional matrix. Experimental results show that personalized HRTFs perform better than using the same HRTF for all listeners in 3DTV audio.
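The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: it assumes scikit-learn's `NMF` and `SVR`, uses synthetic stand-in data, and assumes that the "low dimensional matrix" is the per-listener weight matrix from the factorization. The array sizes, the RBF kernel, and the hyperparameters are all hypothetical choices; the abstract does not specify them.

```python
import numpy as np
from sklearn.decomposition import NMF
from sklearn.svm import SVR

rng = np.random.default_rng(0)

# Hypothetical sizes: none of these come from the paper.
n_subjects, n_anthro, n_bins, n_components = 30, 5, 64, 4

# Synthetic stand-ins for measured data (assumption):
#   anthro   - anthropometric parameters, one row per listener
#   hrtf_mag - non-negative HRTF magnitudes, one row per listener
anthro = rng.uniform(0.0, 1.0, size=(n_subjects, n_anthro))
mixing = rng.normal(size=(n_anthro, n_components))
weights = np.abs(anthro @ mixing) + 0.1
basis = rng.uniform(0.1, 1.0, size=(n_components, n_bins))
hrtf_mag = weights @ basis

train, test = slice(0, 25), slice(25, 30)

# Step 1: NMF factorizes the high-dimensional HRTFs as hrtf_mag ~ W @ H,
# where W holds low-dimensional per-listener weights and H a shared basis.
nmf = NMF(n_components=n_components, init="nndsvda",
          max_iter=1000, random_state=0)
W_train = nmf.fit_transform(hrtf_mag[train])
H = nmf.components_

# Step 2: one nonlinear (RBF-kernel) SVR per NMF component maps
# anthropometric parameters to that component's weight.
models = [
    SVR(kernel="rbf", C=10.0, epsilon=0.01).fit(anthro[train], W_train[:, k])
    for k in range(n_components)
]

# Step 3: predict weights for unseen listeners and rebuild their HRTFs.
W_pred = np.column_stack([m.predict(anthro[test]) for m in models])
W_pred = np.clip(W_pred, 0.0, None)  # SVR output is not constrained to be >= 0
hrtf_pred = W_pred @ H

print(hrtf_pred.shape)
```

Fitting one regressor per component is a design choice forced by `SVR` being single-output. Clipping the predicted weights keeps the reconstructed magnitudes non-negative, consistent with the NMF model.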