Prediction of Head Related Transfer Functions Using Machine Learning Approaches

IF 1.3 Q3 ACOUSTICS

Acoustics (Basel, Switzerland) Pub Date : 2023-03-01 DOI:10.3390/acoustics5010015

R. Fernandez Martinez, P. Jimbert, E. M. Sumner, M. Riedel, Runar Unnthorsson

{"title":"Prediction of Head Related Transfer Functions Using Machine Learning Approaches","authors":"R. Fernandez Martinez, P. Jimbert, E. M. Sumner, M. Riedel, Runar Unnthorsson","doi":"10.3390/acoustics5010015","DOIUrl":null,"url":null,"abstract":"The generation of a virtual, personal, auditory space to obtain a high-quality sound experience when using headphones is of great significance. Normally this experience is improved using personalized head-related transfer functions (HRTFs) that depend on a large degree of personal anthropometric information on pinnae. Most of the studies focus their personal auditory optimization analysis on the study of amplitude versus frequency on HRTFs, mainly in the search for significant elevation cues of frequency maps. Therefore, knowing the HRTFs of each individual is of considerable help to improve sound quality. The following work proposes a methodology to model HRTFs according to the individual structure of pinnae using multilayer perceptron and linear regression techniques. It is proposed to generate several models that allow knowing HRTFs amplitude for each frequency based on the personal anthropometric data on pinnae, the azimuth angle, and the elevation of the sound source, thus predicting frequency magnitudes. Experiments show that the prediction of new personal HRTF generates low errors, thus this model can be applied to new heads with different pinnae characteristics with high confidence. Improving the results obtained with the standard KEMAR pinna, usually used in cases where there is a lack of information.","PeriodicalId":72045,"journal":{"name":"Acoustics (Basel, Switzerland)","volume":" ","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acoustics (Basel, Switzerland)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/acoustics5010015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ACOUSTICS","Score":null,"Total":0}

引用次数: 0

Abstract

The generation of a virtual, personal, auditory space to obtain a high-quality sound experience when using headphones is of great significance. Normally this experience is improved using personalized head-related transfer functions (HRTFs) that depend on a large degree of personal anthropometric information on pinnae. Most of the studies focus their personal auditory optimization analysis on the study of amplitude versus frequency on HRTFs, mainly in the search for significant elevation cues of frequency maps. Therefore, knowing the HRTFs of each individual is of considerable help to improve sound quality. The following work proposes a methodology to model HRTFs according to the individual structure of pinnae using multilayer perceptron and linear regression techniques. It is proposed to generate several models that allow knowing HRTFs amplitude for each frequency based on the personal anthropometric data on pinnae, the azimuth angle, and the elevation of the sound source, thus predicting frequency magnitudes. Experiments show that the prediction of new personal HRTF generates low errors, thus this model can be applied to new heads with different pinnae characteristics with high confidence. Improving the results obtained with the standard KEMAR pinna, usually used in cases where there is a lack of information.

查看原文本刊更多论文

利用机器学习方法预测头部相关传递函数

在使用耳机时，生成一个虚拟的、个人的听觉空间以获得高质量的声音体验具有重要意义。通常，这种体验是通过使用个性化的头部相关传递函数（HRTF）来改善的，该传递函数依赖于耳廓上的大量个人人体测量信息。大多数研究将其个人听觉优化分析集中在HRTF上振幅与频率的研究上，主要是在搜索频率图的显著仰角线索。因此，了解每个人的HRTF对提高音质有很大帮助。以下工作提出了一种使用多层感知器和线性回归技术根据耳廓的个体结构对HRTF进行建模的方法。提出了基于耳廓的个人人体测量数据、方位角和声源的仰角来生成几个模型，这些模型允许知道每个频率的HRTF幅度，从而预测频率幅度。实验表明，新的个人HRTF的预测误差较小，因此该模型可以高置信度地应用于具有不同耳廓特征的新型头部。改进使用标准KEMAR耳廓获得的结果，通常用于缺乏信息的情况。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊