Yunan Wang , Dingding Yao , Zhi Zhou , Daocheng Chen , Wenquan Feng , Junfeng Li
{"title":"Effects of spectral peaks and notches in head-related transfer function on median plane sound localization with dynamic binaural playback","authors":"Yunan Wang , Dingding Yao , Zhi Zhou , Daocheng Chen , Wenquan Feng , Junfeng Li","doi":"10.1016/j.apacoust.2024.110424","DOIUrl":null,"url":null,"abstract":"<div><div>Spectral peaks and notches in the head-related transfer function (HRTF) are considered pivotal for elevation perception in virtual auditory displays (VAD), especially during static binaural signal playback. However, studies on dynamic binaural signal playback in VAD have shown that the auditory system can still utilize dynamic cues for elevation localization, even when these high-frequency spectral components are missing, although this may compromise localization accuracy. This study investigated the effects of spectral peaks and notches in dynamic playback, examining how distorting these features and their contrasts at various levels (33%, 66%, and 100% removal) influenced elevation localization along different rotational axes (yaw and pitch rotation). The results revealed that at the same distortion level, the impact of these features on median plane localization decreased sequentially from spectral contrast, to peaks, to notches. At a distortion level of 33%, notch removal enabled dynamic playback results that were not significantly different from control conditions. As distortion levels increased to 66% and 100%, localization performance progressively deteriorated, including increased localization errors and up-down confusion with head yaw rotation as well as front-back and up-down confusion with head pitch rotation. Simultaneously, localization performance with head pitch rotation exhibited poorer performance compared to yaw rotation, particularly in cases involving peak removal and contrast compression. The experimental results further revealed that auditory elevation localization benefits from multiple localization cues generated by head movements, including dynamic spectral cues produced during large head rotations when all spectral cues are available or distorted at a level of 33%.</div></div>","PeriodicalId":55506,"journal":{"name":"Applied Acoustics","volume":"230 ","pages":"Article 110424"},"PeriodicalIF":3.4000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Acoustics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003682X24005759","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Spectral peaks and notches in the head-related transfer function (HRTF) are considered pivotal for elevation perception in virtual auditory displays (VAD), especially during static binaural signal playback. However, studies on dynamic binaural signal playback in VAD have shown that the auditory system can still utilize dynamic cues for elevation localization, even when these high-frequency spectral components are missing, although this may compromise localization accuracy. This study investigated the effects of spectral peaks and notches in dynamic playback, examining how distorting these features and their contrasts at various levels (33%, 66%, and 100% removal) influenced elevation localization along different rotational axes (yaw and pitch rotation). The results revealed that at the same distortion level, the impact of these features on median plane localization decreased sequentially from spectral contrast, to peaks, to notches. At a distortion level of 33%, notch removal enabled dynamic playback results that were not significantly different from control conditions. As distortion levels increased to 66% and 100%, localization performance progressively deteriorated, including increased localization errors and up-down confusion with head yaw rotation as well as front-back and up-down confusion with head pitch rotation. Simultaneously, localization performance with head pitch rotation exhibited poorer performance compared to yaw rotation, particularly in cases involving peak removal and contrast compression. The experimental results further revealed that auditory elevation localization benefits from multiple localization cues generated by head movements, including dynamic spectral cues produced during large head rotations when all spectral cues are available or distorted at a level of 33%.
期刊介绍:
Since its launch in 1968, Applied Acoustics has been publishing high quality research papers providing state-of-the-art coverage of research findings for engineers and scientists involved in applications of acoustics in the widest sense.
Applied Acoustics looks not only at recent developments in the understanding of acoustics but also at ways of exploiting that understanding. The Journal aims to encourage the exchange of practical experience through publication and in so doing creates a fund of technological information that can be used for solving related problems. The presentation of information in graphical or tabular form is especially encouraged. If a report of a mathematical development is a necessary part of a paper it is important to ensure that it is there only as an integral part of a practical solution to a problem and is supported by data. Applied Acoustics encourages the exchange of practical experience in the following ways: • Complete Papers • Short Technical Notes • Review Articles; and thereby provides a wealth of technological information that can be used to solve related problems.
Manuscripts that address all fields of applications of acoustics ranging from medicine and NDT to the environment and buildings are welcome.