{"title":"基于人体测量学和KEMAR系数的深度神经网络过渡段建模","authors":"Saif S. Alotaibi, M. Wickert","doi":"10.1109/GlobalSIP45357.2019.8969348","DOIUrl":null,"url":null,"abstract":"ITD and ILD, versus source arrival direction, serve as essential binaural cues for spatial hearing. Individualized ITD and ILD can be used to render better 3D audio than a non-individualized one. Due to the correlation between ITD and some anthropometric features, machine learning, such as principal component analysis (PCA) and deep neural networks (DNNs), have become important methods to deploy individualized ITDs. The available measured ITDs do not match the exact sound source directions. An ITD correction method will be presented to overcome the irregularities that occurr due to subject head movements during database creation measurements. KEMAR’s ITD coefficients are utilized to correct the misplacement of a subject’s ITD. DNNs are used to obtain a new subject’s ITD for 1250 different azimuth and elevation angles. Mean absolute error (MAE) is used to compare the proposed ITD model with the available analytical models.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ITD Modeling Based on Anthropometrics and KEMAR Coefficients Using Deep Neural Networks\",\"authors\":\"Saif S. Alotaibi, M. Wickert\",\"doi\":\"10.1109/GlobalSIP45357.2019.8969348\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ITD and ILD, versus source arrival direction, serve as essential binaural cues for spatial hearing. Individualized ITD and ILD can be used to render better 3D audio than a non-individualized one. Due to the correlation between ITD and some anthropometric features, machine learning, such as principal component analysis (PCA) and deep neural networks (DNNs), have become important methods to deploy individualized ITDs. The available measured ITDs do not match the exact sound source directions. An ITD correction method will be presented to overcome the irregularities that occurr due to subject head movements during database creation measurements. KEMAR’s ITD coefficients are utilized to correct the misplacement of a subject’s ITD. DNNs are used to obtain a new subject’s ITD for 1250 different azimuth and elevation angles. Mean absolute error (MAE) is used to compare the proposed ITD model with the available analytical models.\",\"PeriodicalId\":221378,\"journal\":{\"name\":\"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GlobalSIP45357.2019.8969348\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GlobalSIP45357.2019.8969348","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
ITD Modeling Based on Anthropometrics and KEMAR Coefficients Using Deep Neural Networks
ITD and ILD, versus source arrival direction, serve as essential binaural cues for spatial hearing. Individualized ITD and ILD can be used to render better 3D audio than a non-individualized one. Due to the correlation between ITD and some anthropometric features, machine learning, such as principal component analysis (PCA) and deep neural networks (DNNs), have become important methods to deploy individualized ITDs. The available measured ITDs do not match the exact sound source directions. An ITD correction method will be presented to overcome the irregularities that occurr due to subject head movements during database creation measurements. KEMAR’s ITD coefficients are utilized to correct the misplacement of a subject’s ITD. DNNs are used to obtain a new subject’s ITD for 1250 different azimuth and elevation angles. Mean absolute error (MAE) is used to compare the proposed ITD model with the available analytical models.