Speaker Localization in Smartphones using Adaptive Eigenvalue Decomposition with Noise Reduction

J. M. Mendoza, Franz A. de Leon
Published in: TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON), 2021-12-07
DOI: 10.1109/TENCON54134.2021.9707231 (https://doi.org/10.1109/TENCON54134.2021.9707231)

Abstract

Most smartphones are dual-microphone devices capable of determining the direction of arrival of an utterance from a speaker. The widespread use of such devices can help improve hearing aid systems without added expense. Two-sensor sound source localization (SSL) systems of this kind rely on time delay estimation (TDE) techniques such as cross-correlation and adaptive eigenvalue decomposition (AED). The former is unreliable under reverberation, while the latter degrades in background noise. In this paper, we examine the effect of integrating a noise reduction algorithm into AED for SSL applications. Given the robustness of AED to room reverberation, we expect TDE performance to improve when it operates on noise-reduced signals. The minimum mean-square error with decision-directed (MMSE-DD) noise estimation algorithm acts as a filter for the received signals. We propose $\text{MMSE-DD}+\text{AED}$ as an SSL algorithm for poor environmental conditions. In our experiments the system reached an accuracy of 69.87%, which is a significant improvement over previous SSL algorithms on smartphones. Furthermore, a tilt compensation solution raised the accuracy to 79.28%, addressing the dynamic behavior of the device's built-in microphones.
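
To make the two-microphone idea concrete, the following is a minimal sketch of the cross-correlation baseline mentioned in the abstract, not the MMSE-DD+AED method itself. It estimates the inter-microphone delay from the peak of the cross-correlation and converts it to an angle under a far-field assumption. The function names, the speed of sound, the sampling rate, and the microphone spacing are all illustrative assumptions and do not come from the paper.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value, not from the paper


def estimate_delay(x1, x2, fs, max_delay_s):
    """Estimate the delay of x2 relative to x1 in seconds (positive if x2 lags).

    Hypothetical helper: plain time-domain cross-correlation, searched only
    over lags that are physically possible for the given microphone spacing.
    """
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    # corr[k] = sum_n x2[n + k] * x1[n]; the peak sits at the lag of x2.
    corr = np.correlate(x2, x1, mode="full")
    lags = np.arange(-(len(x1) - 1), len(x2))
    max_lag = int(round(max_delay_s * fs))
    mask = np.abs(lags) <= max_lag
    best_lag = lags[mask][np.argmax(corr[mask])]
    return best_lag / fs


def delay_to_angle(delay_s, mic_spacing_m):
    """Convert a delay to a direction of arrival in degrees from broadside.

    Assumes a far-field source, so sin(theta) = c * tau / d.
    """
    ratio = np.clip(SPEED_OF_SOUND * delay_s / mic_spacing_m, -1.0, 1.0)
    return float(np.degrees(np.arcsin(ratio)))


if __name__ == "__main__":
    fs = 48_000          # Hz, assumed sampling rate
    spacing = 0.14       # m, assumed distance between the two microphones
    rng = np.random.default_rng(0)
    source = rng.standard_normal(4096)
    mic1 = source
    mic2 = np.concatenate([np.zeros(3), source[:-3]])  # mic2 lags by 3 samples
    tau = estimate_delay(mic1, mic2, fs, max_delay_s=spacing / SPEED_OF_SOUND)
    print(tau * fs, delay_to_angle(tau, spacing))
```

This baseline is the one the abstract describes as unreliable under reverberation: reflections add secondary correlation peaks that can exceed the direct-path peak. AED instead estimates the two room impulse responses and reads the delay from their direct-path peaks, which is why the authors pair it with a noise-reduction front end rather than replacing it.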