Sound source localization method based on dual microarrays and deep learning

P. Su, Qingning Zeng, Chao Long
{"title":"Sound source localization method based on dual microarrays and deep learning","authors":"P. Su, Qingning Zeng, Chao Long","doi":"10.1117/12.2655183","DOIUrl":null,"url":null,"abstract":"In order to improve the localization accuracy in complex environments, a sound source localization method based on dual microarrays (DMA) and deep learning is studied. Generalized cross correlation-phase transform (GCCPHAT) sequence and the maximum value information of the sequence are used as localization cues, the three-dimensional coordinates of the sound source are used as the output of the network, and the mapping rules from input features to output are learned through the improved CNN network based on VGG16 network structure (referred to as V_CNN for short). Through simulation experiments, the sound source localization method based on circular array and V_CNN, the sound source localization method based on dual microarrays and ordinary convolutional neural network (CNN), and the sound source localization method based on dual microarrays and V_CNN are compared. The experimental results show that the sound source localization method in this paper has high localization accuracy under different noise and reverberation environments.","PeriodicalId":105577,"journal":{"name":"International Conference on Signal Processing and Communication Security","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Signal Processing and Communication Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2655183","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In order to improve the localization accuracy in complex environments, a sound source localization method based on dual microarrays (DMA) and deep learning is studied. Generalized cross correlation-phase transform (GCCPHAT) sequence and the maximum value information of the sequence are used as localization cues, the three-dimensional coordinates of the sound source are used as the output of the network, and the mapping rules from input features to output are learned through the improved CNN network based on VGG16 network structure (referred to as V_CNN for short). Through simulation experiments, the sound source localization method based on circular array and V_CNN, the sound source localization method based on dual microarrays and ordinary convolutional neural network (CNN), and the sound source localization method based on dual microarrays and V_CNN are compared. The experimental results show that the sound source localization method in this paper has high localization accuracy under different noise and reverberation environments.
基于双微阵列和深度学习的声源定位方法
为了提高复杂环境下的声源定位精度,研究了一种基于双微阵列(DMA)和深度学习的声源定位方法。以GCCPHAT序列和序列的最大值信息作为定位线索,以声源的三维坐标作为网络的输出,通过基于VGG16网络结构的改进CNN网络(简称V_CNN)学习输入特征到输出的映射规则。通过仿真实验,对基于圆形阵列和V_CNN的声源定位方法、基于双微阵列和普通卷积神经网络(CNN)的声源定位方法以及基于双微阵列和V_CNN的声源定位方法进行了比较。实验结果表明,本文提出的声源定位方法在不同噪声和混响环境下具有较高的定位精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信