基于双微阵列和深度学习的声源定位方法

International Conference on Signal Processing and Communication Security Pub Date : 2022-11-02 DOI:10.1117/12.2655183

P. Su, Qingning Zeng, Chao Long

{"title":"基于双微阵列和深度学习的声源定位方法","authors":"P. Su, Qingning Zeng, Chao Long","doi":"10.1117/12.2655183","DOIUrl":null,"url":null,"abstract":"In order to improve the localization accuracy in complex environments, a sound source localization method based on dual microarrays (DMA) and deep learning is studied. Generalized cross correlation-phase transform (GCCPHAT) sequence and the maximum value information of the sequence are used as localization cues, the three-dimensional coordinates of the sound source are used as the output of the network, and the mapping rules from input features to output are learned through the improved CNN network based on VGG16 network structure (referred to as V_CNN for short). Through simulation experiments, the sound source localization method based on circular array and V_CNN, the sound source localization method based on dual microarrays and ordinary convolutional neural network (CNN), and the sound source localization method based on dual microarrays and V_CNN are compared. The experimental results show that the sound source localization method in this paper has high localization accuracy under different noise and reverberation environments.","PeriodicalId":105577,"journal":{"name":"International Conference on Signal Processing and Communication Security","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sound source localization method based on dual microarrays and deep learning\",\"authors\":\"P. Su, Qingning Zeng, Chao Long\",\"doi\":\"10.1117/12.2655183\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the localization accuracy in complex environments, a sound source localization method based on dual microarrays (DMA) and deep learning is studied. Generalized cross correlation-phase transform (GCCPHAT) sequence and the maximum value information of the sequence are used as localization cues, the three-dimensional coordinates of the sound source are used as the output of the network, and the mapping rules from input features to output are learned through the improved CNN network based on VGG16 network structure (referred to as V_CNN for short). Through simulation experiments, the sound source localization method based on circular array and V_CNN, the sound source localization method based on dual microarrays and ordinary convolutional neural network (CNN), and the sound source localization method based on dual microarrays and V_CNN are compared. The experimental results show that the sound source localization method in this paper has high localization accuracy under different noise and reverberation environments.\",\"PeriodicalId\":105577,\"journal\":{\"name\":\"International Conference on Signal Processing and Communication Security\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Signal Processing and Communication Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2655183\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Signal Processing and Communication Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2655183","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了提高复杂环境下的声源定位精度，研究了一种基于双微阵列(DMA)和深度学习的声源定位方法。以GCCPHAT序列和序列的最大值信息作为定位线索，以声源的三维坐标作为网络的输出，通过基于VGG16网络结构的改进CNN网络(简称V_CNN)学习输入特征到输出的映射规则。通过仿真实验，对基于圆形阵列和V_CNN的声源定位方法、基于双微阵列和普通卷积神经网络(CNN)的声源定位方法以及基于双微阵列和V_CNN的声源定位方法进行了比较。实验结果表明，本文提出的声源定位方法在不同噪声和混响环境下具有较高的定位精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Sound source localization method based on dual microarrays and deep learning

In order to improve the localization accuracy in complex environments, a sound source localization method based on dual microarrays (DMA) and deep learning is studied. Generalized cross correlation-phase transform (GCCPHAT) sequence and the maximum value information of the sequence are used as localization cues, the three-dimensional coordinates of the sound source are used as the output of the network, and the mapping rules from input features to output are learned through the improved CNN network based on VGG16 network structure (referred to as V_CNN for short). Through simulation experiments, the sound source localization method based on circular array and V_CNN, the sound source localization method based on dual microarrays and ordinary convolutional neural network (CNN), and the sound source localization method based on dual microarrays and V_CNN are compared. The experimental results show that the sound source localization method in this paper has high localization accuracy under different noise and reverberation environments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference on Signal Processing and Communication Security

自引率

0.00%

发文量