{"title":"基于双微阵列和深度学习的声源定位方法","authors":"P. Su, Qingning Zeng, Chao Long","doi":"10.1117/12.2655183","DOIUrl":null,"url":null,"abstract":"In order to improve the localization accuracy in complex environments, a sound source localization method based on dual microarrays (DMA) and deep learning is studied. Generalized cross correlation-phase transform (GCCPHAT) sequence and the maximum value information of the sequence are used as localization cues, the three-dimensional coordinates of the sound source are used as the output of the network, and the mapping rules from input features to output are learned through the improved CNN network based on VGG16 network structure (referred to as V_CNN for short). Through simulation experiments, the sound source localization method based on circular array and V_CNN, the sound source localization method based on dual microarrays and ordinary convolutional neural network (CNN), and the sound source localization method based on dual microarrays and V_CNN are compared. The experimental results show that the sound source localization method in this paper has high localization accuracy under different noise and reverberation environments.","PeriodicalId":105577,"journal":{"name":"International Conference on Signal Processing and Communication Security","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sound source localization method based on dual microarrays and deep learning\",\"authors\":\"P. Su, Qingning Zeng, Chao Long\",\"doi\":\"10.1117/12.2655183\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the localization accuracy in complex environments, a sound source localization method based on dual microarrays (DMA) and deep learning is studied. Generalized cross correlation-phase transform (GCCPHAT) sequence and the maximum value information of the sequence are used as localization cues, the three-dimensional coordinates of the sound source are used as the output of the network, and the mapping rules from input features to output are learned through the improved CNN network based on VGG16 network structure (referred to as V_CNN for short). Through simulation experiments, the sound source localization method based on circular array and V_CNN, the sound source localization method based on dual microarrays and ordinary convolutional neural network (CNN), and the sound source localization method based on dual microarrays and V_CNN are compared. The experimental results show that the sound source localization method in this paper has high localization accuracy under different noise and reverberation environments.\",\"PeriodicalId\":105577,\"journal\":{\"name\":\"International Conference on Signal Processing and Communication Security\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Signal Processing and Communication Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2655183\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Signal Processing and Communication Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2655183","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Sound source localization method based on dual microarrays and deep learning
In order to improve the localization accuracy in complex environments, a sound source localization method based on dual microarrays (DMA) and deep learning is studied. Generalized cross correlation-phase transform (GCCPHAT) sequence and the maximum value information of the sequence are used as localization cues, the three-dimensional coordinates of the sound source are used as the output of the network, and the mapping rules from input features to output are learned through the improved CNN network based on VGG16 network structure (referred to as V_CNN for short). Through simulation experiments, the sound source localization method based on circular array and V_CNN, the sound source localization method based on dual microarrays and ordinary convolutional neural network (CNN), and the sound source localization method based on dual microarrays and V_CNN are compared. The experimental results show that the sound source localization method in this paper has high localization accuracy under different noise and reverberation environments.