{"title":"Sound Source Localization Based on Convolutional Neural Network","authors":"Chen Peng, Niu Changliu","doi":"10.1109/CCET55412.2022.9906394","DOIUrl":null,"url":null,"abstract":"As one of the important ways to transmit and receive information, sound contains a lot of information, and the application scope of sound covers all aspects of life, because the transmission of sound has directionality. In this paper, a room impulse response (RIR) model is established based on the Mono and Multichannel Recording Database (SMARD), and the sound features are extracted using the generalized cross-correlation phase transformation (GCC-PHAT) algorithm. Based on this model, the sound source localization based on convolutional neural network (CNN) is studied. With the help of the microphone array, the system collects randomly generated sound source signals in confined space, and then transmits the collected sound source signals to the CNN model for training. Classify the signal using a well-trained model and finally get the location of the sound source. Experimental results show that the CNN-based sound source localization algorithm has high positioning accuracy under different reverberation conditions and different signal-to-noise ratio environments.","PeriodicalId":329327,"journal":{"name":"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCET55412.2022.9906394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
As one of the important ways to transmit and receive information, sound contains a lot of information, and the application scope of sound covers all aspects of life, because the transmission of sound has directionality. In this paper, a room impulse response (RIR) model is established based on the Mono and Multichannel Recording Database (SMARD), and the sound features are extracted using the generalized cross-correlation phase transformation (GCC-PHAT) algorithm. Based on this model, the sound source localization based on convolutional neural network (CNN) is studied. With the help of the microphone array, the system collects randomly generated sound source signals in confined space, and then transmits the collected sound source signals to the CNN model for training. Classify the signal using a well-trained model and finally get the location of the sound source. Experimental results show that the CNN-based sound source localization algorithm has high positioning accuracy under different reverberation conditions and different signal-to-noise ratio environments.