{"title":"基于学习的共素圆共形传声器阵列鲁棒DOA估计方法","authors":"Raj Prakash Gohil, Gyanajyoti Routray, R. Hegde","doi":"10.1109/NCC52529.2021.9530130","DOIUrl":null,"url":null,"abstract":"Sound source localization in 1-Dimensional (1D) and 2-Dimensional (2D) is one of the most familiar problems in signal processing. Various types of microphone arrays and their geometry have been explored to find an optimal solution to this problem. The problem becomes more challenging for a reverberate and noisy environment. Localization of the source both in the azimuth and elevation increases the complexity further. In this paper, a convolutional neural network (CNN) based learning approach has been proposed to estimate the primary source in 2D space. Further, a noble co-prime circular conformal microphone array (C3MA) geometry has been developed for sound acquisition. The generalized cross-correlation with phase transform (GCC-PHAT)features have been extracted from the C3MA recordings, which are the input features for training purposes. The experimental results show that the learning-based estimation is more robust compared to the conventional signal processing approach. The learning-based approach also explores the GCC-PHAT features and can be adapted in an adverse acoustic environment. The performance of the proposed algorithm shows significant improvement in the root mean squared error (RMSE) and mean absolute error (MAE) scores compared to the available state-of-art methods.","PeriodicalId":414087,"journal":{"name":"2021 National Conference on Communications (NCC)","volume":"256 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Learning Based Method for Robust DOA Estimation using Co-prime Circular Conformal Microphone Array\",\"authors\":\"Raj Prakash Gohil, Gyanajyoti Routray, R. Hegde\",\"doi\":\"10.1109/NCC52529.2021.9530130\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sound source localization in 1-Dimensional (1D) and 2-Dimensional (2D) is one of the most familiar problems in signal processing. Various types of microphone arrays and their geometry have been explored to find an optimal solution to this problem. The problem becomes more challenging for a reverberate and noisy environment. Localization of the source both in the azimuth and elevation increases the complexity further. In this paper, a convolutional neural network (CNN) based learning approach has been proposed to estimate the primary source in 2D space. Further, a noble co-prime circular conformal microphone array (C3MA) geometry has been developed for sound acquisition. The generalized cross-correlation with phase transform (GCC-PHAT)features have been extracted from the C3MA recordings, which are the input features for training purposes. The experimental results show that the learning-based estimation is more robust compared to the conventional signal processing approach. The learning-based approach also explores the GCC-PHAT features and can be adapted in an adverse acoustic environment. The performance of the proposed algorithm shows significant improvement in the root mean squared error (RMSE) and mean absolute error (MAE) scores compared to the available state-of-art methods.\",\"PeriodicalId\":414087,\"journal\":{\"name\":\"2021 National Conference on Communications (NCC)\",\"volume\":\"256 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC52529.2021.9530130\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC52529.2021.9530130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Learning Based Method for Robust DOA Estimation using Co-prime Circular Conformal Microphone Array
Sound source localization in 1-Dimensional (1D) and 2-Dimensional (2D) is one of the most familiar problems in signal processing. Various types of microphone arrays and their geometry have been explored to find an optimal solution to this problem. The problem becomes more challenging for a reverberate and noisy environment. Localization of the source both in the azimuth and elevation increases the complexity further. In this paper, a convolutional neural network (CNN) based learning approach has been proposed to estimate the primary source in 2D space. Further, a noble co-prime circular conformal microphone array (C3MA) geometry has been developed for sound acquisition. The generalized cross-correlation with phase transform (GCC-PHAT)features have been extracted from the C3MA recordings, which are the input features for training purposes. The experimental results show that the learning-based estimation is more robust compared to the conventional signal processing approach. The learning-based approach also explores the GCC-PHAT features and can be adapted in an adverse acoustic environment. The performance of the proposed algorithm shows significant improvement in the root mean squared error (RMSE) and mean absolute error (MAE) scores compared to the available state-of-art methods.