{"title":"基于全卷积网络的声矢量传感器DOA估计","authors":"Sifan Wang, J. Geng, Xin Lou","doi":"10.1109/SiPS52927.2021.00014","DOIUrl":null,"url":null,"abstract":"In this paper, a learning-based direction of arrival (DOA) estimation pipeline for acoustic vector sensor (AVS) is proposed. In the proposed pipeline, a fully convolutional network (FCN) is introduced for uncontaminated time-frequency (TF) point extraction, which is a crucial step for AVS-based DOA estimation. Unlike conventional direct path dominant (DPD) or single source points (SSP) detection, the uncontaminated TF point extraction problem is modeled as an image segmentation problem, where the direct DOA cues from the spatial response of AVS is utilized for ground truth labeling to generate the training data of the network. With the extracted uncontaminated TF points, the final DOA can be generated using the proposed fuzzy geometric median (FGM) clustering. Simulation results show that the proposed pipeline is capable of improving the accuracy in the cases of small angular difference between acoustic sources and improving robustness in strong reverberation and noise situations.","PeriodicalId":103894,"journal":{"name":"2021 IEEE Workshop on Signal Processing Systems (SiPS)","volume":"292 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Fully Convolutional Network-Based DOA Estimation with Acoustic Vector Sensor\",\"authors\":\"Sifan Wang, J. Geng, Xin Lou\",\"doi\":\"10.1109/SiPS52927.2021.00014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a learning-based direction of arrival (DOA) estimation pipeline for acoustic vector sensor (AVS) is proposed. In the proposed pipeline, a fully convolutional network (FCN) is introduced for uncontaminated time-frequency (TF) point extraction, which is a crucial step for AVS-based DOA estimation. Unlike conventional direct path dominant (DPD) or single source points (SSP) detection, the uncontaminated TF point extraction problem is modeled as an image segmentation problem, where the direct DOA cues from the spatial response of AVS is utilized for ground truth labeling to generate the training data of the network. With the extracted uncontaminated TF points, the final DOA can be generated using the proposed fuzzy geometric median (FGM) clustering. Simulation results show that the proposed pipeline is capable of improving the accuracy in the cases of small angular difference between acoustic sources and improving robustness in strong reverberation and noise situations.\",\"PeriodicalId\":103894,\"journal\":{\"name\":\"2021 IEEE Workshop on Signal Processing Systems (SiPS)\",\"volume\":\"292 4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Workshop on Signal Processing Systems (SiPS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SiPS52927.2021.00014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Workshop on Signal Processing Systems (SiPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SiPS52927.2021.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fully Convolutional Network-Based DOA Estimation with Acoustic Vector Sensor
In this paper, a learning-based direction of arrival (DOA) estimation pipeline for acoustic vector sensor (AVS) is proposed. In the proposed pipeline, a fully convolutional network (FCN) is introduced for uncontaminated time-frequency (TF) point extraction, which is a crucial step for AVS-based DOA estimation. Unlike conventional direct path dominant (DPD) or single source points (SSP) detection, the uncontaminated TF point extraction problem is modeled as an image segmentation problem, where the direct DOA cues from the spatial response of AVS is utilized for ground truth labeling to generate the training data of the network. With the extracted uncontaminated TF points, the final DOA can be generated using the proposed fuzzy geometric median (FGM) clustering. Simulation results show that the proposed pipeline is capable of improving the accuracy in the cases of small angular difference between acoustic sources and improving robustness in strong reverberation and noise situations.