基于全卷积网络的声矢量传感器DOA估计

2021 IEEE Workshop on Signal Processing Systems (SiPS) Pub Date : 2021-10-01 DOI:10.1109/SiPS52927.2021.00014

Sifan Wang, J. Geng, Xin Lou

{"title":"基于全卷积网络的声矢量传感器DOA估计","authors":"Sifan Wang, J. Geng, Xin Lou","doi":"10.1109/SiPS52927.2021.00014","DOIUrl":null,"url":null,"abstract":"In this paper, a learning-based direction of arrival (DOA) estimation pipeline for acoustic vector sensor (AVS) is proposed. In the proposed pipeline, a fully convolutional network (FCN) is introduced for uncontaminated time-frequency (TF) point extraction, which is a crucial step for AVS-based DOA estimation. Unlike conventional direct path dominant (DPD) or single source points (SSP) detection, the uncontaminated TF point extraction problem is modeled as an image segmentation problem, where the direct DOA cues from the spatial response of AVS is utilized for ground truth labeling to generate the training data of the network. With the extracted uncontaminated TF points, the final DOA can be generated using the proposed fuzzy geometric median (FGM) clustering. Simulation results show that the proposed pipeline is capable of improving the accuracy in the cases of small angular difference between acoustic sources and improving robustness in strong reverberation and noise situations.","PeriodicalId":103894,"journal":{"name":"2021 IEEE Workshop on Signal Processing Systems (SiPS)","volume":"292 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Fully Convolutional Network-Based DOA Estimation with Acoustic Vector Sensor\",\"authors\":\"Sifan Wang, J. Geng, Xin Lou\",\"doi\":\"10.1109/SiPS52927.2021.00014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a learning-based direction of arrival (DOA) estimation pipeline for acoustic vector sensor (AVS) is proposed. In the proposed pipeline, a fully convolutional network (FCN) is introduced for uncontaminated time-frequency (TF) point extraction, which is a crucial step for AVS-based DOA estimation. Unlike conventional direct path dominant (DPD) or single source points (SSP) detection, the uncontaminated TF point extraction problem is modeled as an image segmentation problem, where the direct DOA cues from the spatial response of AVS is utilized for ground truth labeling to generate the training data of the network. With the extracted uncontaminated TF points, the final DOA can be generated using the proposed fuzzy geometric median (FGM) clustering. Simulation results show that the proposed pipeline is capable of improving the accuracy in the cases of small angular difference between acoustic sources and improving robustness in strong reverberation and noise situations.\",\"PeriodicalId\":103894,\"journal\":{\"name\":\"2021 IEEE Workshop on Signal Processing Systems (SiPS)\",\"volume\":\"292 4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Workshop on Signal Processing Systems (SiPS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SiPS52927.2021.00014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Workshop on Signal Processing Systems (SiPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SiPS52927.2021.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

提出了一种基于学习的声矢量传感器到达方向(DOA)估计管道。在该管道中，引入了一种全卷积网络(FCN)来进行无污染的时频(TF)点提取，这是基于avs的DOA估计的关键步骤。与传统的直接路径主导(DPD)或单源点(SSP)检测不同，未污染TF点提取问题被建模为图像分割问题，其中利用AVS空间响应的直接DOA线索进行地面真值标记以生成网络的训练数据。利用提取的未受污染的TF点，利用所提出的模糊几何中位数聚类方法生成最终的DOA。仿真结果表明，该方法在声源角差较小的情况下能够提高精度，在强混响和强噪声情况下能够提高鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Fully Convolutional Network-Based DOA Estimation with Acoustic Vector Sensor

In this paper, a learning-based direction of arrival (DOA) estimation pipeline for acoustic vector sensor (AVS) is proposed. In the proposed pipeline, a fully convolutional network (FCN) is introduced for uncontaminated time-frequency (TF) point extraction, which is a crucial step for AVS-based DOA estimation. Unlike conventional direct path dominant (DPD) or single source points (SSP) detection, the uncontaminated TF point extraction problem is modeled as an image segmentation problem, where the direct DOA cues from the spatial response of AVS is utilized for ground truth labeling to generate the training data of the network. With the extracted uncontaminated TF points, the final DOA can be generated using the proposed fuzzy geometric median (FGM) clustering. Simulation results show that the proposed pipeline is capable of improving the accuracy in the cases of small angular difference between acoustic sources and improving robustness in strong reverberation and noise situations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE Workshop on Signal Processing Systems (SiPS)

自引率

0.00%

发文量