{"title":"深度神经网络在声波测向中的研究进展","authors":"Mohiz Ahmad, Muhammad Muaz, M. Adeel","doi":"10.1109/ICoDT252288.2021.9441527","DOIUrl":null,"url":null,"abstract":"Direction of Arrival (DoA) estimation has importance in many industries such as speech enhancement, spatial audio coding, radio frequency and radio telescope. Deep Neural Network (DNN) has find its way into DoA applications along with the well-known methods such as subspace-based or time difference of arrival methods, which opens-up the data-driven approach towards estimating the DoA. This paper first surveys different DNN architectures and their supporting methods and datasets that are used for estimating DoA in different scenarios. Then a promising architecture based on convolutional recurrent neural network (CRNN) is re-presented on the Spatially Oriented Format for Acoustics (SOFA) dataset, where the average error rate of 9.68° has been achieved.","PeriodicalId":207832,"journal":{"name":"2021 International Conference on Digital Futures and Transformative Technologies (ICoDT2)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"A Survey of Deep Neural Network in Acoustic Direction Finding\",\"authors\":\"Mohiz Ahmad, Muhammad Muaz, M. Adeel\",\"doi\":\"10.1109/ICoDT252288.2021.9441527\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Direction of Arrival (DoA) estimation has importance in many industries such as speech enhancement, spatial audio coding, radio frequency and radio telescope. Deep Neural Network (DNN) has find its way into DoA applications along with the well-known methods such as subspace-based or time difference of arrival methods, which opens-up the data-driven approach towards estimating the DoA. This paper first surveys different DNN architectures and their supporting methods and datasets that are used for estimating DoA in different scenarios. Then a promising architecture based on convolutional recurrent neural network (CRNN) is re-presented on the Spatially Oriented Format for Acoustics (SOFA) dataset, where the average error rate of 9.68° has been achieved.\",\"PeriodicalId\":207832,\"journal\":{\"name\":\"2021 International Conference on Digital Futures and Transformative Technologies (ICoDT2)\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Digital Futures and Transformative Technologies (ICoDT2)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICoDT252288.2021.9441527\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Digital Futures and Transformative Technologies (ICoDT2)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICoDT252288.2021.9441527","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Survey of Deep Neural Network in Acoustic Direction Finding
Direction of Arrival (DoA) estimation has importance in many industries such as speech enhancement, spatial audio coding, radio frequency and radio telescope. Deep Neural Network (DNN) has find its way into DoA applications along with the well-known methods such as subspace-based or time difference of arrival methods, which opens-up the data-driven approach towards estimating the DoA. This paper first surveys different DNN architectures and their supporting methods and datasets that are used for estimating DoA in different scenarios. Then a promising architecture based on convolutional recurrent neural network (CRNN) is re-presented on the Spatially Oriented Format for Acoustics (SOFA) dataset, where the average error rate of 9.68° has been achieved.