{"title":"Convolutional Neural Network-based Direction-of-Arrival Estimation using Stereo Microphones for Drone","authors":"Jeonghwan Choi, Joon‐Hyuk Chang","doi":"10.1109/ICEIC49074.2020.9051364","DOIUrl":null,"url":null,"abstract":"Recently, with the development of drone technology, various researches applying drones have been conducted. Among them, sound source localization for the drone is an important research topic because it can be utilized to find the person in an emergency. However, microphones mounted on the drone capture the ego-noise such as wind and fan noise generated from propellers, resulting in the extremely low signal-to-ratio condition. In this paper, we propose a method of direction-of-arrival (DOA) estimation using stereo microphones for the drone. To cope with the ego-noise, parametric multi-channel Wiener filter is used. After that, power level-based features are fed into the convolutional neural network model to classify the DOA of the desired speech signal. To evaluate our method, we used mean square error between the estimate and reference DOA as a metric. In our experiments, we recorded all utterances used to train and test in a real environment, and the result showed that the proposed system can be used to localize the sound source for the drone.","PeriodicalId":271345,"journal":{"name":"2020 International Conference on Electronics, Information, and Communication (ICEIC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Electronics, Information, and Communication (ICEIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEIC49074.2020.9051364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Recently, with the development of drone technology, various researches applying drones have been conducted. Among them, sound source localization for the drone is an important research topic because it can be utilized to find the person in an emergency. However, microphones mounted on the drone capture the ego-noise such as wind and fan noise generated from propellers, resulting in the extremely low signal-to-ratio condition. In this paper, we propose a method of direction-of-arrival (DOA) estimation using stereo microphones for the drone. To cope with the ego-noise, parametric multi-channel Wiener filter is used. After that, power level-based features are fed into the convolutional neural network model to classify the DOA of the desired speech signal. To evaluate our method, we used mean square error between the estimate and reference DOA as a metric. In our experiments, we recorded all utterances used to train and test in a real environment, and the result showed that the proposed system can be used to localize the sound source for the drone.