Yaqin Wang, Zhiwei Chu, Ilmun Ku, E. C. Smith, E. Matson
{"title":"A Large-Scale UAV Audio Dataset and Audio-Based UAV Classification Using CNN","authors":"Yaqin Wang, Zhiwei Chu, Ilmun Ku, E. C. Smith, E. Matson","doi":"10.1109/IRC55401.2022.00039","DOIUrl":null,"url":null,"abstract":"The increased popularity and accessibility of UAVs may create potential threats. Researchers have been developing UAV detection and classification systems with different methods, including audio-based approach. However, the number of publicly available UAV audio datasets is limited. To fill this gap, we selected 10 different UAVs, ranging from toy hand drones to Class I drones, and recorded a total of 5215 seconds length of audio data generated from the flying UAVs. To the best of our knowledge, the proposed dataset is the largest audio dataset for UAVs so far. We further implemented a convolutional neural network (CNN) model for 10-class UAV classification and trained the model with the collected data. The overall test accuracy of the trained model is 97.7% and the test loss is 0.085.","PeriodicalId":282759,"journal":{"name":"2022 Sixth IEEE International Conference on Robotic Computing (IRC)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Sixth IEEE International Conference on Robotic Computing (IRC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRC55401.2022.00039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The increased popularity and accessibility of UAVs may create potential threats. Researchers have been developing UAV detection and classification systems with different methods, including audio-based approach. However, the number of publicly available UAV audio datasets is limited. To fill this gap, we selected 10 different UAVs, ranging from toy hand drones to Class I drones, and recorded a total of 5215 seconds length of audio data generated from the flying UAVs. To the best of our knowledge, the proposed dataset is the largest audio dataset for UAVs so far. We further implemented a convolutional neural network (CNN) model for 10-class UAV classification and trained the model with the collected data. The overall test accuracy of the trained model is 97.7% and the test loss is 0.085.