Sai Srinadhu Katta, Kide Vuojärvi, S. Nandyala, Ulla-Maria Kovalainen, Lauren Baddeley
{"title":"Real-World On-Board Uav Audio Data Set For Propeller Anomalies","authors":"Sai Srinadhu Katta, Kide Vuojärvi, S. Nandyala, Ulla-Maria Kovalainen, Lauren Baddeley","doi":"10.1109/ICASSP43922.2022.9747789","DOIUrl":null,"url":null,"abstract":"Detecting propeller damage in Unmanned Aerial Vehicles (UAV) is a crucial step in ensuring their operational resilience and safety. In this work, we present a novel real-world audio data set of propeller anomalies, and use several deep learning models to classify the damage. This data set consists of more than 5 hours of audio recordings, covering all configurations of intact and broken propellers in a UAV quadcopter. A microphone array was mounted onto a UAV, and numerous autonomous indoor missions were flown. Our on-board setup has provided clean audio recordings containing little background noise. We have developed classification models for this data set, using different deep learning architectures: Deep Neural Networks (DNNs), Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTM), and Transformer Encoder (TrEnc). We conclude that the TrEnc outperforms other architectures, having 11k parameters, .57M Flops, 98.30% accuracy, .98 precision, and .98 recall. Finally, we make our data set publicly available here⊙.","PeriodicalId":272439,"journal":{"name":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP43922.2022.9747789","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Detecting propeller damage in Unmanned Aerial Vehicles (UAV) is a crucial step in ensuring their operational resilience and safety. In this work, we present a novel real-world audio data set of propeller anomalies, and use several deep learning models to classify the damage. This data set consists of more than 5 hours of audio recordings, covering all configurations of intact and broken propellers in a UAV quadcopter. A microphone array was mounted onto a UAV, and numerous autonomous indoor missions were flown. Our on-board setup has provided clean audio recordings containing little background noise. We have developed classification models for this data set, using different deep learning architectures: Deep Neural Networks (DNNs), Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTM), and Transformer Encoder (TrEnc). We conclude that the TrEnc outperforms other architectures, having 11k parameters, .57M Flops, 98.30% accuracy, .98 precision, and .98 recall. Finally, we make our data set publicly available here⊙.
无人机螺旋桨损伤检测是保证无人机运行弹性和安全性的关键环节。在这项工作中,我们提出了一种新的真实世界螺旋桨异常音频数据集,并使用几种深度学习模型对损伤进行分类。该数据集由超过5小时的录音组成,涵盖了无人机四轴飞行器中完整和损坏的螺旋桨的所有配置。一个麦克风阵列被安装在一架无人机上,并执行了许多自主室内任务。我们的车载设置提供了包含少量背景噪音的干净音频记录。我们为该数据集开发了分类模型,使用不同的深度学习架构:深度神经网络(dnn)、卷积神经网络(cnn)、长短期记忆(LSTM)和变压器编码器(TrEnc)。我们得出结论,TrEnc优于其他架构,具有11k参数,0.57 m Flops, 98.30%准确率,0.98精度和0.98召回率。最后,我们在这里公开了我们的数据集⊙。