从回声定位呼叫中识别蝙蝠物种的微型CNN架构

I. Zualkernan, J. Judas, Taslim Mahbub, Azadan Bhagwagar, Priyanka Chand
{"title":"从回声定位呼叫中识别蝙蝠物种的微型CNN架构","authors":"I. Zualkernan, J. Judas, Taslim Mahbub, Azadan Bhagwagar, Priyanka Chand","doi":"10.1109/AI4G50087.2020.9311084","DOIUrl":null,"url":null,"abstract":"Effective monitoring of bat populations will contribute towards the United Nations' SGD 15 which is tied to maintaining biodiversity and SGD 3 which is about maintaining good health and well-being. Bat species are particularly sensitive to anthropogenic pressures and monitoring bat populations trends can serve as a good indicator of an ecosystem's health. Bats have also been linked to at least 60 strains of viruses (including Covid-19) that can infect humans. Monitoring bats can help contribute towards achieving both SGD goals. However, monitoring bats is a difficult and resource-consuming task. This paper investigates how monitoring can be enhanced by automatically identifying bats using audio data from their echolocation calls. Such a system will ease assessing their populations status, trends and habitats. A Convolutional Neural Network (CNN) was developed to identify eight different bat species based on their vocalizations. Alternative CNN models using Short-Term Fourier Transforms (STFTs), Mel-spectrograms Filter banks (MSFB), and Mel Frequency Cepstral Coefficients (MFCC) were developed. The CNN models were optimized using Hyperband. The best model used MSFB features and had only 220K parameters (0.892 MB) and can easily be embedded into a small handheld device. Using 10-fold testing, the best CNN model had an average Accuracy of 97.51% and Average F1-score of 0.9578.","PeriodicalId":286271,"journal":{"name":"2020 IEEE / ITU International Conference on Artificial Intelligence for Good (AI4G)","volume":"224 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"A Tiny CNN Architecture for Identifying Bat Species from Echolocation Calls\",\"authors\":\"I. Zualkernan, J. Judas, Taslim Mahbub, Azadan Bhagwagar, Priyanka Chand\",\"doi\":\"10.1109/AI4G50087.2020.9311084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Effective monitoring of bat populations will contribute towards the United Nations' SGD 15 which is tied to maintaining biodiversity and SGD 3 which is about maintaining good health and well-being. Bat species are particularly sensitive to anthropogenic pressures and monitoring bat populations trends can serve as a good indicator of an ecosystem's health. Bats have also been linked to at least 60 strains of viruses (including Covid-19) that can infect humans. Monitoring bats can help contribute towards achieving both SGD goals. However, monitoring bats is a difficult and resource-consuming task. This paper investigates how monitoring can be enhanced by automatically identifying bats using audio data from their echolocation calls. Such a system will ease assessing their populations status, trends and habitats. A Convolutional Neural Network (CNN) was developed to identify eight different bat species based on their vocalizations. Alternative CNN models using Short-Term Fourier Transforms (STFTs), Mel-spectrograms Filter banks (MSFB), and Mel Frequency Cepstral Coefficients (MFCC) were developed. The CNN models were optimized using Hyperband. The best model used MSFB features and had only 220K parameters (0.892 MB) and can easily be embedded into a small handheld device. Using 10-fold testing, the best CNN model had an average Accuracy of 97.51% and Average F1-score of 0.9578.\",\"PeriodicalId\":286271,\"journal\":{\"name\":\"2020 IEEE / ITU International Conference on Artificial Intelligence for Good (AI4G)\",\"volume\":\"224 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE / ITU International Conference on Artificial Intelligence for Good (AI4G)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AI4G50087.2020.9311084\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE / ITU International Conference on Artificial Intelligence for Good (AI4G)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AI4G50087.2020.9311084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

摘要

对蝙蝠种群进行有效监测将有助于实现联合国与维持生物多样性有关的第15 SGD目标和与维持良好健康和福祉有关的第3 SGD目标。蝙蝠物种对人为压力特别敏感,监测蝙蝠种群的趋势可以作为生态系统健康状况的良好指标。蝙蝠还与至少60种可感染人类的病毒株(包括Covid-19)有关。监测蝙蝠可以帮助实现SGD的两个目标。然而,监测蝙蝠是一项困难且消耗资源的任务。本文研究了如何通过使用回声定位呼叫的音频数据自动识别蝙蝠来增强监测。这样一个系统将有助于评估它们的种群状况、趋势和栖息地。研究人员开发了卷积神经网络(CNN),根据蝙蝠的发声来识别8种不同的蝙蝠。利用短期傅里叶变换(STFTs)、Mel-谱图滤波器组(MSFB)和Mel频率倒谱系数(MFCC)开发了替代CNN模型。使用Hyperband对CNN模型进行优化。最好的型号使用了MSFB功能,只有220K参数(0.892 MB),可以很容易地嵌入到小型手持设备中。通过10倍检验,最佳CNN模型的平均准确率为97.51%,平均f1分数为0.9578。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Tiny CNN Architecture for Identifying Bat Species from Echolocation Calls
Effective monitoring of bat populations will contribute towards the United Nations' SGD 15 which is tied to maintaining biodiversity and SGD 3 which is about maintaining good health and well-being. Bat species are particularly sensitive to anthropogenic pressures and monitoring bat populations trends can serve as a good indicator of an ecosystem's health. Bats have also been linked to at least 60 strains of viruses (including Covid-19) that can infect humans. Monitoring bats can help contribute towards achieving both SGD goals. However, monitoring bats is a difficult and resource-consuming task. This paper investigates how monitoring can be enhanced by automatically identifying bats using audio data from their echolocation calls. Such a system will ease assessing their populations status, trends and habitats. A Convolutional Neural Network (CNN) was developed to identify eight different bat species based on their vocalizations. Alternative CNN models using Short-Term Fourier Transforms (STFTs), Mel-spectrograms Filter banks (MSFB), and Mel Frequency Cepstral Coefficients (MFCC) were developed. The CNN models were optimized using Hyperband. The best model used MSFB features and had only 220K parameters (0.892 MB) and can easily be embedded into a small handheld device. Using 10-fold testing, the best CNN model had an average Accuracy of 97.51% and Average F1-score of 0.9578.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信