Deep Learning for Recognizing Bat Species and Bat Behavior in Audio Recordings

M. Vogelbacher, Hicham Bellafkir, Jannis Gottwald, Daniel Schneider, Markus Muhling, Bernd Freisleben
{"title":"Deep Learning for Recognizing Bat Species and Bat Behavior in Audio Recordings","authors":"M. Vogelbacher, Hicham Bellafkir, Jannis Gottwald, Daniel Schneider, Markus Muhling, Bernd Freisleben","doi":"10.1109/SDS57534.2023.00014","DOIUrl":null,"url":null,"abstract":"Monitoring and mitigating the continuous decline of biodiversity is a key global challenge to preserve the existential basis of human life. Bats as one of the most widespread species among terrestrial mammals are excellent indicators for biodiversity and hence for the health of an ecosystem. Typically, bats are monitored by analyzing ultrasonic sound recordings. Stateof-the-art deep learning approaches for automatic bat detection and bat species recognition commonly rely on audio spectrogram classification models based on fixed time segments, lacking exact call boundaries. While great progress has been made on bat species recognition using echolocation calls, little attention has been paid to bat behavior recognition that provides valuable additional information about bat populations. In this paper, we present a novel end-to-end approach for bat species recognition and bat behavior recognition based on a deep neural network for object detection. In contrast to state-of-the-art approaches, the presented model provides accurate call boundaries. It recognizes 19 bat species and distinguishes between three different behaviors: orientation (echolocation calls), hunting (feeding buzzes), and social behavior (social calls). Our experiments with two data sets show that our method clearly outperforms previous approaches for bat species recognition, achieving up to 86.2% mean average precision. It also performs very well for bat behavior recognition, reaching up to 98.4%, 98.3%, and 95.6% average precision for recognizing echolocation calls, feeding buzzes, and social calls, respectively.","PeriodicalId":150544,"journal":{"name":"2023 10th IEEE Swiss Conference on Data Science (SDS)","volume":"32 11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 10th IEEE Swiss Conference on Data Science (SDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SDS57534.2023.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Monitoring and mitigating the continuous decline of biodiversity is a key global challenge to preserve the existential basis of human life. Bats as one of the most widespread species among terrestrial mammals are excellent indicators for biodiversity and hence for the health of an ecosystem. Typically, bats are monitored by analyzing ultrasonic sound recordings. Stateof-the-art deep learning approaches for automatic bat detection and bat species recognition commonly rely on audio spectrogram classification models based on fixed time segments, lacking exact call boundaries. While great progress has been made on bat species recognition using echolocation calls, little attention has been paid to bat behavior recognition that provides valuable additional information about bat populations. In this paper, we present a novel end-to-end approach for bat species recognition and bat behavior recognition based on a deep neural network for object detection. In contrast to state-of-the-art approaches, the presented model provides accurate call boundaries. It recognizes 19 bat species and distinguishes between three different behaviors: orientation (echolocation calls), hunting (feeding buzzes), and social behavior (social calls). Our experiments with two data sets show that our method clearly outperforms previous approaches for bat species recognition, achieving up to 86.2% mean average precision. It also performs very well for bat behavior recognition, reaching up to 98.4%, 98.3%, and 95.6% average precision for recognizing echolocation calls, feeding buzzes, and social calls, respectively.
深度学习识别蝙蝠种类和蝙蝠行为的录音
监测和缓解生物多样性的持续下降是维护人类生命生存基础的一项关键全球挑战。蝙蝠作为陆生哺乳动物中分布最广的物种之一,是生物多样性和生态系统健康的极好指标。通常,蝙蝠是通过分析超声波录音来监测的。目前最先进的用于蝙蝠自动检测和蝙蝠物种识别的深度学习方法通常依赖于基于固定时间段的音频频谱图分类模型,缺乏精确的呼叫边界。虽然利用回声定位识别蝙蝠物种取得了很大进展,但很少有人关注蝙蝠行为识别,这为蝙蝠种群提供了有价值的额外信息。在本文中,我们提出了一种基于深度神经网络的端到端蝙蝠物种识别和蝙蝠行为识别方法。与最先进的方法相比,所提出的模型提供了准确的调用边界。它可以识别19种蝙蝠,并区分三种不同的行为:定向(回声定位呼叫),狩猎(觅食嗡嗡声)和社会行为(社会呼叫)。在两个数据集上的实验表明,我们的方法明显优于以往的蝙蝠物种识别方法,平均准确率高达86.2%。它在识别蝙蝠行为方面也表现得很好,在识别回声定位呼叫、喂食蜂鸣声和社交呼叫时,平均精度分别达到98.4%、98.3%和95.6%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信