Deep learning in marine bioacoustics: a benchmark for baleen whale detection

IF 4.3 2区环境科学与生态学 Q1 ECOLOGY

Remote Sensing in Ecology and Conservation Pub Date : 2024-04-17 DOI:10.1002/rse2.392

Elena Schall, Idil Ilgaz Kaya, Elisabeth Debusschere, Paul Devos, Clea Parcerisas

{"title":"Deep learning in marine bioacoustics: a benchmark for baleen whale detection","authors":"Elena Schall, Idil Ilgaz Kaya, Elisabeth Debusschere, Paul Devos, Clea Parcerisas","doi":"10.1002/rse2.392","DOIUrl":null,"url":null,"abstract":"Passive acoustic monitoring (PAM) is commonly used to obtain year‐round continuous data on marine soundscapes harboring valuable information on species distributions or ecosystem dynamics. This continuously increasing amount of data requires highly efficient automated analysis techniques in order to exploit the full potential of the available data. Here, we propose a benchmark, which consists of a public dataset, a well‐defined task and evaluation procedure to develop and test automated analysis techniques. This benchmark focuses on the special case of detecting animal vocalizations in a real‐world dataset from the marine realm. We believe that such a benchmark is necessary to monitor the progress in the development of new detection algorithms in the field of marine bioacoustics. We ultimately use the proposed benchmark to test three detection approaches, namely ANIMAL‐SPOT, Koogu and a simple custom sequential convolutional neural network (CNN), and report performances. We report the performance of the three detection approaches in a blocked cross‐validation fashion with 11 site‐year blocks for a multi‐species detection scenario in a large marine passive acoustic dataset. Performance was measured with three simple metrics (i.e., true classification rate, noise misclassification rate and call misclassification rate) and one combined fitness metric, which allocates more weight to the minimization of false positives created by noise. Overall, ANIMAL‐SPOT performed the best with an average fitness metric of 0.6, followed by the custom CNN with an average fitness metric of 0.57 and finally Koogu with an average fitness metric of 0.42. The presented benchmark is an important step to advance in the automatic processing of the continuously growing amount of PAM data that are collected throughout the world's oceans. To ultimately achieve usability of developed algorithms, the focus of future work should be laid on the reduction of the false positives created by noise.","PeriodicalId":21132,"journal":{"name":"Remote Sensing in Ecology and Conservation","volume":"447 1","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing in Ecology and Conservation","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1002/rse2.392","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Passive acoustic monitoring (PAM) is commonly used to obtain year‐round continuous data on marine soundscapes harboring valuable information on species distributions or ecosystem dynamics. This continuously increasing amount of data requires highly efficient automated analysis techniques in order to exploit the full potential of the available data. Here, we propose a benchmark, which consists of a public dataset, a well‐defined task and evaluation procedure to develop and test automated analysis techniques. This benchmark focuses on the special case of detecting animal vocalizations in a real‐world dataset from the marine realm. We believe that such a benchmark is necessary to monitor the progress in the development of new detection algorithms in the field of marine bioacoustics. We ultimately use the proposed benchmark to test three detection approaches, namely ANIMAL‐SPOT, Koogu and a simple custom sequential convolutional neural network (CNN), and report performances. We report the performance of the three detection approaches in a blocked cross‐validation fashion with 11 site‐year blocks for a multi‐species detection scenario in a large marine passive acoustic dataset. Performance was measured with three simple metrics (i.e., true classification rate, noise misclassification rate and call misclassification rate) and one combined fitness metric, which allocates more weight to the minimization of false positives created by noise. Overall, ANIMAL‐SPOT performed the best with an average fitness metric of 0.6, followed by the custom CNN with an average fitness metric of 0.57 and finally Koogu with an average fitness metric of 0.42. The presented benchmark is an important step to advance in the automatic processing of the continuously growing amount of PAM data that are collected throughout the world's oceans. To ultimately achieve usability of developed algorithms, the focus of future work should be laid on the reduction of the false positives created by noise.

查看原文本刊更多论文

海洋生物声学中的深度学习：须鲸检测基准

被动声学监测（PAM）通常用于获取全年连续的海洋声景数据，这些数据蕴含着有关物种分布或生态系统动态的宝贵信息。这种持续增长的数据量需要高效的自动分析技术，以充分挖掘可用数据的潜力。在此，我们提出了一个基准，其中包括一个公共数据集、一个定义明确的任务和评估程序，用于开发和测试自动分析技术。该基准主要针对在海洋领域的真实数据集中检测动物发声的特殊情况。我们认为，这样一个基准对于监测海洋生物声学领域新检测算法的开发进度十分必要。我们最终使用提出的基准测试了三种检测方法，即 ANIMAL-SPOT、Koogu 和一个简单的自定义序列卷积神经网络（CNN），并报告了它们的性能。我们在大型海洋被动声学数据集中的多物种检测场景中，以 11 个站点年块的阻断交叉验证方式报告了三种检测方法的性能。性能用三个简单指标（即真实分类率、噪声误分类率和调用误分类率）和一个综合适应度指标来衡量，后者将更多权重分配给噪声造成的误报最小化。总体而言，ANIMAL-SPOT 的表现最好，平均适合度指标为 0.6，其次是定制 CNN，平均适合度指标为 0.57，最后是 Koogu，平均适合度指标为 0.42。所提出的基准是在自动处理全球海洋中收集到的持续增长的 PAM 数据方面迈出的重要一步。为了最终实现所开发算法的可用性，未来工作的重点应放在减少噪声造成的误报上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Remote Sensing in Ecology and Conservation Earth and Planetary Sciences-Computers in Earth Sciences

CiteScore

9.80

自引率

5.50%

发文量

审稿时长

18 weeks

期刊介绍： emote Sensing in Ecology and Conservation provides a forum for rapid, peer-reviewed publication of novel, multidisciplinary research at the interface between remote sensing science and ecology and conservation. The journal prioritizes findings that advance the scientific basis of ecology and conservation, promoting the development of remote-sensing based methods relevant to the management of land use and biological systems at all levels, from populations and species to ecosystems and biomes. The journal defines remote sensing in its broadest sense, including data acquisition by hand-held and fixed ground-based sensors, such as camera traps and acoustic recorders, and sensors on airplanes and satellites. The intended journal’s audience includes ecologists, conservation scientists, policy makers, managers of terrestrial and aquatic systems, remote sensing scientists, and students. Remote Sensing in Ecology and Conservation is a fully open access journal from Wiley and the Zoological Society of London. Remote sensing has enormous potential as to provide information on the state of, and pressures on, biological diversity and ecosystem services, at multiple spatial and temporal scales. This new publication provides a forum for multidisciplinary research in remote sensing science, ecological research and conservation science.