{"title":"几次迁移学习可以在景观尺度上对野生动物群落进行强大的声学监测","authors":"Giordano Jacuzzi, Julian D. Olden","doi":"10.1016/j.ecoinf.2025.103294","DOIUrl":null,"url":null,"abstract":"<div><div>Pre-trained classifiers for bioacoustic species detection have emerged as accessible tools for conservation and automated biodiversity monitoring at scale. Despite their wide use, these models are trained primarily on high-quality, weakly labeled species recordings from disparate regions that can fail to account for variation in passive ambient soundscape data due to poor signal-to-noise ratio and the presence of novel sounds such as abiotic noise and concurrent vocalizations. This domain shift may limit a pre-trained model's reliability and preclude its use in applied acoustic monitoring. Although relatively underexplored in bioacoustic monitoring, transfer learning may help overcome the limitations of existing pre-trained classifiers by enhancing predictive performance while avoiding the time, resources, and expertise required to develop entirely new models. We demonstrate the use of few-shot transfer learning as a strategy to efficiently adapt a pre-trained BirdNET source model to a new target domain with minimal local training data. We present an open-source workflow with guidelines to evaluate performance with ecologically meaningful metrics and train a target model with improved or missing classes for biotic and abiotic signals of interest. This process is demonstrated with an applied case study of community level acoustic monitoring across a managed temperate rainforest landscape in western Washington, United States.</div><div>The target model achieved a mean precision-recall AUC of 0.94 at the audio segment level and a mean accuracy of 92 % at the site use level across trained avian species classes, an increase of 0.15 and 13 % over the pre-trained source model, respectively. These improvements increased the probability of individual species detection and produced more accurate estimates of local species richness. Several species of high conservation and management priority that could not be reliably detected by the source model achieved high performance. Superior performance was achieved with an average of only 8 local training examples per species class. Training novel classes for other signals of interest (e.g., anurans, insects, aircraft, logging, rain) reduced confusion rates and mitigated the impact of the local environment on performance. Few-shot transfer learning leverages the learned knowledge of a pre-trained classifier to enable robust monitoring at the community level across the landscape and offers an approach that can be readily applied to other environments and taxa.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"90 ","pages":"Article 103294"},"PeriodicalIF":5.8000,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Few-shot transfer learning enables robust acoustic monitoring of wildlife communities at the landscape scale\",\"authors\":\"Giordano Jacuzzi, Julian D. Olden\",\"doi\":\"10.1016/j.ecoinf.2025.103294\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Pre-trained classifiers for bioacoustic species detection have emerged as accessible tools for conservation and automated biodiversity monitoring at scale. Despite their wide use, these models are trained primarily on high-quality, weakly labeled species recordings from disparate regions that can fail to account for variation in passive ambient soundscape data due to poor signal-to-noise ratio and the presence of novel sounds such as abiotic noise and concurrent vocalizations. This domain shift may limit a pre-trained model's reliability and preclude its use in applied acoustic monitoring. Although relatively underexplored in bioacoustic monitoring, transfer learning may help overcome the limitations of existing pre-trained classifiers by enhancing predictive performance while avoiding the time, resources, and expertise required to develop entirely new models. We demonstrate the use of few-shot transfer learning as a strategy to efficiently adapt a pre-trained BirdNET source model to a new target domain with minimal local training data. We present an open-source workflow with guidelines to evaluate performance with ecologically meaningful metrics and train a target model with improved or missing classes for biotic and abiotic signals of interest. This process is demonstrated with an applied case study of community level acoustic monitoring across a managed temperate rainforest landscape in western Washington, United States.</div><div>The target model achieved a mean precision-recall AUC of 0.94 at the audio segment level and a mean accuracy of 92 % at the site use level across trained avian species classes, an increase of 0.15 and 13 % over the pre-trained source model, respectively. These improvements increased the probability of individual species detection and produced more accurate estimates of local species richness. Several species of high conservation and management priority that could not be reliably detected by the source model achieved high performance. Superior performance was achieved with an average of only 8 local training examples per species class. Training novel classes for other signals of interest (e.g., anurans, insects, aircraft, logging, rain) reduced confusion rates and mitigated the impact of the local environment on performance. Few-shot transfer learning leverages the learned knowledge of a pre-trained classifier to enable robust monitoring at the community level across the landscape and offers an approach that can be readily applied to other environments and taxa.</div></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":\"90 \",\"pages\":\"Article 103294\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954125003036\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125003036","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
Few-shot transfer learning enables robust acoustic monitoring of wildlife communities at the landscape scale
Pre-trained classifiers for bioacoustic species detection have emerged as accessible tools for conservation and automated biodiversity monitoring at scale. Despite their wide use, these models are trained primarily on high-quality, weakly labeled species recordings from disparate regions that can fail to account for variation in passive ambient soundscape data due to poor signal-to-noise ratio and the presence of novel sounds such as abiotic noise and concurrent vocalizations. This domain shift may limit a pre-trained model's reliability and preclude its use in applied acoustic monitoring. Although relatively underexplored in bioacoustic monitoring, transfer learning may help overcome the limitations of existing pre-trained classifiers by enhancing predictive performance while avoiding the time, resources, and expertise required to develop entirely new models. We demonstrate the use of few-shot transfer learning as a strategy to efficiently adapt a pre-trained BirdNET source model to a new target domain with minimal local training data. We present an open-source workflow with guidelines to evaluate performance with ecologically meaningful metrics and train a target model with improved or missing classes for biotic and abiotic signals of interest. This process is demonstrated with an applied case study of community level acoustic monitoring across a managed temperate rainforest landscape in western Washington, United States.
The target model achieved a mean precision-recall AUC of 0.94 at the audio segment level and a mean accuracy of 92 % at the site use level across trained avian species classes, an increase of 0.15 and 13 % over the pre-trained source model, respectively. These improvements increased the probability of individual species detection and produced more accurate estimates of local species richness. Several species of high conservation and management priority that could not be reliably detected by the source model achieved high performance. Superior performance was achieved with an average of only 8 local training examples per species class. Training novel classes for other signals of interest (e.g., anurans, insects, aircraft, logging, rain) reduced confusion rates and mitigated the impact of the local environment on performance. Few-shot transfer learning leverages the learned knowledge of a pre-trained classifier to enable robust monitoring at the community level across the landscape and offers an approach that can be readily applied to other environments and taxa.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.