Morgan A. Ziegenhorn , Richard B. Lanctot , Stephen C. Brown , Miles Brengle , Shiloh Schulte , Sarah T. Saalfeld , Christopher J. Latty , Paul A. Smith , Nicolas Lecomte
{"title":"ArcticSoundsNET: BirdNET嵌入有助于改进北极物种的生物声学分类","authors":"Morgan A. Ziegenhorn , Richard B. Lanctot , Stephen C. Brown , Miles Brengle , Shiloh Schulte , Sarah T. Saalfeld , Christopher J. Latty , Paul A. Smith , Nicolas Lecomte","doi":"10.1016/j.ecoinf.2025.103270","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, deep learning has become a popular solution for processing large ecological monitoring datasets. This rise in use has resulted in global classification models for a variety of data types and taxa, such as BirdNET, which classifies vocalizations of more than 6000 avian species from acoustic data. These global models can be useful pre-trained models for transfer learning, allowing researchers to more easily develop classifiers specialized to their datasets. However, the development of such models hinges on the availability of comprehensive, high-quality training data, which can be difficult to acquire, produce, and use. We present a novel pipeline for creating training data from a large and unlabeled dataset with minimal human oversight. We used this pipeline and BirdNET as our base model to develop a transfer-learning-based model, ArcticSoundsNET, using acoustic monitoring data from 205 sites across Alaska's Arctic Coastal Plain. We compared performance of ArcticSoundsNET with that of BirdNET to evaluate the effectiveness of our pipeline and success of the new model. We found that the ability of ArcticSoundsNET to detect and classify avian vocalizations in our data greatly exceeded that of BirdNET (AUC ROC = 0.888 for ArcticSoundsNET, AUC ROC = 0.593 for BirdNET). Importantly, our method for developing a training dataset is widely applicable for ecologists who do not have large amounts of labeled data, facilitating the creation of task-specific classification models. Developing such models is an essential step in using large acoustic datasets to support ecological conservation of critical species and habitats.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"90 ","pages":"Article 103270"},"PeriodicalIF":7.3000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ArcticSoundsNET: BirdNET embeddings facilitate improved bioacoustic classification of Arctic species\",\"authors\":\"Morgan A. Ziegenhorn , Richard B. Lanctot , Stephen C. Brown , Miles Brengle , Shiloh Schulte , Sarah T. Saalfeld , Christopher J. Latty , Paul A. Smith , Nicolas Lecomte\",\"doi\":\"10.1016/j.ecoinf.2025.103270\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In recent years, deep learning has become a popular solution for processing large ecological monitoring datasets. This rise in use has resulted in global classification models for a variety of data types and taxa, such as BirdNET, which classifies vocalizations of more than 6000 avian species from acoustic data. These global models can be useful pre-trained models for transfer learning, allowing researchers to more easily develop classifiers specialized to their datasets. However, the development of such models hinges on the availability of comprehensive, high-quality training data, which can be difficult to acquire, produce, and use. We present a novel pipeline for creating training data from a large and unlabeled dataset with minimal human oversight. We used this pipeline and BirdNET as our base model to develop a transfer-learning-based model, ArcticSoundsNET, using acoustic monitoring data from 205 sites across Alaska's Arctic Coastal Plain. We compared performance of ArcticSoundsNET with that of BirdNET to evaluate the effectiveness of our pipeline and success of the new model. We found that the ability of ArcticSoundsNET to detect and classify avian vocalizations in our data greatly exceeded that of BirdNET (AUC ROC = 0.888 for ArcticSoundsNET, AUC ROC = 0.593 for BirdNET). Importantly, our method for developing a training dataset is widely applicable for ecologists who do not have large amounts of labeled data, facilitating the creation of task-specific classification models. Developing such models is an essential step in using large acoustic datasets to support ecological conservation of critical species and habitats.</div></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":\"90 \",\"pages\":\"Article 103270\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2025-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954125002791\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125002791","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
ArcticSoundsNET: BirdNET embeddings facilitate improved bioacoustic classification of Arctic species
In recent years, deep learning has become a popular solution for processing large ecological monitoring datasets. This rise in use has resulted in global classification models for a variety of data types and taxa, such as BirdNET, which classifies vocalizations of more than 6000 avian species from acoustic data. These global models can be useful pre-trained models for transfer learning, allowing researchers to more easily develop classifiers specialized to their datasets. However, the development of such models hinges on the availability of comprehensive, high-quality training data, which can be difficult to acquire, produce, and use. We present a novel pipeline for creating training data from a large and unlabeled dataset with minimal human oversight. We used this pipeline and BirdNET as our base model to develop a transfer-learning-based model, ArcticSoundsNET, using acoustic monitoring data from 205 sites across Alaska's Arctic Coastal Plain. We compared performance of ArcticSoundsNET with that of BirdNET to evaluate the effectiveness of our pipeline and success of the new model. We found that the ability of ArcticSoundsNET to detect and classify avian vocalizations in our data greatly exceeded that of BirdNET (AUC ROC = 0.888 for ArcticSoundsNET, AUC ROC = 0.593 for BirdNET). Importantly, our method for developing a training dataset is widely applicable for ecologists who do not have large amounts of labeled data, facilitating the creation of task-specific classification models. Developing such models is an essential step in using large acoustic datasets to support ecological conservation of critical species and habitats.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.