加强卧龙11种濒危物种的野生动物行动识别

IF 5.8 2区 环境科学与生态学 Q1 ECOLOGY
Dan Liu , Qianqian Sun , Jin Hou , Bochuan Zheng , Jindong Zhang , Desheng Li , Tomas Norton , Jifeng Ning
{"title":"加强卧龙11种濒危物种的野生动物行动识别","authors":"Dan Liu ,&nbsp;Qianqian Sun ,&nbsp;Jin Hou ,&nbsp;Bochuan Zheng ,&nbsp;Jindong Zhang ,&nbsp;Desheng Li ,&nbsp;Tomas Norton ,&nbsp;Jifeng Ning","doi":"10.1016/j.ecoinf.2025.103148","DOIUrl":null,"url":null,"abstract":"<div><div>The video of wild animals captured by trap cameras provides conservationists with intuitive information on animal action, holding significant potential in ethology and ecology. This study focuses on 11 endangered wild animal species videos in the Wolong Nature Reserve and develops a video self-supervised learning-based animal action recognition network—Wild ActionFormer, to achieve automated analysis of wild animal action classes. We utilize UniformerV2 as the base backbone network, integrating self-supervised learning methods to enhance feature extraction capabilities. We constructed a differential dispersion regularization loss function to maintain the alignment of self-supervised learning features and improve the network’s robustness against interference. The introduction of the Focal Loss reweighting strategy optimizes the loss for long-tail classes, mitigating the bias towards head data. Experimental results on our released LoTE-Animal open-source dataset show that the proposed action recognition network achieves a Top-1 accuracy of 95.09%, approximately 4 percentage points higher than the baseline. The LoTE-Animal dataset comprises 10k videos, including endangered wild animals from the Wolong Nature Reserve in Sichuan, China, such as the giant panda, sambar, and Sichuan snub-nosed monkey.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"89 ","pages":"Article 103148"},"PeriodicalIF":5.8000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Wild ActionFormer: Enhancing wildlife action recognition for 11 endangered species in Wolong\",\"authors\":\"Dan Liu ,&nbsp;Qianqian Sun ,&nbsp;Jin Hou ,&nbsp;Bochuan Zheng ,&nbsp;Jindong Zhang ,&nbsp;Desheng Li ,&nbsp;Tomas Norton ,&nbsp;Jifeng Ning\",\"doi\":\"10.1016/j.ecoinf.2025.103148\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The video of wild animals captured by trap cameras provides conservationists with intuitive information on animal action, holding significant potential in ethology and ecology. This study focuses on 11 endangered wild animal species videos in the Wolong Nature Reserve and develops a video self-supervised learning-based animal action recognition network—Wild ActionFormer, to achieve automated analysis of wild animal action classes. We utilize UniformerV2 as the base backbone network, integrating self-supervised learning methods to enhance feature extraction capabilities. We constructed a differential dispersion regularization loss function to maintain the alignment of self-supervised learning features and improve the network’s robustness against interference. The introduction of the Focal Loss reweighting strategy optimizes the loss for long-tail classes, mitigating the bias towards head data. Experimental results on our released LoTE-Animal open-source dataset show that the proposed action recognition network achieves a Top-1 accuracy of 95.09%, approximately 4 percentage points higher than the baseline. The LoTE-Animal dataset comprises 10k videos, including endangered wild animals from the Wolong Nature Reserve in Sichuan, China, such as the giant panda, sambar, and Sichuan snub-nosed monkey.</div></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":\"89 \",\"pages\":\"Article 103148\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954125001578\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125001578","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

陷阱摄像机捕捉到的野生动物的视频为保护主义者提供了直观的动物行为信息,在动物行为学和生态学方面具有重要的潜力。本研究以卧龙自然保护区11个濒危野生动物物种视频为研究对象,开发了基于视频自监督学习的动物动作识别网络——wild ActionFormer,实现对野生动物动作类的自动分析。我们利用UniformerV2作为基础骨干网,结合自监督学习方法增强特征提取能力。我们构建了一个微分色散正则化损失函数来保持自监督学习特征的一致性,并提高网络对干扰的鲁棒性。焦点损失重新加权策略的引入优化了长尾类的损失,减轻了对头部数据的偏见。在我们发布的lot - animal开源数据集上的实验结果表明,所提出的动作识别网络达到了95.09%的Top-1准确率,比基线提高了约4个百分点。lotte - animal数据集包括中国四川卧龙自然保护区的大熊猫、桑巴、四川仰鼻猴等濒危野生动物的1万多段视频。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Wild ActionFormer: Enhancing wildlife action recognition for 11 endangered species in Wolong
The video of wild animals captured by trap cameras provides conservationists with intuitive information on animal action, holding significant potential in ethology and ecology. This study focuses on 11 endangered wild animal species videos in the Wolong Nature Reserve and develops a video self-supervised learning-based animal action recognition network—Wild ActionFormer, to achieve automated analysis of wild animal action classes. We utilize UniformerV2 as the base backbone network, integrating self-supervised learning methods to enhance feature extraction capabilities. We constructed a differential dispersion regularization loss function to maintain the alignment of self-supervised learning features and improve the network’s robustness against interference. The introduction of the Focal Loss reweighting strategy optimizes the loss for long-tail classes, mitigating the bias towards head data. Experimental results on our released LoTE-Animal open-source dataset show that the proposed action recognition network achieves a Top-1 accuracy of 95.09%, approximately 4 percentage points higher than the baseline. The LoTE-Animal dataset comprises 10k videos, including endangered wild animals from the Wolong Nature Reserve in Sichuan, China, such as the giant panda, sambar, and Sichuan snub-nosed monkey.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Ecological Informatics
Ecological Informatics 环境科学-生态学
CiteScore
8.30
自引率
11.80%
发文量
346
审稿时长
46 days
期刊介绍: The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change. The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信