Dan Liu , Qianqian Sun , Jin Hou , Bochuan Zheng , Jindong Zhang , Desheng Li , Tomas Norton , Jifeng Ning
{"title":"加强卧龙11种濒危物种的野生动物行动识别","authors":"Dan Liu , Qianqian Sun , Jin Hou , Bochuan Zheng , Jindong Zhang , Desheng Li , Tomas Norton , Jifeng Ning","doi":"10.1016/j.ecoinf.2025.103148","DOIUrl":null,"url":null,"abstract":"<div><div>The video of wild animals captured by trap cameras provides conservationists with intuitive information on animal action, holding significant potential in ethology and ecology. This study focuses on 11 endangered wild animal species videos in the Wolong Nature Reserve and develops a video self-supervised learning-based animal action recognition network—Wild ActionFormer, to achieve automated analysis of wild animal action classes. We utilize UniformerV2 as the base backbone network, integrating self-supervised learning methods to enhance feature extraction capabilities. We constructed a differential dispersion regularization loss function to maintain the alignment of self-supervised learning features and improve the network’s robustness against interference. The introduction of the Focal Loss reweighting strategy optimizes the loss for long-tail classes, mitigating the bias towards head data. Experimental results on our released LoTE-Animal open-source dataset show that the proposed action recognition network achieves a Top-1 accuracy of 95.09%, approximately 4 percentage points higher than the baseline. The LoTE-Animal dataset comprises 10k videos, including endangered wild animals from the Wolong Nature Reserve in Sichuan, China, such as the giant panda, sambar, and Sichuan snub-nosed monkey.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"89 ","pages":"Article 103148"},"PeriodicalIF":5.8000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Wild ActionFormer: Enhancing wildlife action recognition for 11 endangered species in Wolong\",\"authors\":\"Dan Liu , Qianqian Sun , Jin Hou , Bochuan Zheng , Jindong Zhang , Desheng Li , Tomas Norton , Jifeng Ning\",\"doi\":\"10.1016/j.ecoinf.2025.103148\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The video of wild animals captured by trap cameras provides conservationists with intuitive information on animal action, holding significant potential in ethology and ecology. This study focuses on 11 endangered wild animal species videos in the Wolong Nature Reserve and develops a video self-supervised learning-based animal action recognition network—Wild ActionFormer, to achieve automated analysis of wild animal action classes. We utilize UniformerV2 as the base backbone network, integrating self-supervised learning methods to enhance feature extraction capabilities. We constructed a differential dispersion regularization loss function to maintain the alignment of self-supervised learning features and improve the network’s robustness against interference. The introduction of the Focal Loss reweighting strategy optimizes the loss for long-tail classes, mitigating the bias towards head data. Experimental results on our released LoTE-Animal open-source dataset show that the proposed action recognition network achieves a Top-1 accuracy of 95.09%, approximately 4 percentage points higher than the baseline. The LoTE-Animal dataset comprises 10k videos, including endangered wild animals from the Wolong Nature Reserve in Sichuan, China, such as the giant panda, sambar, and Sichuan snub-nosed monkey.</div></div>\",\"PeriodicalId\":51024,\"journal\":{\"name\":\"Ecological Informatics\",\"volume\":\"89 \",\"pages\":\"Article 103148\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ecological Informatics\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574954125001578\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125001578","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
Wild ActionFormer: Enhancing wildlife action recognition for 11 endangered species in Wolong
The video of wild animals captured by trap cameras provides conservationists with intuitive information on animal action, holding significant potential in ethology and ecology. This study focuses on 11 endangered wild animal species videos in the Wolong Nature Reserve and develops a video self-supervised learning-based animal action recognition network—Wild ActionFormer, to achieve automated analysis of wild animal action classes. We utilize UniformerV2 as the base backbone network, integrating self-supervised learning methods to enhance feature extraction capabilities. We constructed a differential dispersion regularization loss function to maintain the alignment of self-supervised learning features and improve the network’s robustness against interference. The introduction of the Focal Loss reweighting strategy optimizes the loss for long-tail classes, mitigating the bias towards head data. Experimental results on our released LoTE-Animal open-source dataset show that the proposed action recognition network achieves a Top-1 accuracy of 95.09%, approximately 4 percentage points higher than the baseline. The LoTE-Animal dataset comprises 10k videos, including endangered wild animals from the Wolong Nature Reserve in Sichuan, China, such as the giant panda, sambar, and Sichuan snub-nosed monkey.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.