Yang Wang , Chang Ma , Chuanxin Zhao , Huijuan Xia , Congxi Chen , Ying Zhang
{"title":"WB-YOLO:一种用于复杂环境下生态监测的高效野生蝙蝠检测方法","authors":"Yang Wang , Chang Ma , Chuanxin Zhao , Huijuan Xia , Congxi Chen , Ying Zhang","doi":"10.1016/j.engappai.2025.111232","DOIUrl":null,"url":null,"abstract":"<div><div>The study of bat species and their distribution is vital for understanding the origins and transmission pathways of epidemic diseases. However, the detection of wild bats faces significant challenges due to their complex natural habitats and frequent occlusions caused by social behavior. To address these issues, we propose an object detection method based on the improved You Only Look Once version 7 (YOLOv7) to achieve efficient wild bat detection (WB-YOLO). This method integrates a Vision Transformer encoder module to improve global context integration, adopts deformable convolution, and optimizes the spatial pyramid pooling structure for effective multi-scale feature fusion while reducing computational complexity. Furthermore, a hybrid attention mechanism is introduced to capture both spatial and channel information, enhancing robustness in complex environments. Experimental results in a data set of wild bat images collected in Anhui Province demonstrate that WB-YOLO achieves a precision of 90. 7%, a recall of 89. 0%, and a mean average precision (mAP) of 94. 7%, significantly outperforming other deep learning models in detecting bats in complex scenes and under occlusion. Our approach offers an efficient and accurate solution for the detection of wild bats in real time, with potential applications in ecological research and disease prevention. Code and data related to this work are publicly available at <span><span>https://github.com/macandzzz/WB-YOLO</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"157 ","pages":"Article 111232"},"PeriodicalIF":8.0000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"WB-YOLO: An efficient wild bat detection method for ecological monitoring in complex environments\",\"authors\":\"Yang Wang , Chang Ma , Chuanxin Zhao , Huijuan Xia , Congxi Chen , Ying Zhang\",\"doi\":\"10.1016/j.engappai.2025.111232\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The study of bat species and their distribution is vital for understanding the origins and transmission pathways of epidemic diseases. However, the detection of wild bats faces significant challenges due to their complex natural habitats and frequent occlusions caused by social behavior. To address these issues, we propose an object detection method based on the improved You Only Look Once version 7 (YOLOv7) to achieve efficient wild bat detection (WB-YOLO). This method integrates a Vision Transformer encoder module to improve global context integration, adopts deformable convolution, and optimizes the spatial pyramid pooling structure for effective multi-scale feature fusion while reducing computational complexity. Furthermore, a hybrid attention mechanism is introduced to capture both spatial and channel information, enhancing robustness in complex environments. Experimental results in a data set of wild bat images collected in Anhui Province demonstrate that WB-YOLO achieves a precision of 90. 7%, a recall of 89. 0%, and a mean average precision (mAP) of 94. 7%, significantly outperforming other deep learning models in detecting bats in complex scenes and under occlusion. Our approach offers an efficient and accurate solution for the detection of wild bats in real time, with potential applications in ecological research and disease prevention. Code and data related to this work are publicly available at <span><span>https://github.com/macandzzz/WB-YOLO</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"157 \",\"pages\":\"Article 111232\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625012333\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625012333","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
对蝙蝠种类及其分布的研究对于了解流行病的起源和传播途径至关重要。然而,野生蝙蝠由于其复杂的自然栖息地和频繁的社会行为造成的闭塞,对其的检测面临着巨大的挑战。为了解决这些问题,我们提出了一种基于改进的You Only Look Once version 7 (YOLOv7)的目标检测方法,以实现高效的野生蝙蝠检测(WB-YOLO)。该方法集成了Vision Transformer编码器模块以提高全局上下文集成,采用可变形卷积,并优化空间金字塔池结构以实现有效的多尺度特征融合,同时降低了计算复杂度。此外,引入了混合注意机制来捕获空间和通道信息,增强了在复杂环境中的鲁棒性。在安徽省野生蝙蝠图像数据集上的实验结果表明,WB-YOLO的精度达到了90。7%,召回89件。,平均精度(mAP)为94。7%,在复杂场景和遮挡下检测蝙蝠方面显著优于其他深度学习模型。我们的方法为野生蝙蝠的实时检测提供了一种高效、准确的解决方案,在生态研究和疾病预防方面具有潜在的应用前景。与这项工作相关的代码和数据可在https://github.com/macandzzz/WB-YOLO上公开获取。
WB-YOLO: An efficient wild bat detection method for ecological monitoring in complex environments
The study of bat species and their distribution is vital for understanding the origins and transmission pathways of epidemic diseases. However, the detection of wild bats faces significant challenges due to their complex natural habitats and frequent occlusions caused by social behavior. To address these issues, we propose an object detection method based on the improved You Only Look Once version 7 (YOLOv7) to achieve efficient wild bat detection (WB-YOLO). This method integrates a Vision Transformer encoder module to improve global context integration, adopts deformable convolution, and optimizes the spatial pyramid pooling structure for effective multi-scale feature fusion while reducing computational complexity. Furthermore, a hybrid attention mechanism is introduced to capture both spatial and channel information, enhancing robustness in complex environments. Experimental results in a data set of wild bat images collected in Anhui Province demonstrate that WB-YOLO achieves a precision of 90. 7%, a recall of 89. 0%, and a mean average precision (mAP) of 94. 7%, significantly outperforming other deep learning models in detecting bats in complex scenes and under occlusion. Our approach offers an efficient and accurate solution for the detection of wild bats in real time, with potential applications in ecological research and disease prevention. Code and data related to this work are publicly available at https://github.com/macandzzz/WB-YOLO.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.