A lightweight self-attention metric network for bird species recognition in intelligent bird repellent equipment

IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Jiangjian Xie , Shanshan Xie , Baican Li , Yujie Zhong , Chunhe Hu , Junguo Zhang , Björn W. Schuller
{"title":"A lightweight self-attention metric network for bird species recognition in intelligent bird repellent equipment","authors":"Jiangjian Xie ,&nbsp;Shanshan Xie ,&nbsp;Baican Li ,&nbsp;Yujie Zhong ,&nbsp;Chunhe Hu ,&nbsp;Junguo Zhang ,&nbsp;Björn W. Schuller","doi":"10.1016/j.engappai.2025.112546","DOIUrl":null,"url":null,"abstract":"<div><div>Bird damages to power transmission lines pose significant operational risks, and intelligent bird repellent equipment (IBRE) requires accurate species recognition for effective long-term repellent. We propose a novel lightweight self-attention metric network (LSAM-Net) for few-shot bird species recognition in the vicinity of power transmission lines, aiming to enhance the performance of IBRE. LSAM-Net integrates a simple attention mechanism (SimAM) to emphasize critical spatial and channel features, thereby enhancing the extraction of key semantic information from bird images. Additionally, a self-correlation representation (SCR) module is employed to capture local structural patterns, effectively mitigating the impact of pseudo-features and improving the network’s capacity to learn discriminative representations. To promote the utilization of local discriminative information in few-shot classification, LSAM-Net leverages earth mover’s distance (EMD) to compute structural similarity between images. For efficient deployment, we apply knowledge distillation to further reduce model complexity. Extensive experiments conducted on Bird-65, CUB200, 2011, miniImageNet, and Fewshot-CIFAR100 demonstrate that LSAM-Net achieves superior performance compared to state-of-the-art methods, while maintaining a compact architecture. On the Bird-65 and CUB200-2011 datasets, LSAM-Net requires only 4.75 and 1.18 giga floating-point operations (GFLOPs), and achieves inference speed improvements of 52.9 % and 48.9 %, respectively, over the self-attention metric network (SAM-Net). Further optimization with TensorRT yields additional reductions in inference time by 43.6 ms and 53.7 ms, respectively. These improvements significantly support species-specific repellent strategies, thereby enhancing the long-term effectiveness of IBRE systems.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"162 ","pages":"Article 112546"},"PeriodicalIF":8.0000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625025771","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Bird damages to power transmission lines pose significant operational risks, and intelligent bird repellent equipment (IBRE) requires accurate species recognition for effective long-term repellent. We propose a novel lightweight self-attention metric network (LSAM-Net) for few-shot bird species recognition in the vicinity of power transmission lines, aiming to enhance the performance of IBRE. LSAM-Net integrates a simple attention mechanism (SimAM) to emphasize critical spatial and channel features, thereby enhancing the extraction of key semantic information from bird images. Additionally, a self-correlation representation (SCR) module is employed to capture local structural patterns, effectively mitigating the impact of pseudo-features and improving the network’s capacity to learn discriminative representations. To promote the utilization of local discriminative information in few-shot classification, LSAM-Net leverages earth mover’s distance (EMD) to compute structural similarity between images. For efficient deployment, we apply knowledge distillation to further reduce model complexity. Extensive experiments conducted on Bird-65, CUB200, 2011, miniImageNet, and Fewshot-CIFAR100 demonstrate that LSAM-Net achieves superior performance compared to state-of-the-art methods, while maintaining a compact architecture. On the Bird-65 and CUB200-2011 datasets, LSAM-Net requires only 4.75 and 1.18 giga floating-point operations (GFLOPs), and achieves inference speed improvements of 52.9 % and 48.9 %, respectively, over the self-attention metric network (SAM-Net). Further optimization with TensorRT yields additional reductions in inference time by 43.6 ms and 53.7 ms, respectively. These improvements significantly support species-specific repellent strategies, thereby enhancing the long-term effectiveness of IBRE systems.
智能驱鸟设备中鸟类物种识别的轻量级自关注度量网络
鸟类对输电线路的危害构成了重大的运行风险,智能驱鸟设备(IBRE)需要准确的物种识别才能实现有效的长期驱鸟。为了提高IBRE的性能,提出了一种新的轻量级自关注度量网络(LSAM-Net),用于输电线路附近的少射鸟类物种识别。LSAM-Net集成了简单注意机制(SimAM)来强调关键的空间和通道特征,从而增强了对鸟类图像关键语义信息的提取。此外,采用自相关表示(SCR)模块捕获局部结构模式,有效减轻了伪特征的影响,提高了网络学习判别表示的能力。为了促进局部判别信息在少弹分类中的利用,LSAM-Net利用土动器距离(EMD)来计算图像之间的结构相似性。为了高效部署,我们应用知识蒸馏进一步降低模型复杂度。在Bird-65、CUB200、2011、miniImageNet和Fewshot-CIFAR100上进行的大量实验表明,与最先进的方法相比,LSAM-Net实现了卓越的性能,同时保持了紧凑的架构。在Bird-65和CUB200-2011数据集上,LSAM-Net只需要4.75和1.18千兆浮点运算(GFLOPs),与自关注度量网络(SAM-Net)相比,其推理速度分别提高了52.9%和48.9%。使用TensorRT进行进一步优化,推理时间分别减少了43.6 ms和53.7 ms。这些改进极大地支持了物种特异性驱避策略,从而提高了IBRE系统的长期有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信