Lightweight Conv-Swin Transformer for Wildlife Detection

Guobin Yang, Chenhong Sui, Fuhao Jiang, Yunhao Pan, Ankang Zang, Jian Hu
{"title":"Lightweight Conv-Swin Transformer for Wildlife Detection","authors":"Guobin Yang, Chenhong Sui, Fuhao Jiang, Yunhao Pan, Ankang Zang, Jian Hu","doi":"10.1109/ICARCE55724.2022.10046623","DOIUrl":null,"url":null,"abstract":"Wildlife detection is of great significance for wildlife monitoring and protection. Among existing object detection methods, Faster RCNN is a typical two-stage object detection method. In despite of its effectiveness, it suffers from the less satisfactory detection accuracy. This is mainly limited by the insufficient global representation of both objects and scenes. To this end, this paper proposes a lightweight Conv-Swin Transformer method for wildlife detection involving a lightweight combination of both convolution and Swin Transformer. In this study, Lightweight improvements are made in two main ways. The first one is done by reducing the number of Blocks in the third stage of the Swin Transformer; the second one is done by optimizing the down-sampling of different stages in the Swin Transformer network through the convolutional structure, which can speed up the detection of the model and improve the detection efficiency. The Faster RCNN model was chosen for experiments on a self-constructed wildlife dataset, using three different CNNs as well as the Swin Transformer as the backbone network for comparison. Experimental results show that the improved Conv-Swin Transformer, which combines the advantages of the attention mechanism and the convolutional structure, improves detection speed by 17.5% with a slight reduction in detection accuracy.","PeriodicalId":416305,"journal":{"name":"2022 International Conference on Automation, Robotics and Computer Engineering (ICARCE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Automation, Robotics and Computer Engineering (ICARCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICARCE55724.2022.10046623","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Wildlife detection is of great significance for wildlife monitoring and protection. Among existing object detection methods, Faster RCNN is a typical two-stage object detection method. In despite of its effectiveness, it suffers from the less satisfactory detection accuracy. This is mainly limited by the insufficient global representation of both objects and scenes. To this end, this paper proposes a lightweight Conv-Swin Transformer method for wildlife detection involving a lightweight combination of both convolution and Swin Transformer. In this study, Lightweight improvements are made in two main ways. The first one is done by reducing the number of Blocks in the third stage of the Swin Transformer; the second one is done by optimizing the down-sampling of different stages in the Swin Transformer network through the convolutional structure, which can speed up the detection of the model and improve the detection efficiency. The Faster RCNN model was chosen for experiments on a self-constructed wildlife dataset, using three different CNNs as well as the Swin Transformer as the backbone network for comparison. Experimental results show that the improved Conv-Swin Transformer, which combines the advantages of the attention mechanism and the convolutional structure, improves detection speed by 17.5% with a slight reduction in detection accuracy.
用于野生动物检测的轻型逆变变压器
野生动物检测对野生动物监测和保护具有重要意义。在现有的目标检测方法中,Faster RCNN是一种典型的两阶段目标检测方法。尽管它很有效,但它的检测精度却不尽如人意。这主要受到对象和场景的全局表示不足的限制。为此,本文提出了一种轻量级的卷积-Swin Transformer野生动物检测方法,该方法将卷积和Swin Transformer轻量级结合。在本研究中,轻量级的改进主要通过两种方式进行。第一种是通过减少Swin变压器第三级的block数量来实现的;二是通过卷积结构对Swin变压器网络中不同阶段的下采样进行优化,加快了模型的检测速度,提高了检测效率。在自构建的野生动物数据集上,选择Faster RCNN模型进行实验,使用三种不同的cnn和Swin Transformer作为骨干网络进行比较。实验结果表明,改进的卷积- swin变压器结合了注意机制和卷积结构的优点,在检测精度略有降低的情况下,检测速度提高了17.5%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信