Improving small object detection via cross-layer attention

IF 6.3 3区 综合性期刊 Q1 Multidisciplinary
Ru Peng, Guoran Tan, Xingyu Chen, Xuguang Lan
{"title":"Improving small object detection via cross-layer attention","authors":"Ru Peng,&nbsp;Guoran Tan,&nbsp;Xingyu Chen,&nbsp;Xuguang Lan","doi":"10.1016/j.fmre.2022.09.037","DOIUrl":null,"url":null,"abstract":"<div><div>Small object detection is a fundamental and challenging topic in the computer vision community. To detect small objects in images, several methods rely on feature pyramid networks (FPN), which can alleviate the conflict between resolution and semantic information. However, the FPN-based methods also have limitations. First, existing methods only focus only on regions with close spatial distance, hindering the effectiveness of long-range interactions. Second, element-wise addition ignores the different perceptive fields of the two feature layers, thus causing higher-level features to introduce noise to the lower-level features. To address these problems, we propose a cross-layer attention (CLA) block as a generic block for capturing long-range dependencies and reducing noise from high-level features. Specifically, the CLA block performs feature fusion by factoring in both the channel and spatial dimensions, which provides a reliable way of fusing the features from different layers. Because CLA is a lightweight and general block, it can be plugged into most feature fusion frameworks. On the COCO 2017 dataset, we validated the CLA block by plugging it into several state-of-the-art FPN-based detectors. Experiments show that our approach achieves consistent improvements in both object detection and instance segmentation, which demonstrates the effectiveness of our approach.</div></div>","PeriodicalId":34602,"journal":{"name":"Fundamental Research","volume":"5 4","pages":"Pages 1737-1742"},"PeriodicalIF":6.3000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fundamental Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667325823000808","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0

Abstract

Small object detection is a fundamental and challenging topic in the computer vision community. To detect small objects in images, several methods rely on feature pyramid networks (FPN), which can alleviate the conflict between resolution and semantic information. However, the FPN-based methods also have limitations. First, existing methods only focus only on regions with close spatial distance, hindering the effectiveness of long-range interactions. Second, element-wise addition ignores the different perceptive fields of the two feature layers, thus causing higher-level features to introduce noise to the lower-level features. To address these problems, we propose a cross-layer attention (CLA) block as a generic block for capturing long-range dependencies and reducing noise from high-level features. Specifically, the CLA block performs feature fusion by factoring in both the channel and spatial dimensions, which provides a reliable way of fusing the features from different layers. Because CLA is a lightweight and general block, it can be plugged into most feature fusion frameworks. On the COCO 2017 dataset, we validated the CLA block by plugging it into several state-of-the-art FPN-based detectors. Experiments show that our approach achieves consistent improvements in both object detection and instance segmentation, which demonstrates the effectiveness of our approach.
跨层注意改进小目标检测
小目标检测是计算机视觉领域的一个基础和具有挑战性的课题。为了检测图像中的小目标,有几种方法依赖于特征金字塔网络(FPN),这种方法可以缓解分辨率和语义信息之间的冲突。然而,基于fpn的方法也有局限性。首先,现有方法只关注空间距离较近的区域,阻碍了远程交互的有效性。其次,元素相加忽略了两个特征层的不同感知场,从而导致较高级别的特征向较低级别的特征引入噪声。为了解决这些问题,我们提出了一个跨层注意力(CLA)块,作为捕获远程依赖关系和减少高级特征噪声的通用块。具体而言,CLA块通过考虑通道和空间维度来进行特征融合,为融合来自不同层的特征提供了可靠的方法。因为CLA是一个轻量级的通用块,所以它可以插入到大多数特性融合框架中。在COCO 2017数据集上,我们通过将CLA块插入几个最先进的基于fpn的检测器来验证CLA块。实验表明,该方法在目标检测和实例分割方面都取得了一致的改进,证明了该方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Fundamental Research
Fundamental Research Multidisciplinary-Multidisciplinary
CiteScore
4.00
自引率
1.60%
发文量
294
审稿时长
79 days
期刊介绍:
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信