Small object matters in weakly supervised object localization

IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Dongjun Hwang , Seong Joon Oh , Junsuk Choe
{"title":"Small object matters in weakly supervised object localization","authors":"Dongjun Hwang ,&nbsp;Seong Joon Oh ,&nbsp;Junsuk Choe","doi":"10.1016/j.neucom.2025.130494","DOIUrl":null,"url":null,"abstract":"<div><div>Weakly-supervised object localization (WSOL) methods aim to capture the extent of the target object without full supervision such as bounding boxes or segmentation masks. Although numerous studies have been conducted in the research field of WSOL, we find that most existing methods are less effective at localizing small objects. In this paper, we first analyze why previous studies have overlooked this problem. Based on the analysis, we propose two remedies: (1) new evaluation metrics and a dataset to accurately measure localization performance for small objects, and (2) a novel consistency learning framework to zoom in on small objects so the model can perceive them more clearly. Our extensive experimental results demonstrate that the proposed method significantly improves small object localization on four different backbone networks and four different datasets, without sacrificing the performance of medium and large objects. In addition to these gains, our method can be easily applied to existing WSOL methods as it does not require any changes to the model architecture or data input pipeline. Code is available at <span><span>https://github.com/dongjunhwang/small_object_wsol</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"648 ","pages":"Article 130494"},"PeriodicalIF":5.5000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S092523122501166X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Weakly-supervised object localization (WSOL) methods aim to capture the extent of the target object without full supervision such as bounding boxes or segmentation masks. Although numerous studies have been conducted in the research field of WSOL, we find that most existing methods are less effective at localizing small objects. In this paper, we first analyze why previous studies have overlooked this problem. Based on the analysis, we propose two remedies: (1) new evaluation metrics and a dataset to accurately measure localization performance for small objects, and (2) a novel consistency learning framework to zoom in on small objects so the model can perceive them more clearly. Our extensive experimental results demonstrate that the proposed method significantly improves small object localization on four different backbone networks and four different datasets, without sacrificing the performance of medium and large objects. In addition to these gains, our method can be easily applied to existing WSOL methods as it does not require any changes to the model architecture or data input pipeline. Code is available at https://github.com/dongjunhwang/small_object_wsol.
在弱监督对象定位中,小对象很重要
弱监督对象定位(WSOL)方法的目的是在没有边界框或分割蒙版等完全监督的情况下捕获目标对象的范围。虽然在WSOL研究领域进行了大量的研究,但我们发现大多数现有的方法在小目标定位方面效果较差。在本文中,我们首先分析了为什么以前的研究忽视了这个问题。在此基础上,我们提出了两种补救措施:(1)新的评估指标和数据集来准确衡量小对象的定位性能;(2)新的一致性学习框架来放大小对象,使模型能够更清楚地感知它们。大量的实验结果表明,该方法在不牺牲大中型目标定位性能的前提下,显著提高了四种不同骨干网和四种不同数据集上的小目标定位性能。除了这些优点之外,我们的方法可以很容易地应用于现有的WSOL方法,因为它不需要对模型体系结构或数据输入管道进行任何更改。代码可从https://github.com/dongjunhwang/small_object_wsol获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Neurocomputing
Neurocomputing 工程技术-计算机:人工智能
CiteScore
13.10
自引率
10.00%
发文量
1382
审稿时长
70 days
期刊介绍: Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信