在弱监督对象定位中，小对象很重要

IF 5.5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neurocomputing Pub Date : 2025-06-10 DOI:10.1016/j.neucom.2025.130494

Dongjun Hwang , Seong Joon Oh , Junsuk Choe

{"title":"在弱监督对象定位中，小对象很重要","authors":"Dongjun Hwang , Seong Joon Oh , Junsuk Choe","doi":"10.1016/j.neucom.2025.130494","DOIUrl":null,"url":null,"abstract":"<div><div>Weakly-supervised object localization (WSOL) methods aim to capture the extent of the target object without full supervision such as bounding boxes or segmentation masks. Although numerous studies have been conducted in the research field of WSOL, we find that most existing methods are less effective at localizing small objects. In this paper, we first analyze why previous studies have overlooked this problem. Based on the analysis, we propose two remedies: (1) new evaluation metrics and a dataset to accurately measure localization performance for small objects, and (2) a novel consistency learning framework to zoom in on small objects so the model can perceive them more clearly. Our extensive experimental results demonstrate that the proposed method significantly improves small object localization on four different backbone networks and four different datasets, without sacrificing the performance of medium and large objects. In addition to these gains, our method can be easily applied to existing WSOL methods as it does not require any changes to the model architecture or data input pipeline. Code is available at <span><span>https://github.com/dongjunhwang/small_object_wsol</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"648 ","pages":"Article 130494"},"PeriodicalIF":5.5000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Small object matters in weakly supervised object localization\",\"authors\":\"Dongjun Hwang , Seong Joon Oh , Junsuk Choe\",\"doi\":\"10.1016/j.neucom.2025.130494\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Weakly-supervised object localization (WSOL) methods aim to capture the extent of the target object without full supervision such as bounding boxes or segmentation masks. Although numerous studies have been conducted in the research field of WSOL, we find that most existing methods are less effective at localizing small objects. In this paper, we first analyze why previous studies have overlooked this problem. Based on the analysis, we propose two remedies: (1) new evaluation metrics and a dataset to accurately measure localization performance for small objects, and (2) a novel consistency learning framework to zoom in on small objects so the model can perceive them more clearly. Our extensive experimental results demonstrate that the proposed method significantly improves small object localization on four different backbone networks and four different datasets, without sacrificing the performance of medium and large objects. In addition to these gains, our method can be easily applied to existing WSOL methods as it does not require any changes to the model architecture or data input pipeline. Code is available at <span><span>https://github.com/dongjunhwang/small_object_wsol</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"648 \",\"pages\":\"Article 130494\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-06-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S092523122501166X\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S092523122501166X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

弱监督对象定位（WSOL）方法的目的是在没有边界框或分割蒙版等完全监督的情况下捕获目标对象的范围。虽然在WSOL研究领域进行了大量的研究，但我们发现大多数现有的方法在小目标定位方面效果较差。在本文中，我们首先分析了为什么以前的研究忽视了这个问题。在此基础上，我们提出了两种补救措施：(1)新的评估指标和数据集来准确衡量小对象的定位性能；(2)新的一致性学习框架来放大小对象，使模型能够更清楚地感知它们。大量的实验结果表明，该方法在不牺牲大中型目标定位性能的前提下，显著提高了四种不同骨干网和四种不同数据集上的小目标定位性能。除了这些优点之外，我们的方法可以很容易地应用于现有的WSOL方法，因为它不需要对模型体系结构或数据输入管道进行任何更改。代码可从https://github.com/dongjunhwang/small_object_wsol获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Small object matters in weakly supervised object localization

Weakly-supervised object localization (WSOL) methods aim to capture the extent of the target object without full supervision such as bounding boxes or segmentation masks. Although numerous studies have been conducted in the research field of WSOL, we find that most existing methods are less effective at localizing small objects. In this paper, we first analyze why previous studies have overlooked this problem. Based on the analysis, we propose two remedies: (1) new evaluation metrics and a dataset to accurately measure localization performance for small objects, and (2) a novel consistency learning framework to zoom in on small objects so the model can perceive them more clearly. Our extensive experimental results demonstrate that the proposed method significantly improves small object localization on four different backbone networks and four different datasets, without sacrificing the performance of medium and large objects. In addition to these gains, our method can be easily applied to existing WSOL methods as it does not require any changes to the model architecture or data input pipeline. Code is available at https://github.com/dongjunhwang/small_object_wsol.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Neurocomputing 工程技术-计算机：人工智能

CiteScore

13.10

自引率

10.00%

发文量

1382

审稿时长

70 days

期刊介绍： Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.