Reducing the need for bounding box annotations in Object Detection using Image Classification data

2021 34th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) Pub Date : 2021-10-01 DOI:10.1109/sibgrapi54419.2021.00035

Leonardo Blanger, N. Hirata, Xiaoyi Jiang

引用次数: 0

Abstract

We address the problem of training Object Detection models using significantly less bounding box annotated images. For that, we take advantage of cheaper and more abundant image classification data. Our proposal consists in automatically generating artificial detection samples, with no need of expensive detection level supervision, using images with classification labels only. We also detail a pretraining initialization strategy for detection architectures using these artificially synthesized samples, before finetuning on real detection data, and experimentally show how this consistently leads to more data efficient models. With the proposed approach, we were able to effectively use only classification data to improve results on the harder and more supervision hungry object detection problem. We achieve results equivalent to those of the full data scenario using only a small fraction of the original detection data for Face, Bird, and Car detection.

查看原文本刊更多论文

使用图像分类数据减少目标检测中对边界框注释的需求

我们解决了使用更少的边界框注释图像来训练目标检测模型的问题。为此，我们利用了更便宜、更丰富的图像分类数据。我们的建议是自动生成人工检测样本，不需要昂贵的检测级监督，只使用带有分类标签的图像。在对真实检测数据进行微调之前，我们还详细介绍了使用这些人工合成样本的检测架构的预训练初始化策略，并通过实验证明了这如何始终导致更高效的数据模型。通过提出的方法，我们能够有效地仅使用分类数据来改进更难和更需要监督的对象检测问题的结果。我们仅使用Face, Bird和Car检测的一小部分原始检测数据就获得了相当于完整数据场景的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 34th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

自引率

0.00%

发文量