A Multi-object Detection Sampling Algorithm For Large Scenes

2022 IEEE 13th International Symposium on Parallel Architectures, Algorithms and Programming (PAAP) Pub Date : 2022-11-25 DOI:10.1109/PAAP56126.2022.10010614

Liang Jin, Xiaochuan Li, Baoyu Fan, Zhenhua Guo, Ruidong Li, Li Wang, Yanwei Wang, Yaqian Zhao, Rengang Li

{"title":"A Multi-object Detection Sampling Algorithm For Large Scenes","authors":"Liang Jin, Xiaochuan Li, Baoyu Fan, Zhenhua Guo, Ruidong Li, Li Wang, Yanwei Wang, Yaqian Zhao, Rengang Li","doi":"10.1109/PAAP56126.2022.10010614","DOIUrl":null,"url":null,"abstract":"Multi-object detection in large scenes aims to find objects in images, which usually contain more than one billion pixels. Based on the concept of dividing and conquering, the state-of-the-art (SOTA) methods slice the super-resolution image into patches first and then lower the image solution to detect objects later. The advantage of this method is that it can adapt quickly to regular detection algorithms. However, a set of parameters needs to be set manually, such as the size of sliding windows and overlap, which is quite hard to fit all scenarios. It may result in a loss of samples located at the boundary of the sliding window and the oversampling of inefficient samples that appear within the overlap. In this paper, we propose a object-oriented image sampling algorithm based on anchor boxes during training and multi-scale pyramids during inference. Inspired by the mature object detection baseline Scale-YOLOv4, we present more tricks to fit large scenes. The accuracy can reach 66%, which is 24 points higher than the CascadeRCNN model of the official backbone network ResNet50. Finally, we have won first place in the PANDA object detection tracking using this method.","PeriodicalId":336339,"journal":{"name":"2022 IEEE 13th International Symposium on Parallel Architectures, Algorithms and Programming (PAAP)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 13th International Symposium on Parallel Architectures, Algorithms and Programming (PAAP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PAAP56126.2022.10010614","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Multi-object detection in large scenes aims to find objects in images, which usually contain more than one billion pixels. Based on the concept of dividing and conquering, the state-of-the-art (SOTA) methods slice the super-resolution image into patches first and then lower the image solution to detect objects later. The advantage of this method is that it can adapt quickly to regular detection algorithms. However, a set of parameters needs to be set manually, such as the size of sliding windows and overlap, which is quite hard to fit all scenarios. It may result in a loss of samples located at the boundary of the sliding window and the oversampling of inefficient samples that appear within the overlap. In this paper, we propose a object-oriented image sampling algorithm based on anchor boxes during training and multi-scale pyramids during inference. Inspired by the mature object detection baseline Scale-YOLOv4, we present more tricks to fit large scenes. The accuracy can reach 66%, which is 24 points higher than the CascadeRCNN model of the official backbone network ResNet50. Finally, we have won first place in the PANDA object detection tracking using this method.

查看原文本刊更多论文

大场景下的多目标检测采样算法

大场景中的多目标检测的目标是在图像中找到目标，这些图像通常包含超过10亿像素。基于分割和征服的概念，最先进的SOTA方法首先将超分辨率图像分割成小块，然后降低图像解来检测目标。该方法的优点是可以快速适应常规的检测算法。但是，需要手动设置一组参数，例如滑动窗口的大小和重叠，这很难适应所有场景。这可能导致位于滑动窗口边界的样本丢失，以及重叠区域内出现的低效样本的过采样。本文提出了一种面向对象的图像采样算法，该算法在训练时基于锚盒，在推理时基于多尺度金字塔。受成熟的目标检测基线Scale-YOLOv4的启发，我们提出了更多的技巧来适应大场景。准确率可以达到66%，比官方骨干网ResNet50的CascadeRCNN模型高出24点。最后，我们在使用该方法的PANDA目标检测跟踪中获得了第一名。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 13th International Symposium on Parallel Architectures, Algorithms and Programming (PAAP)

自引率

0.00%

发文量