Finding Things: Image Parsing with Regions and Per-Exemplar Detectors

2013 IEEE Conference on Computer Vision and Pattern Recognition Pub Date : 2013-06-23 DOI:10.1109/CVPR.2013.386

Joseph Tighe, S. Lazebnik

引用次数: 227

Abstract

This paper presents a system for image parsing, or labeling each pixel in an image with its semantic category, aimed at achieving broad coverage across hundreds of object categories, many of them sparsely sampled. The system combines region-level features with per-exemplar sliding window detectors. Per-exemplar detectors are better suited for our parsing task than traditional bounding box detectors: they perform well on classes with little training data and high intra-class variation, and they allow object masks to be transferred into the test image for pixel-level segmentation. The proposed system achieves state-of-the-art accuracy on three challenging datasets, the largest of which contains 45,676 images and 232 labels.

查看原文本刊更多论文

寻找事物:使用区域和每样例检测器进行图像解析

本文提出了一个用于图像解析的系统，或用其语义类别标记图像中的每个像素，旨在实现数百个对象类别的广泛覆盖，其中许多是稀疏采样的。该系统结合了区域级特征和每样例滑动窗口检测器。每样例检测器比传统的边界盒检测器更适合我们的解析任务:它们在训练数据少、类内变化大的类上表现良好，并且它们允许将对象掩码转移到测试图像中进行像素级分割。该系统在三个具有挑战性的数据集上实现了最先进的精度，其中最大的数据集包含45,676张图像和232个标签。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE Conference on Computer Vision and Pattern Recognition

自引率

0.00%

发文量