Adaptive Dilated Network With Self-Correction Supervision for Counting

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2020-06-01 DOI:10.1109/cvpr42600.2020.00465

Shuai Bai, Zhiqun He, Y. Qiao, Hanzhe Hu, Wei Wu, Junjie Yan

{"title":"Adaptive Dilated Network With Self-Correction Supervision for Counting","authors":"Shuai Bai, Zhiqun He, Y. Qiao, Hanzhe Hu, Wei Wu, Junjie Yan","doi":"10.1109/cvpr42600.2020.00465","DOIUrl":null,"url":null,"abstract":"The counting problem aims to estimate the number of objects in images. Due to large scale variation and labeling deviations, it remains a challenging task. The static density map supervised learning framework is widely used in existing methods, which uses the Gaussian kernel to generate a density map as the learning target and utilizes the Euclidean distance to optimize the model. However, the framework is intolerable to the labeling deviations and can not reflect the scale variation. In this paper, we propose an adaptive dilated convolution and a novel supervised learning framework named self-correction (SC) supervision. In the supervision level, the SC supervision utilizes the outputs of the model to iteratively correct the annotations and employs the SC loss to simultaneously optimize the model from both the whole and the individuals. In the feature level, the proposed adaptive dilated convolution predicts a continuous value as the specific dilation rate for each location, which adapts the scale variation better than a discrete and static dilation rate. Extensive experiments illustrate that our approach has achieved a consistent improvement on four challenging benchmarks. Especially, our approach achieves better performance than the state-of-the-art methods on all benchmark datasets.","PeriodicalId":6715,"journal":{"name":"2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","volume":"32 1","pages":"4593-4602"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"125","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cvpr42600.2020.00465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 125

Abstract

The counting problem aims to estimate the number of objects in images. Due to large scale variation and labeling deviations, it remains a challenging task. The static density map supervised learning framework is widely used in existing methods, which uses the Gaussian kernel to generate a density map as the learning target and utilizes the Euclidean distance to optimize the model. However, the framework is intolerable to the labeling deviations and can not reflect the scale variation. In this paper, we propose an adaptive dilated convolution and a novel supervised learning framework named self-correction (SC) supervision. In the supervision level, the SC supervision utilizes the outputs of the model to iteratively correct the annotations and employs the SC loss to simultaneously optimize the model from both the whole and the individuals. In the feature level, the proposed adaptive dilated convolution predicts a continuous value as the specific dilation rate for each location, which adapts the scale variation better than a discrete and static dilation rate. Extensive experiments illustrate that our approach has achieved a consistent improvement on four challenging benchmarks. Especially, our approach achieves better performance than the state-of-the-art methods on all benchmark datasets.

查看原文本刊更多论文

计数自校正监督的自适应扩张网络

计数问题的目的是估计图像中物体的数量。由于大规模的变化和标记偏差，这仍然是一项具有挑战性的任务。静态密度图监督学习框架在现有方法中被广泛使用，它以高斯核生成密度图作为学习目标，利用欧几里得距离对模型进行优化。然而，该框架对标注偏差是不能容忍的，不能反映尺度的变化。本文提出了一种自适应扩展卷积和一种新的监督学习框架——自校正监督。在监督层面，SC监督利用模型的输出对标注进行迭代修正，并利用SC损失从整体和个体两方面同时优化模型。在特征层面，提出的自适应扩张卷积预测了一个连续的值作为每个位置的特定扩张率，比离散和静态的扩张率更能适应尺度变化。大量的实验表明，我们的方法在四个具有挑战性的基准上取得了持续的改进。特别是，我们的方法在所有基准数据集上都比最先进的方法实现了更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量