SAGE: Saliency-Guided Mixup with Optimal Rearrangements

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference Pub Date : 2022-10-31 DOI:10.48550/arXiv.2211.00113

A. Ma, Nikita Dvornik, Ran Zhang, Leila Pishdad, K. Derpanis, A. Fazly

{"title":"SAGE: Saliency-Guided Mixup with Optimal Rearrangements","authors":"A. Ma, Nikita Dvornik, Ran Zhang, Leila Pishdad, K. Derpanis, A. Fazly","doi":"10.48550/arXiv.2211.00113","DOIUrl":null,"url":null,"abstract":"Data augmentation is a key element for training accurate models by reducing overfitting and improving generalization. For image classification, the most popular data augmentation techniques range from simple photometric and geometrical transformations, to more complex methods that use visual saliency to craft new training examples. As augmentation methods get more complex, their ability to increase the test accuracy improves, yet, such methods become cumbersome, inefficient and lead to poor out-of-domain generalization, as we show in this paper. This motivates a new augmentation technique that allows for high accuracy gains while being simple, efficient (i.e., minimal computation overhead) and generalizable. To this end, we introduce Saliency-Guided Mixup with Optimal Rearrangements (SAGE), which creates new training examples by rearranging and mixing image pairs using visual saliency as guidance. By explicitly leveraging saliency, SAGE promotes discriminative foreground objects and produces informative new images useful for training. We demonstrate on CIFAR-10 and CIFAR-100 that SAGE achieves better or comparable performance to the state of the art while being more efficient. Additionally, evaluations in the out-of-distribution setting, and few-shot learning on mini-ImageNet, show that SAGE achieves improved generalization performance without trading off robustness.","PeriodicalId":72437,"journal":{"name":"BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference","volume":"2012 1","pages":"484"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2211.00113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Data augmentation is a key element for training accurate models by reducing overfitting and improving generalization. For image classification, the most popular data augmentation techniques range from simple photometric and geometrical transformations, to more complex methods that use visual saliency to craft new training examples. As augmentation methods get more complex, their ability to increase the test accuracy improves, yet, such methods become cumbersome, inefficient and lead to poor out-of-domain generalization, as we show in this paper. This motivates a new augmentation technique that allows for high accuracy gains while being simple, efficient (i.e., minimal computation overhead) and generalizable. To this end, we introduce Saliency-Guided Mixup with Optimal Rearrangements (SAGE), which creates new training examples by rearranging and mixing image pairs using visual saliency as guidance. By explicitly leveraging saliency, SAGE promotes discriminative foreground objects and produces informative new images useful for training. We demonstrate on CIFAR-10 and CIFAR-100 that SAGE achieves better or comparable performance to the state of the art while being more efficient. Additionally, evaluations in the out-of-distribution setting, and few-shot learning on mini-ImageNet, show that SAGE achieves improved generalization performance without trading off robustness.

查看原文本刊更多论文

SAGE:显著性导向的混合与最佳重新安排

数据增强是通过减少过拟合和提高泛化来训练准确模型的关键因素。对于图像分类，最流行的数据增强技术范围从简单的光度和几何变换，到使用视觉显著性来制作新的训练示例的更复杂的方法。随着增强方法变得越来越复杂，它们提高测试精度的能力也在提高，但正如我们在本文中所展示的那样，这些方法变得繁琐、低效，并且导致域外泛化能力差。这激发了一种新的增强技术，它允许在简单、高效(即最小的计算开销)和可推广的同时获得高精度。为此，我们引入了显著性引导混合与最优重排(SAGE)，它通过使用视觉显著性作为指导重新排列和混合图像对来创建新的训练样例。通过明确地利用显著性，SAGE促进了有区别的前景对象，并产生了对训练有用的信息丰富的新图像。我们在CIFAR-10和CIFAR-100上演示了SAGE在效率更高的同时实现了与最先进的性能更好或相当的性能。此外，在离分布设置下的评估，以及在mini-ImageNet上的少量学习，表明SAGE在不牺牲鲁棒性的情况下实现了更好的泛化性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference

自引率

0.00%

发文量