A Latent Variable Augmentation Method for Image Categorization with Insufficient Training Samples

ACM Transactions on Knowledge Discovery from Data (TKDD) Pub Date : 2021-07-03 DOI:10.1145/3451165

Luyue Lin, Xin Zheng, Bo Liu, Wei Chen, Yanshan Xiao

{"title":"A Latent Variable Augmentation Method for Image Categorization with Insufficient Training Samples","authors":"Luyue Lin, Xin Zheng, Bo Liu, Wei Chen, Yanshan Xiao","doi":"10.1145/3451165","DOIUrl":null,"url":null,"abstract":"Over the past few years, we have made great progress in image categorization based on convolutional neural networks (CNNs). These CNNs are always trained based on a large-scale image data set; however, people may only have limited training samples for training CNN in the real-world applications. To solve this problem, one intuition is augmenting training samples. In this article, we propose an algorithm called Lavagan (Latent Variables Augmentation Method based on Generative Adversarial Nets) to improve the performance of CNN with insufficient training samples. The proposed Lavagan method is mainly composed of two tasks. The first task is that we augment a number latent variables (LVs) from a set of adaptive and constrained LVs distributions. In the second task, we take the augmented LVs into the training procedure of the image classifier. By taking these two tasks into account, we propose a uniform objective function to incorporate the two tasks into the learning. We then put forward an alternative two-play minimization game to minimize this uniform loss function such that we can obtain the predictive classifier. Moreover, based on Hoeffding’s Inequality and Chernoff Bounding method, we analyze the feasibility and efficiency of the proposed Lavagan method, which manifests that the LV augmentation method is able to improve the performance of Lavagan with insufficient training samples. Finally, the experiment has shown that the proposed Lavagan method is able to deliver more accurate performance than the existing state-of-the-art methods.","PeriodicalId":435653,"journal":{"name":"ACM Transactions on Knowledge Discovery from Data (TKDD)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Knowledge Discovery from Data (TKDD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3451165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Over the past few years, we have made great progress in image categorization based on convolutional neural networks (CNNs). These CNNs are always trained based on a large-scale image data set; however, people may only have limited training samples for training CNN in the real-world applications. To solve this problem, one intuition is augmenting training samples. In this article, we propose an algorithm called Lavagan (Latent Variables Augmentation Method based on Generative Adversarial Nets) to improve the performance of CNN with insufficient training samples. The proposed Lavagan method is mainly composed of two tasks. The first task is that we augment a number latent variables (LVs) from a set of adaptive and constrained LVs distributions. In the second task, we take the augmented LVs into the training procedure of the image classifier. By taking these two tasks into account, we propose a uniform objective function to incorporate the two tasks into the learning. We then put forward an alternative two-play minimization game to minimize this uniform loss function such that we can obtain the predictive classifier. Moreover, based on Hoeffding’s Inequality and Chernoff Bounding method, we analyze the feasibility and efficiency of the proposed Lavagan method, which manifests that the LV augmentation method is able to improve the performance of Lavagan with insufficient training samples. Finally, the experiment has shown that the proposed Lavagan method is able to deliver more accurate performance than the existing state-of-the-art methods.

查看原文本刊更多论文

一种训练样本不足的图像分类潜变量增强方法

在过去的几年里，我们在基于卷积神经网络(cnn)的图像分类方面取得了很大的进展。这些cnn总是基于大规模的图像数据集进行训练;然而，人们可能只有有限的训练样本来训练CNN在现实世界的应用。为了解决这个问题，一种直觉是增加训练样本。在本文中，我们提出了一种名为Lavagan (Latent Variables Augmentation Method based on Generative Adversarial Nets)的算法来改善训练样本不足的CNN的性能。提出的Lavagan方法主要由两个任务组成。第一个任务是我们从一组自适应和约束的潜在变量分布中增加一些潜在变量(lv)。在第二个任务中，我们将增强lv引入到图像分类器的训练过程中。考虑到这两个任务，我们提出了一个统一的目标函数，将这两个任务合并到学习中。然后，我们提出了一种备选的两局最小化对策，以最小化该均匀损失函数，从而获得预测分类器。此外，基于Hoeffding不等式和Chernoff边界法，我们分析了所提出的Lavagan方法的可行性和效率，表明LV增强方法能够在训练样本不足的情况下提高Lavagan的性能。最后，实验表明，所提出的Lavagan方法能够提供比现有的最先进的方法更准确的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Knowledge Discovery from Data (TKDD)

自引率

0.00%

发文量