Dissimilarity Representation Learning for Generalized Zero-Shot Recognition

Proceedings of the 26th ACM international conference on Multimedia Pub Date : 2018-10-15 DOI:10.1145/3240508.3240686

Gang Yang, Jinlu Liu, Jieping Xu, Xirong Li

引用次数: 20

Abstract

Generalized zero-shot learning (GZSL) aims to recognize any test instance coming either from a known class or from a novel class that has no training instance. To synthesize training instances for novel classes and thus resolving GZSL as a common classification problem, we propose a Dissimilarity Representation Learning (DSS) method. Dissimilarity representation is to represent a specific instance in terms of its (dis)similarity to other instances in a visual or attribute based feature space. In the dissimilarity space, instances of the novel classes are synthesized by an end-to-end optimized neural network. The neural network realizes two-level feature mappings and domain adaptions in the dissimilarity space and the attribute based feature space. Experimental results on five benchmark datasets, i.e., AWA, AWA$_2$, SUN, CUB, and aPY, show that the proposed method improves the state-of-the-art with a large margin, approximately 10% gain in terms of the harmonic mean of the top-1 accuracy. Consequently, this paper establishes a new baseline for GZSL.

查看原文本刊更多论文

广义零射击识别的不相似表示学习

广义零次学习(GZSL)旨在识别来自已知类或没有训练实例的新类的任何测试实例。为了综合新类别的训练实例，从而解决GZSL作为一个常见的分类问题，我们提出了一种不相似表示学习(DSS)方法。不相似表示是指在视觉或基于属性的特征空间中，根据其与其他实例的(非)相似度来表示特定实例。在不相似空间中，通过端到端优化的神经网络合成新类的实例。神经网络在不相似空间和基于属性的特征空间中实现了两级特征映射和领域自适应。在AWA、AWA$_2$、SUN、CUB和aPY等5个基准数据集上的实验结果表明，本文提出的方法在top-1精度的谐波平均值方面有较大的提高，提高幅度约为10%。因此，本文建立了GZSL的新基线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 26th ACM international conference on Multimedia

自引率

0.00%

发文量