Learning to detect unseen object classes by between-class attribute transfer

2009 IEEE Conference on Computer Vision and Pattern Recognition Pub Date : 2009-06-20 DOI:10.1109/CVPR.2009.5206594

Christoph H. Lampert, H. Nickisch, S. Harmeling

{"title":"Learning to detect unseen object classes by between-class attribute transfer","authors":"Christoph H. Lampert, H. Nickisch, S. Harmeling","doi":"10.1109/CVPR.2009.5206594","DOIUrl":null,"url":null,"abstract":"We study the problem of object classification when training and test classes are disjoint, i.e. no training examples of the target classes are available. This setup has hardly been studied in computer vision research, but it is the rule rather than the exception, because the world contains tens of thousands of different object classes and for only a very few of them image, collections have been formed and annotated with suitable class labels. In this paper, we tackle the problem by introducing attribute-based classification. It performs object detection based on a human-specified high-level description of the target objects instead of training images. The description consists of arbitrary semantic attributes, like shape, color or even geographic information. Because such properties transcend the specific learning task at hand, they can be pre-learned, e.g. from image datasets unrelated to the current task. Afterwards, new classes can be detected based on their attribute representation, without the need for a new training phase. In order to evaluate our method and to facilitate research in this area, we have assembled a new large-scale dataset, “Animals with Attributes”, of over 30,000 animal images that match the 50 classes in Osherson's classic table of how strongly humans associate 85 semantic attributes with animal classes. Our experiments show that by using an attribute layer it is indeed possible to build a learning object detection system that does not require any training images of the target classes.","PeriodicalId":386532,"journal":{"name":"2009 IEEE Conference on Computer Vision and Pattern Recognition","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2009-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2283","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE Conference on Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR.2009.5206594","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2283

Abstract

We study the problem of object classification when training and test classes are disjoint, i.e. no training examples of the target classes are available. This setup has hardly been studied in computer vision research, but it is the rule rather than the exception, because the world contains tens of thousands of different object classes and for only a very few of them image, collections have been formed and annotated with suitable class labels. In this paper, we tackle the problem by introducing attribute-based classification. It performs object detection based on a human-specified high-level description of the target objects instead of training images. The description consists of arbitrary semantic attributes, like shape, color or even geographic information. Because such properties transcend the specific learning task at hand, they can be pre-learned, e.g. from image datasets unrelated to the current task. Afterwards, new classes can be detected based on their attribute representation, without the need for a new training phase. In order to evaluate our method and to facilitate research in this area, we have assembled a new large-scale dataset, “Animals with Attributes”, of over 30,000 animal images that match the 50 classes in Osherson's classic table of how strongly humans associate 85 semantic attributes with animal classes. Our experiments show that by using an attribute layer it is indeed possible to build a learning object detection system that does not require any training images of the target classes.

查看原文本刊更多论文

学习通过类间属性转移来检测不可见的对象类

研究了训练类和测试类不相交，即没有目标类的训练样例时的目标分类问题。这种设置在计算机视觉研究中几乎没有被研究过，但这是一种规则而不是例外，因为世界上包含成千上万不同的对象类，只有极少数的图像，已经形成了集合，并用合适的类标签进行了注释。在本文中，我们通过引入基于属性的分类来解决这个问题。它基于人类指定的目标对象的高级描述而不是训练图像来执行目标检测。描述由任意的语义属性组成，如形状、颜色甚至地理信息。由于这些属性超越了手头的特定学习任务，因此可以预先学习它们，例如，从与当前任务无关的图像数据集中。然后，可以根据它们的属性表示来检测新的类，而不需要新的训练阶段。为了评估我们的方法并促进这一领域的研究，我们组装了一个新的大规模数据集，“具有属性的动物”，其中超过30,000个动物图像与Osherson的经典表中的50个类别相匹配，该表描述了人类如何将85个语义属性与动物类别强烈关联。我们的实验表明，通过使用属性层，确实可以构建一个不需要目标类的任何训练图像的学习对象检测系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE Conference on Computer Vision and Pattern Recognition

自引率

0.00%

发文量