Class-Specific Simplex-Latent Dirichlet Allocation for Image Classification

2013 IEEE International Conference on Computer Vision Pub Date : 2013-12-01 DOI:10.1109/ICCV.2013.332

Mandar Dixit, Nikhil Rasiwasia, N. Vasconcelos

{"title":"Class-Specific Simplex-Latent Dirichlet Allocation for Image Classification","authors":"Mandar Dixit, Nikhil Rasiwasia, N. Vasconcelos","doi":"10.1109/ICCV.2013.332","DOIUrl":null,"url":null,"abstract":"An extension of the latent Dirichlet allocation (LDA), denoted class-specific-simplex LDA (css-LDA), is proposed for image classification. An analysis of the supervised LDA models currently used for this task shows that the impact of class information on the topics discovered by these models is very weak in general. This implies that the discovered topics are driven by general image regularities, rather than the semantic regularities of interest for classification. To address this, we introduce a model that induces supervision in topic discovery, while retaining the original flexibility of LDA to account for unanticipated structures of interest. The proposed css-LDA is an LDA model with class supervision at the level of image features. In css-LDA topics are discovered per class, i.e. a single set of topics shared across classes is replaced by multiple class-specific topic sets. This model can be used for generative classification using the Bayes decision rule or even extended to discriminative classification with support vector machines (SVMs). A css-LDA model can endow an image with a vector of class and topic specific count statistics that are similar to the Bag-of-words (BoW) histogram. SVM-based discriminants can be learned for classes in the space of these histograms. The effectiveness of css-LDA model in both generative and discriminative classification frameworks is demonstrated through an extensive experimental evaluation, involving multiple benchmark datasets, where it is shown to outperform all existing LDA based image classification approaches.","PeriodicalId":6351,"journal":{"name":"2013 IEEE International Conference on Computer Vision","volume":"20 1","pages":"2672-2679"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Conference on Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV.2013.332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

An extension of the latent Dirichlet allocation (LDA), denoted class-specific-simplex LDA (css-LDA), is proposed for image classification. An analysis of the supervised LDA models currently used for this task shows that the impact of class information on the topics discovered by these models is very weak in general. This implies that the discovered topics are driven by general image regularities, rather than the semantic regularities of interest for classification. To address this, we introduce a model that induces supervision in topic discovery, while retaining the original flexibility of LDA to account for unanticipated structures of interest. The proposed css-LDA is an LDA model with class supervision at the level of image features. In css-LDA topics are discovered per class, i.e. a single set of topics shared across classes is replaced by multiple class-specific topic sets. This model can be used for generative classification using the Bayes decision rule or even extended to discriminative classification with support vector machines (SVMs). A css-LDA model can endow an image with a vector of class and topic specific count statistics that are similar to the Bag-of-words (BoW) histogram. SVM-based discriminants can be learned for classes in the space of these histograms. The effectiveness of css-LDA model in both generative and discriminative classification frameworks is demonstrated through an extensive experimental evaluation, involving multiple benchmark datasets, where it is shown to outperform all existing LDA based image classification approaches.

查看原文本刊更多论文

分类分类的简单潜Dirichlet分配

提出了一种用于图像分类的潜狄利克雷分配(LDA)的扩展，即类特定单纯形LDA (css-LDA)。对目前用于此任务的监督LDA模型的分析表明，类信息对这些模型发现的主题的影响通常非常弱。这意味着发现的主题是由一般图像规律驱动的，而不是由分类感兴趣的语义规律驱动的。为了解决这个问题，我们引入了一个模型，该模型在主题发现中引入了监督，同时保留了LDA的原始灵活性，以解释意想不到的兴趣结构。本文提出的css-LDA是一种在图像特征层面上具有类监督的LDA模型。在css-LDA中，每个类都发现主题，即跨类共享的单一主题集被多个特定于类的主题集所取代。该模型可以用于贝叶斯决策规则的生成分类，甚至可以扩展到支持向量机(svm)的判别分类。css-LDA模型可以为图像赋予特定于类和主题的计数统计向量，类似于词袋直方图(BoW)。基于svm的判别器可以学习到这些直方图空间中的类。通过涉及多个基准数据集的广泛实验评估，证明了css-LDA模型在生成和判别分类框架中的有效性，其中它被证明优于所有现有的基于LDA的图像分类方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE International Conference on Computer Vision

自引率

0.00%

发文量