An Efficient BoF Representation for Object Classification

Q4 Computer Science

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2021-12-16 DOI:10.5565/rev/elcvia.1403

V. Vinoharan, A. Ramanan

{"title":"An Efficient BoF Representation for Object Classification","authors":"V. Vinoharan, A. Ramanan","doi":"10.5565/rev/elcvia.1403","DOIUrl":null,"url":null,"abstract":"The Bag-of-features (BoF) approach has proved to yield better performance in a patch-based object classification system owing to its simplicity. However, often the very large number of patch-based descriptors (such as scale-invariant feature transform and speeded up robust features, extracted from images to create a BoF vector) leads to huge computational cost and an increased storage requirement. This paper demonstrates a two-staged approach to creating a discriminative and compact BoF representation for object classification. As a preprocessing stage to the codebook construction, ambiguous patch-based descriptors are eliminated using an entropy-based and one-pass feature selection approach, to retain high-quality descriptors. As a post-processing stage to the codebook construction, a subset of codewords which is not activated enough in images are eliminated from the initially constructed codebook based on statistical measures. Finally, each patch-based descriptor of an image is assigned to the closest codeword to create a histogram representation. One-versus-all support vector machine is applied to classify the histogram representation. The proposed methods are evaluated on benchmark image datasets. Testing results show that the proposed methods enables the codebook to be more discriminative and compact in moderate sized visual object classification tasks.","PeriodicalId":38711,"journal":{"name":"Electronic Letters on Computer Vision and Image Analysis","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electronic Letters on Computer Vision and Image Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5565/rev/elcvia.1403","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

Abstract

The Bag-of-features (BoF) approach has proved to yield better performance in a patch-based object classification system owing to its simplicity. However, often the very large number of patch-based descriptors (such as scale-invariant feature transform and speeded up robust features, extracted from images to create a BoF vector) leads to huge computational cost and an increased storage requirement. This paper demonstrates a two-staged approach to creating a discriminative and compact BoF representation for object classification. As a preprocessing stage to the codebook construction, ambiguous patch-based descriptors are eliminated using an entropy-based and one-pass feature selection approach, to retain high-quality descriptors. As a post-processing stage to the codebook construction, a subset of codewords which is not activated enough in images are eliminated from the initially constructed codebook based on statistical measures. Finally, each patch-based descriptor of an image is assigned to the closest codeword to create a histogram representation. One-versus-all support vector machine is applied to classify the histogram representation. The proposed methods are evaluated on benchmark image datasets. Testing results show that the proposed methods enables the codebook to be more discriminative and compact in moderate sized visual object classification tasks.

查看原文本刊更多论文

一种有效的对象分类BoF表示

事实证明，特征袋(BoF)方法由于其简单性，在基于补丁的目标分类系统中具有更好的性能。然而，通常非常大量的基于补丁的描述符(如尺度不变特征变换和加速鲁棒特征，从图像中提取以创建BoF向量)会导致巨大的计算成本和增加的存储需求。本文演示了一种两阶段的方法来创建用于对象分类的判别和紧凑的BoF表示。作为码本构建的预处理阶段，使用基于熵和一次特征选择的方法消除基于补丁的模糊描述符，以保留高质量的描述符。作为码本构造的后处理阶段，基于统计度量从初始构造的码本中剔除图像中未充分激活的码字子集。最后，将图像的每个基于补丁的描述符分配给最近的码字以创建直方图表示。采用单对全支持向量机对直方图表示进行分类。在基准图像数据集上对所提出的方法进行了评估。测试结果表明，该方法使码本在中等大小的视觉目标分类任务中具有更好的判别性和紧凑性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊