基于格式塔理论的高级图像表示图像分类

020 1st International Conference on Communications, Control Systems and Signal Processing (CCSSP) Pub Date : 2020-05-01 DOI:10.1109/CCSSP49278.2020.9151715

A. Sabrina, Souami Feryel, Belattar Khadidja

{"title":"基于格式塔理论的高级图像表示图像分类","authors":"A. Sabrina, Souami Feryel, Belattar Khadidja","doi":"10.1109/CCSSP49278.2020.9151715","DOIUrl":null,"url":null,"abstract":"Feature extraction and representation are crucial stages for image classification. However, the derived features might not be robust discriminators, which radically alter image classification results. In this paper, we present an enhanced Bag of visual Words representation for better image classification. We are inspired by the Gestalt laws of grouping.So, our design goals are:•Introduce the spatial information into Bag of Words representation by grouping the aligned visual words (high order features),•Preserve the spatial relationships among the visual words groups against different geometric transformations,•Achieve the high-level representation of visual words by extracting visual phrases. Theses phrases convey the semantic meaning of object parts.The proposed approach has been evaluated, in terms of classification accuracy, on Caltech101 image database. Experimental results indicate that the classification performances are improved by using visual phrases.","PeriodicalId":401063,"journal":{"name":"020 1st International Conference on Communications, Control Systems and Signal Processing (CCSSP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"High-level image representation-based on Gestalt theory for image classification\",\"authors\":\"A. Sabrina, Souami Feryel, Belattar Khadidja\",\"doi\":\"10.1109/CCSSP49278.2020.9151715\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature extraction and representation are crucial stages for image classification. However, the derived features might not be robust discriminators, which radically alter image classification results. In this paper, we present an enhanced Bag of visual Words representation for better image classification. We are inspired by the Gestalt laws of grouping.So, our design goals are:•Introduce the spatial information into Bag of Words representation by grouping the aligned visual words (high order features),•Preserve the spatial relationships among the visual words groups against different geometric transformations,•Achieve the high-level representation of visual words by extracting visual phrases. Theses phrases convey the semantic meaning of object parts.The proposed approach has been evaluated, in terms of classification accuracy, on Caltech101 image database. Experimental results indicate that the classification performances are improved by using visual phrases.\",\"PeriodicalId\":401063,\"journal\":{\"name\":\"020 1st International Conference on Communications, Control Systems and Signal Processing (CCSSP)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"020 1st International Conference on Communications, Control Systems and Signal Processing (CCSSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCSSP49278.2020.9151715\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"020 1st International Conference on Communications, Control Systems and Signal Processing (CCSSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCSSP49278.2020.9151715","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

特征提取和表示是图像分类的关键环节。然而，衍生的特征可能不是鲁棒鉴别器，这从根本上改变了图像分类结果。在本文中，我们提出了一种增强的Bag视觉词表示，以更好地进行图像分类。我们受到完形法则的启发。因此，我们的设计目标是:•通过对对齐的视觉词(高阶特征)进行分组，将空间信息引入到Bag of Words表示中;•通过不同的几何变换，保持视觉词组之间的空间关系;•通过提取视觉短语，实现视觉词的高级表示。这些短语表达了对象部分的语义含义。在Caltech101图像数据库上对该方法的分类精度进行了评价。实验结果表明，使用视觉短语可以提高分类性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

High-level image representation-based on Gestalt theory for image classification

Feature extraction and representation are crucial stages for image classification. However, the derived features might not be robust discriminators, which radically alter image classification results. In this paper, we present an enhanced Bag of visual Words representation for better image classification. We are inspired by the Gestalt laws of grouping.So, our design goals are:•Introduce the spatial information into Bag of Words representation by grouping the aligned visual words (high order features),•Preserve the spatial relationships among the visual words groups against different geometric transformations,•Achieve the high-level representation of visual words by extracting visual phrases. Theses phrases convey the semantic meaning of object parts.The proposed approach has been evaluated, in terms of classification accuracy, on Caltech101 image database. Experimental results indicate that the classification performances are improved by using visual phrases.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

020 1st International Conference on Communications, Control Systems and Signal Processing (CCSSP)

自引率

0.00%

发文量