{"title":"Bag-of-visual-words Model for Image Classification Based on Spatial Semantic Distribution","authors":"Yong-Qin Li, Bu-Dong Xu, Hai-Di Sheng","doi":"10.1109/SPAC46244.2018.8965481","DOIUrl":null,"url":null,"abstract":"To satisfy the requirement of image classification in the application of image retrieval, a novel method of image representation based on bag-of-visual-words model is proposed in the paper to describe the spatial semantic distribution of associated features. Firstly, the extracted SIFT features are mapped into visual words including certain semantic information. According to spatial pyramid hierarchy, the specific region is divided with local features, and the spatial distribution of associated features is analyzed from different aspects and in various regions. In this way, the semantic phrases are established with local features. Next, the spatial semantic lexicon is constructed with sparse encoding of spatial semantic phrases, and the images are described with the form of sparse statistical histogram vectors. Finally, the vectors of images are classified with the classifier embedded with the improved bag-of-visual-words model. The experimental results show that the accuracy of image classification is significantly enhanced which is benefited from the Bag-of-visual-words model with spatial semantic distribution.","PeriodicalId":360369,"journal":{"name":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"294 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC46244.2018.8965481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
To satisfy the requirement of image classification in the application of image retrieval, a novel method of image representation based on bag-of-visual-words model is proposed in the paper to describe the spatial semantic distribution of associated features. Firstly, the extracted SIFT features are mapped into visual words including certain semantic information. According to spatial pyramid hierarchy, the specific region is divided with local features, and the spatial distribution of associated features is analyzed from different aspects and in various regions. In this way, the semantic phrases are established with local features. Next, the spatial semantic lexicon is constructed with sparse encoding of spatial semantic phrases, and the images are described with the form of sparse statistical histogram vectors. Finally, the vectors of images are classified with the classifier embedded with the improved bag-of-visual-words model. The experimental results show that the accuracy of image classification is significantly enhanced which is benefited from the Bag-of-visual-words model with spatial semantic distribution.