{"title":"相似图像搜索的视觉词对","authors":"Yuan Li, Xiaochun Cao","doi":"10.1109/ICIG.2011.142","DOIUrl":null,"url":null,"abstract":"The state-of-the-art large scale image retrieval systems have mainly relied on two seminal works: the SIFT descriptor and bag-of-features (BOF) model. However, with the growth of image dataset, the discriminative power of SIFT descriptors was weakened rapidly when mapped to visual words. In this paper, we present a new approach to generate visual word pairs for image retrieval. Two different descriptors are employed to represent the same interest region, and then a visual word pair is obtained by quantizing the descriptor pair with two independent codebooks. By encoding different types of information of the same region, our approach can effectively boost the matching accuracy of descriptors. We evaluate our approach with INRIA Holidays dataset on a 120K image database, and the experiment results suggest that our approach significantly improved the retrieval performance of BOF model.","PeriodicalId":277974,"journal":{"name":"2011 Sixth International Conference on Image and Graphics","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Visual Word Pairs for Similar Image Search\",\"authors\":\"Yuan Li, Xiaochun Cao\",\"doi\":\"10.1109/ICIG.2011.142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The state-of-the-art large scale image retrieval systems have mainly relied on two seminal works: the SIFT descriptor and bag-of-features (BOF) model. However, with the growth of image dataset, the discriminative power of SIFT descriptors was weakened rapidly when mapped to visual words. In this paper, we present a new approach to generate visual word pairs for image retrieval. Two different descriptors are employed to represent the same interest region, and then a visual word pair is obtained by quantizing the descriptor pair with two independent codebooks. By encoding different types of information of the same region, our approach can effectively boost the matching accuracy of descriptors. We evaluate our approach with INRIA Holidays dataset on a 120K image database, and the experiment results suggest that our approach significantly improved the retrieval performance of BOF model.\",\"PeriodicalId\":277974,\"journal\":{\"name\":\"2011 Sixth International Conference on Image and Graphics\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 Sixth International Conference on Image and Graphics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIG.2011.142\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Sixth International Conference on Image and Graphics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIG.2011.142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The state-of-the-art large scale image retrieval systems have mainly relied on two seminal works: the SIFT descriptor and bag-of-features (BOF) model. However, with the growth of image dataset, the discriminative power of SIFT descriptors was weakened rapidly when mapped to visual words. In this paper, we present a new approach to generate visual word pairs for image retrieval. Two different descriptors are employed to represent the same interest region, and then a visual word pair is obtained by quantizing the descriptor pair with two independent codebooks. By encoding different types of information of the same region, our approach can effectively boost the matching accuracy of descriptors. We evaluate our approach with INRIA Holidays dataset on a 120K image database, and the experiment results suggest that our approach significantly improved the retrieval performance of BOF model.