{"title":"A comparative study on the local-pyramid approach for Content-Based Image Retrieval","authors":"Lin Feng, Anand Bilas Ray","doi":"10.1109/IVMSPW.2011.5970363","DOIUrl":null,"url":null,"abstract":"The local-pyramid approach for image representation and feature extraction is studied for the Content-Based Image Retrieval (CBIR). Lazebnik's pyramid matching kernels and the K-means clustering is used. The SIFT descriptor is deployed for feature extraction from the images, resulting in an efficient image representation scheme and reduction of the computational complexity. Histogram intersection is used to compute the similarity between the query image and the database images. The local-pyramid approach with a 3-level pyramid and a dictionary size of 100 achieves an average precision of 86.5% in retrieving images from the benchmark database, COREL 1K, and 77.35% for that with random image database.","PeriodicalId":405588,"journal":{"name":"2011 IEEE 10th IVMSP Workshop: Perception and Visual Signal Analysis","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE 10th IVMSP Workshop: Perception and Visual Signal Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IVMSPW.2011.5970363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The local-pyramid approach for image representation and feature extraction is studied for the Content-Based Image Retrieval (CBIR). Lazebnik's pyramid matching kernels and the K-means clustering is used. The SIFT descriptor is deployed for feature extraction from the images, resulting in an efficient image representation scheme and reduction of the computational complexity. Histogram intersection is used to compute the similarity between the query image and the database images. The local-pyramid approach with a 3-level pyramid and a dictionary size of 100 achieves an average precision of 86.5% in retrieving images from the benchmark database, COREL 1K, and 77.35% for that with random image database.