{"title":"图像检索和分类中颜色和局部描述符的鲁棒融合","authors":"A. Alzu’bi, A. Amira, N. Ramzan, Tareq Jaber","doi":"10.1109/IWSSIP.2015.7314224","DOIUrl":null,"url":null,"abstract":"This paper introduces an optimized image descriptor that combines both global and local features for image retrieval and classification. Color histograms in HSV space are extracted and quantized as global features, while root scale-invariant feature transform (rootSIFT) descriptors are densely extracted as local features. The extracted features are fused and reduced to obtain a lower-dimensional descriptor and discriminate the underlying variances of data. Image descriptors are encoded by the visual locally aggregated features (VLAD) approach. The Corel image dataset is used for evaluation and benchmarking. The experimental results show that the proposed descriptor improves the classification accuracy by 5% as well as the retrieval accuracy by 10% and 20% over rootSIFT and HSV, respectively. Additionally, the retrieval model outperforms many state-of-the-art approaches.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Robust fusion of color and local descriptors for image retrieval and classification\",\"authors\":\"A. Alzu’bi, A. Amira, N. Ramzan, Tareq Jaber\",\"doi\":\"10.1109/IWSSIP.2015.7314224\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper introduces an optimized image descriptor that combines both global and local features for image retrieval and classification. Color histograms in HSV space are extracted and quantized as global features, while root scale-invariant feature transform (rootSIFT) descriptors are densely extracted as local features. The extracted features are fused and reduced to obtain a lower-dimensional descriptor and discriminate the underlying variances of data. Image descriptors are encoded by the visual locally aggregated features (VLAD) approach. The Corel image dataset is used for evaluation and benchmarking. The experimental results show that the proposed descriptor improves the classification accuracy by 5% as well as the retrieval accuracy by 10% and 20% over rootSIFT and HSV, respectively. Additionally, the retrieval model outperforms many state-of-the-art approaches.\",\"PeriodicalId\":249021,\"journal\":{\"name\":\"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IWSSIP.2015.7314224\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWSSIP.2015.7314224","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Robust fusion of color and local descriptors for image retrieval and classification
This paper introduces an optimized image descriptor that combines both global and local features for image retrieval and classification. Color histograms in HSV space are extracted and quantized as global features, while root scale-invariant feature transform (rootSIFT) descriptors are densely extracted as local features. The extracted features are fused and reduced to obtain a lower-dimensional descriptor and discriminate the underlying variances of data. Image descriptors are encoded by the visual locally aggregated features (VLAD) approach. The Corel image dataset is used for evaluation and benchmarking. The experimental results show that the proposed descriptor improves the classification accuracy by 5% as well as the retrieval accuracy by 10% and 20% over rootSIFT and HSV, respectively. Additionally, the retrieval model outperforms many state-of-the-art approaches.