基于视觉词袋方法的子词典图像表示

G. V. Pedrosa, A. Traina, C. Traina
{"title":"基于视觉词袋方法的子词典图像表示","authors":"G. V. Pedrosa, A. Traina, C. Traina","doi":"10.1109/CBMS.2014.60","DOIUrl":null,"url":null,"abstract":"Bag-of-Visual-Words (BoVW) is a well known approach to represent images for visual recognition and retrieval tasks. This approach represents an image as a histogram of visual words and the dissimilarity between two images is measured by comparing those histograms. When performing comparisons involving a specific type of images, some visual words can be more informative and discriminative than others. To take advantage of this fact, assigning appropriate weights can improve the performance of image retrieval. In this paper, we developed a novel modeling approach based on sub dictionaries. We extracted a sub-dictionary as a subset of visual words that best represents a specific image class. To measure the dissimilarity distance between images, we take into account the distance of the histogram obtained using the visual dictionary and the distances of the sub histograms obtained by each sub-dictionary. The proposed approach was evaluated by classifying a standard biomedical image dataset into categories defined by image modality and body part and also natural image scenes. The experimental results demonstrate the gain obtained of the proposed weighting approach when compared to the traditional weighting approach based on TF-IDF (Term Frequency-Inverse Document Frequency). Our proposed approach has shown promising results to boost the classification accuracy as well as the retrieval precision. Moreover, it does that without increasing the feature vector dimensionality.","PeriodicalId":398710,"journal":{"name":"2014 IEEE 27th International Symposium on Computer-Based Medical Systems","volume":"9 20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Using Sub-dictionaries for Image Representation Based on the Bag-of-Visual-Words Approach\",\"authors\":\"G. V. Pedrosa, A. Traina, C. Traina\",\"doi\":\"10.1109/CBMS.2014.60\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Bag-of-Visual-Words (BoVW) is a well known approach to represent images for visual recognition and retrieval tasks. This approach represents an image as a histogram of visual words and the dissimilarity between two images is measured by comparing those histograms. When performing comparisons involving a specific type of images, some visual words can be more informative and discriminative than others. To take advantage of this fact, assigning appropriate weights can improve the performance of image retrieval. In this paper, we developed a novel modeling approach based on sub dictionaries. We extracted a sub-dictionary as a subset of visual words that best represents a specific image class. To measure the dissimilarity distance between images, we take into account the distance of the histogram obtained using the visual dictionary and the distances of the sub histograms obtained by each sub-dictionary. The proposed approach was evaluated by classifying a standard biomedical image dataset into categories defined by image modality and body part and also natural image scenes. The experimental results demonstrate the gain obtained of the proposed weighting approach when compared to the traditional weighting approach based on TF-IDF (Term Frequency-Inverse Document Frequency). Our proposed approach has shown promising results to boost the classification accuracy as well as the retrieval precision. Moreover, it does that without increasing the feature vector dimensionality.\",\"PeriodicalId\":398710,\"journal\":{\"name\":\"2014 IEEE 27th International Symposium on Computer-Based Medical Systems\",\"volume\":\"9 20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IEEE 27th International Symposium on Computer-Based Medical Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMS.2014.60\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 27th International Symposium on Computer-Based Medical Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2014.60","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

视觉词袋(BoVW)是一种众所周知的用于视觉识别和检索任务的图像表示方法。该方法将图像表示为视觉词的直方图,并通过比较直方图来测量两幅图像之间的不同之处。当对特定类型的图像进行比较时,一些视觉词汇可能比其他词汇更具信息性和歧视性。为了利用这一事实,分配适当的权重可以提高图像检索的性能。在本文中,我们开发了一种新的基于子字典的建模方法。我们提取了一个子字典,作为最能代表特定图像类的视觉词的子集。为了测量图像之间的不相似距离,我们考虑了使用视觉字典获得的直方图的距离和每个子字典获得的子直方图的距离。通过将标准生物医学图像数据集分类为图像模态和身体部位定义的类别以及自然图像场景来评估所提出的方法。实验结果表明,与基于TF-IDF (Term Frequency- inverse Document Frequency)的传统加权方法相比,该加权方法获得了较大的增益。该方法在提高分类精度和检索精度方面取得了良好的效果。此外,它在不增加特征向量维数的情况下做到了这一点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using Sub-dictionaries for Image Representation Based on the Bag-of-Visual-Words Approach
Bag-of-Visual-Words (BoVW) is a well known approach to represent images for visual recognition and retrieval tasks. This approach represents an image as a histogram of visual words and the dissimilarity between two images is measured by comparing those histograms. When performing comparisons involving a specific type of images, some visual words can be more informative and discriminative than others. To take advantage of this fact, assigning appropriate weights can improve the performance of image retrieval. In this paper, we developed a novel modeling approach based on sub dictionaries. We extracted a sub-dictionary as a subset of visual words that best represents a specific image class. To measure the dissimilarity distance between images, we take into account the distance of the histogram obtained using the visual dictionary and the distances of the sub histograms obtained by each sub-dictionary. The proposed approach was evaluated by classifying a standard biomedical image dataset into categories defined by image modality and body part and also natural image scenes. The experimental results demonstrate the gain obtained of the proposed weighting approach when compared to the traditional weighting approach based on TF-IDF (Term Frequency-Inverse Document Frequency). Our proposed approach has shown promising results to boost the classification accuracy as well as the retrieval precision. Moreover, it does that without increasing the feature vector dimensionality.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信