{"title":"视觉词汇构建的空间语境","authors":"Ge Zhou, Zhiyong Wang, Jiajun Wang, D. Feng","doi":"10.1109/IASP.2010.5476136","DOIUrl":null,"url":null,"abstract":"The bag-of-visual-words model has been widely used in many applications, such as object recognition, image categorization, and visual information retrieval. However, most existing approaches construct a visual vocabulary by simply clustering image regions represented with low-level visual features, where spatial context of image regions has not been well utilized. In this paper, we present two techniques to take such a context into account. One is based on the Self-Organizing Map for Adaptive Processing of Structured Data (SOM-SD), and the other is based on our proposed Hierarchical Fuzzy C-means with Spatial Constraints (FCM-HS). We have employed these two methods together with language modeling for image categorization. Experimental results obtained on Caltech dataset have demonstrated that these two methods can achieve better classification performance than those without considering spatial context. The comparison of these two methods is also discussed in this paper.","PeriodicalId":223866,"journal":{"name":"2010 International Conference on Image Analysis and Signal Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Spatial context for visual vocabulary construction\",\"authors\":\"Ge Zhou, Zhiyong Wang, Jiajun Wang, D. Feng\",\"doi\":\"10.1109/IASP.2010.5476136\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The bag-of-visual-words model has been widely used in many applications, such as object recognition, image categorization, and visual information retrieval. However, most existing approaches construct a visual vocabulary by simply clustering image regions represented with low-level visual features, where spatial context of image regions has not been well utilized. In this paper, we present two techniques to take such a context into account. One is based on the Self-Organizing Map for Adaptive Processing of Structured Data (SOM-SD), and the other is based on our proposed Hierarchical Fuzzy C-means with Spatial Constraints (FCM-HS). We have employed these two methods together with language modeling for image categorization. Experimental results obtained on Caltech dataset have demonstrated that these two methods can achieve better classification performance than those without considering spatial context. The comparison of these two methods is also discussed in this paper.\",\"PeriodicalId\":223866,\"journal\":{\"name\":\"2010 International Conference on Image Analysis and Signal Processing\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Image Analysis and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IASP.2010.5476136\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Image Analysis and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IASP.2010.5476136","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Spatial context for visual vocabulary construction
The bag-of-visual-words model has been widely used in many applications, such as object recognition, image categorization, and visual information retrieval. However, most existing approaches construct a visual vocabulary by simply clustering image regions represented with low-level visual features, where spatial context of image regions has not been well utilized. In this paper, we present two techniques to take such a context into account. One is based on the Self-Organizing Map for Adaptive Processing of Structured Data (SOM-SD), and the other is based on our proposed Hierarchical Fuzzy C-means with Spatial Constraints (FCM-HS). We have employed these two methods together with language modeling for image categorization. Experimental results obtained on Caltech dataset have demonstrated that these two methods can achieve better classification performance than those without considering spatial context. The comparison of these two methods is also discussed in this paper.