Nenad Tomašev, Doni Pracner, R. Brehar, Miloš Radovanović, D. Mladenić, M. Ivanović, S. Nedevschi
Published in: 2013 IEEE 9th International Conference on Intelligent Computer Communication and Processing (ICCP), 2013-10-24. DOI: 10.1109/ICCP.2013.6646097
Object recognition in WIKImage data based on local invariant image features
Object recognition is an essential task in content-based image retrieval and classification. This paper deals with object recognition in WIKImage data, a collection of publicly available annotated Wikipedia images. WIKImage comprises a set of 14 binary classification problems with significant class imbalance. Our approach is based on local invariant image features, and we have compared three standard and widely used feature types: SIFT, SURF and ORB. We have examined how the choice of representation affects the k-nearest neighbor data topology and have shown that some feature types might be more appropriate than others for this particular problem. In order to assess the difficulty of the data, we have evaluated seven different k-nearest neighbor classification methods and shown that the recently proposed hubness-aware classifiers might be used to increase either the accuracy of prediction or the macro-averaged F-score. However, our results indicate that further improvements are possible and that including textual feature information might prove beneficial for system performance.
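Two quantities named in the abstract can be made concrete with a small sketch: the k-nearest neighbor topology, summarized here by how often each point occurs in other points' neighbor lists (the k-occurrence count commonly used to describe hubness), and the macro-averaged F-score, which weights every class equally regardless of its size. The code below is only an illustration under stated assumptions. It uses toy 2D points in place of image representations, plain Euclidean distance, and hypothetical function names (`k_occurrences`, `skewness`, `macro_f1`). It does not reproduce the paper's SIFT, SURF or ORB pipelines or any of its hubness-aware classifiers.

```python
import math
from collections import Counter


def knn_indices(points, k):
    """For each point, return the indices of its k nearest neighbors (Euclidean)."""
    neighbors = []
    for i, p in enumerate(points):
        # Sort all other points by (distance, index); ties go to the lower index.
        dists = sorted((math.dist(p, q), j) for j, q in enumerate(points) if j != i)
        neighbors.append([j for _, j in dists[:k]])
    return neighbors


def k_occurrences(points, k):
    """Count how often each point appears in the k-NN lists of the other points."""
    counts = Counter(j for nbrs in knn_indices(points, k) for j in nbrs)
    return [counts.get(i, 0) for i in range(len(points))]


def skewness(values):
    """Third standardized moment; a strongly positive value suggests a few hubs."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / n
    if var == 0:
        return 0.0
    return sum((v - mean) ** 3 for v in values) / n / var ** 1.5


def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in labels:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # A class with no true or predicted members contributes 0 by convention here.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Toy stand-in for image representations (an assumption, not WIKImage data).
    pts = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (10.0, 0.0)]
    occ = k_occurrences(pts, k=1)
    print("k-occurrences:", occ, "skewness:", skewness(occ))

    # Imbalanced binary problem: always predicting the majority class.
    y_true = [1] + [0] * 9
    y_pred = [0] * 10
    accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    print("accuracy:", accuracy, "macro F1:", macro_f1(y_true, y_pred))
```

In the imbalanced toy case, a classifier that always predicts the majority class reaches 90% accuracy but a macro-averaged F-score below 0.5, which is one reason both measures are worth reporting on class-imbalanced problems such as the 14 binary tasks described above.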