2012 13th International Workshop on Image Analysis for Multimedia Interactive Services: Latest Papers

How different kinds of sound in videos can influence gaze

2012 13th International Workshop on Image Analysis for Multimedia Interactive Services Pub Date : 2012-05-23 DOI: 10.1109/WIAMIS.2012.6226776

Guanghan Song, D. Pellerin, L. Granjon

Citations: 7

A skeleton based binarization approach for video text recognition

2012 13th International Workshop on Image Analysis for Multimedia Interactive Services Pub Date : 2012-05-23 DOI: 10.1109/WIAMIS.2012.6226754

Haojin Yang, B. Quehl, Harald Sack

Citations: 4

A machine learning approach to determining tag relevance in geotagged Flickr imagery

2012 13th International Workshop on Image Analysis for Multimedia Interactive Services Pub Date : 2012-05-01 DOI: 10.1109/WIAMIS.2012.6226774

Mark Hughes, N. O’Connor, G. Jones

Abstract: We present a novel machine-learning-based approach to determining the semantic relevance of community-contributed image annotations for the purposes of image retrieval. Current large-scale community image retrieval systems typically rely on human-annotated tags, which are subjectively assigned and may not provide useful or semantically meaningful labels for the images. Homogeneous tags that fail to distinguish between images are a common occurrence and can lead to poor search effectiveness on this data. We describe a method to improve text-based image retrieval systems by eliminating generic or non-relevant image tags. To classify tag relevance, we propose a novel feature set based on statistical information available for each tag within a collection of geotagged images harvested from Flickr. Using this feature set, machine learning models are trained to classify the relevance of each tag to its associated image. The goal of this process is to allow for rich and accurate captioning of these images, with the objective of improving the accuracy of text-based image retrieval systems. A thorough evaluation is carried out using a human-annotated benchmark collection of Flickr tags.

Citations: 3
