Proceedings of the 2016 ACM Workshop on Multimedia COMMONS: Latest Publications

YFCC100M HybridNet fc6 Deep Features for Content-Based Image Retrieval
Proceedings of the 2016 ACM Workshop on Multimedia COMMONS Pub Date : 2016-10-16 DOI: 10.1145/2983554.2983557
Giuseppe Amato, F. Falchi, C. Gennaro, F. Rabitti
This paper presents a corpus of deep features extracted from the YFCC100M images, taken from the fc6 hidden-layer activations of the HybridNet deep convolutional neural network. For a set of randomly selected queries, we make available k-NN results obtained by sequentially scanning the entire feature set, comparing both with the Euclidean distance and with the Hamming distance on a binarized version of the features. This result set serves as ground truth for evaluating Content-Based Image Retrieval (CBIR) systems that use approximate similarity search methods for efficient and scalable indexing. Moreover, we present experimental results obtained by indexing this corpus with two distinct approaches: the Metric Inverted File and Lucene Quantization. These two CBIR systems are publicly available online, allowing real-time search using both internal and external queries.
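The abstract's sequential-scan baseline can be sketched with toy data. Everything here is an illustrative assumption rather than the paper's setup: real HybridNet fc6 vectors are 4096-dimensional, and thresholding at zero is just one common way to binarize ReLU-style activations.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins for fc6 deep features (real fc6 vectors are 4096-D).
features = rng.standard_normal((1000, 64)).astype(np.float32)
query = rng.standard_normal(64).astype(np.float32)

def knn_euclidean(query, features, k=5):
    """Exact k-NN by sequentially scanning all features with Euclidean distance."""
    dists = np.linalg.norm(features - query, axis=1)
    return np.argsort(dists)[:k]

def binarize(x):
    """Threshold at zero -- one common way to binarize ReLU-style activations."""
    return x > 0

def knn_hamming(query, features, k=5):
    """k-NN on binarized features using Hamming distance (count of differing bits)."""
    q = binarize(query)
    b = binarize(features)
    dists = (b != q).sum(axis=1)
    return np.argsort(dists, kind="stable")[:k]

print(knn_euclidean(query, features))
print(knn_hamming(query, features))
```

Scanning with Hamming distance on bit vectors trades some accuracy for much cheaper comparisons, which is why the binarized results are provided alongside the Euclidean ones as ground truth.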
Citations: 15
Developing Benchmarks: The Importance of the Process and New Paradigms
Proceedings of the 2016 ACM Workshop on Multimedia COMMONS Pub Date : 2016-10-16 DOI: 10.1145/2983554.2983562
R. Ordelman
The value and importance of benchmark evaluations is widely acknowledged, and benchmarks play a key role in many research projects. Establishing a sound evaluation framework takes time, a well-balanced team of domain specialists (preferably with links to the user community and industry), and strong involvement of the research community itself. Such a framework includes (annotated) data sets, well-defined tasks that reflect needs in the 'real world', a proper evaluation methodology, ground truth, a strategy for repeated assessments, and, last but not least, funding. Although the benefits of an evaluation framework are typically reviewed from the perspective of 'research output' (e.g., a scientific publication demonstrating an advance of a certain methodology), it is important to be aware of the value of the process of creating a benchmark itself: it significantly increases our understanding of the problem we want to address and, as a consequence, the impact of the evaluation outcomes. In this talk I will review the history of a series of tasks focusing on audiovisual search, emphasizing their 'multimodal' aspects, starting in 2006 with the workshop on 'Searching Spontaneous Conversational Speech' that led to tasks in CLEF and MediaEval ("Search and Hyperlinking"), and recently also TRECVid ("Video Hyperlinking"). The focus of my talk will be on the process rather than on the results of these evaluations themselves, and will address cross-benchmark connections and new benchmark paradigms, specifically the integration of benchmarking in industrial 'living labs' that are becoming popular in some domains.
Citations: 0
In-depth Exploration of Geotagging Performance using Sampling Strategies on YFCC100M
Proceedings of the 2016 ACM Workshop on Multimedia COMMONS Pub Date : 2016-10-16 DOI: 10.1145/2983554.2983558
Giorgos Kordopatis-Zilos, S. Papadopoulos, Y. Kompatsiaris
Evaluating multimedia analysis and retrieval systems is a highly challenging task, whose outcomes can be highly volatile depending on the selected test collection. In this paper, we focus on the problem of multimedia geotagging, i.e. estimating the geographical location of a media item based on its content and metadata, to show that very different evaluation outcomes may be obtained depending on the test collection at hand. To alleviate this problem, we propose an evaluation methodology based on an array of sampling strategies over a reference test collection, together with a way of quantifying and summarizing the volatility of performance measurements. We report experimental results on the MediaEval 2015 Placing Task dataset, and demonstrate that the proposed methodology can help capture the performance of geotagging systems in a comprehensive manner that is complementary to existing evaluation approaches.
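The core idea of re-evaluating a metric over many sampled test sets can be illustrated generically. The error distribution, sample sizes, and accuracy-at-radius metric below are invented for illustration; the paper's actual sampling strategies and volatility summaries on the MediaEval 2015 Placing Task data are more elaborate.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical per-item geotagging errors in km (made-up data, not the
# real Placing Task results).
errors_km = rng.lognormal(mean=3.0, sigma=2.0, size=10_000)

def accuracy_at(errors, radius_km):
    """Fraction of items geotagged within `radius_km` of the true location."""
    return float((errors <= radius_km).mean())

def sampled_scores(errors, n_samples=200, sample_size=1_000, radius_km=100.0):
    """Re-evaluate the metric on many random subsamples of the test collection."""
    return np.array([
        accuracy_at(rng.choice(errors, size=sample_size, replace=False), radius_km)
        for _ in range(n_samples)
    ])

scores = sampled_scores(errors_km)
# A simple volatility summary: the spread of the metric across samples.
print(f"mean={scores.mean():.3f}  std={scores.std():.3f}")
```

A system whose score varies widely across such subsamples is exactly the volatility the paper argues a single fixed test collection can hide.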
Citations: 7
Concept-Level Multimodal Ranking of Flickr Photo Tags via Recall Based Weighting
Proceedings of the 2016 ACM Workshop on Multimedia COMMONS Pub Date : 2016-10-16 DOI: 10.1145/2983554.2983555
R. Shah, Yi Yu, Suhua Tang, S. Satoh, Akshay Verma, Roger Zimmermann
Social media platforms allow users to annotate photos with tags that significantly facilitate effective semantic understanding, search, and retrieval of photos. However, due to the manual, ambiguous, and personalized nature of user tagging, many tags of a photo are in a random order and even irrelevant to the visual content. Aiming to automatically compute tag relevance for a given photo, we propose a tag ranking scheme based on voting from photo neighbors derived from multimodal information. Specifically, we determine photo neighbors leveraging geo, visual, and semantic concepts derived from spatial information, visual content, and textual metadata, respectively. We leverage high-level features instead of traditional low-level features to compute tag relevance. Experimental results on a representative set of 203,840 photos from the YFCC100M dataset confirm that the above-mentioned multimodal concepts complement each other in computing tag relevance. Moreover, we explore the fusion of multimodal information to refine tag ranking, leveraging recall-based weighting. Experimental results on the representative set confirm that the proposed algorithm outperforms state-of-the-art methods.
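A minimal sketch of the neighbor-voting idea, under strong simplifying assumptions: the neighbor tag sets are toy data, and the voting is unweighted, whereas the paper fuses geo, visual, and semantic neighbors with recall-based weights.

```python
from collections import Counter

def rank_tags(photo_tags, neighbor_tag_lists):
    """Rank a photo's tags by votes from its neighbors' tag lists.

    Toy neighbor-voting tag relevance: a tag is more relevant the more
    often it appears among the photo's similar neighbors.
    """
    votes = Counter()
    for tags in neighbor_tag_lists:
        votes.update(set(tags))  # each neighbor votes at most once per tag
    return sorted(photo_tags, key=lambda t: votes[t], reverse=True)

# Hypothetical neighbors found via spatial, visual, or textual similarity.
neighbors = [{"beach", "sea"}, {"beach", "sunset"}, {"beach", "sea", "sand"}]
print(rank_tags(["sunset", "sea", "beach"], neighbors))
# → ['beach', 'sea', 'sunset']  ("beach" gets 3 votes, "sea" 2, "sunset" 1)
```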
Citations: 16
Which Languages do People Speak on Flickr?: A Language and Geo-Location Study of the YFCC100m Dataset
Proceedings of the 2016 ACM Workshop on Multimedia COMMONS Pub Date : 2016-10-16 DOI: 10.1145/2983554.2983560
Alireza Koochali, Sebastian Kalkowski, A. Dengel, Damian Borth, Christian Schulze
Recently, the Yahoo Flickr Creative Commons 100 Million (YFCC100m) dataset was introduced to the computer vision and multimedia research community. This dataset consists of millions of images and videos spread over the globe. This geo-distribution hints at a potentially large set of different languages being used in the titles, descriptions, and tags of these images and videos. Since the YFCC100m metadata does not provide any information about the languages used in the dataset, this paper presents the first analysis of this kind. The language and geo-location characteristics of the YFCC100m dataset are described by providing (a) an overview of the languages used, (b) language-to-country associations, and (c) second-language usage in this dataset. Knowing the language used in titles, descriptions, and tags, users of the dataset can make language-specific decisions to select subsets of images for, e.g., proper training of classifiers, or analyze user behavior specific to a spoken language. This language information is also essential for further linguistic studies on the metadata of the YFCC100m dataset.
Citations: 10
Analysis of Spatial, Temporal, and Content Characteristics of Videos in the YFCC100M Dataset
Proceedings of the 2016 ACM Workshop on Multimedia COMMONS Pub Date : 2016-10-16 DOI: 10.1145/2983554.2983559
Jun-Ho Choi, Jong-Seok Lee
The Yahoo Flickr Creative Commons 100 Million dataset (YFCC100M) is one of the largest public databases containing images and videos and their annotations for research on multimedia analysis. In this paper, we present our analysis of the characteristics of the 0.8 million videos in the dataset from spatial, temporal, and content perspectives. For this, all video frames and the metadata of the videos are examined. In addition, a user-wise analysis of the characteristics is conducted. We make the obtained results publicly available in the form of a metadata dataset for the research community.
Citations: 2
Proceedings of the 2016 ACM Workshop on Multimedia COMMONS
Proceedings of the 2016 ACM Workshop on Multimedia COMMONS DOI: 10.1145/2983554
Citations: 1