{"title":"Eventscapes: visualizing events over time with emotive facets","authors":"Brett Adams, Dinh Q. Phung, S. Venkatesh","doi":"10.1145/2072298.2072044","DOIUrl":"https://doi.org/10.1145/2072298.2072044","url":null,"abstract":"The scale and dynamicity of social media, and interaction between traditional news sources and online communities, has created challenges to information retrieval approaches. Users may have no clear information need or be unable to express it in the appropriate idiom, requiring instead to be oriented in an unfamiliar domain, to explore and learn. We present a novel data-driven visualization, termed Eventscape, that combines time, visual media, mood, and controversy. Formative evaluation highlights the value of emotive facets for rapid evaluation of mixed news and social media topics, and a role for such visualizations as pre-cursors to deeper search.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115212975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Understanding images with natural sentences","authors":"Y. Ushiku, T. Harada, Y. Kuniyoshi","doi":"10.1145/2072298.2072417","DOIUrl":"https://doi.org/10.1145/2072298.2072417","url":null,"abstract":"We propose a novel system which generates sentential captions for general images. For people to use numerous images effectively on the web, technologies must be able to explain image contents and must be capable of searching for data that users need. Moreover, images must be described with natural sentences based not only on the names of objects contained in an image but also on their mutual relations. The proposed system uses general images and captions available on the web as training data to generate captions for new images. Furthermore, because the learning cost is independent from the amount of data, the system has scalability, which makes it useful with large-scale data.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115701166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An audio-driven virtual dance-teaching assistant","authors":"S. Essid, Yves Grenier, M. Maazaoui, G. Richard, R. Tournemenne","doi":"10.1145/2072298.2072416","DOIUrl":"https://doi.org/10.1145/2072298.2072416","url":null,"abstract":"This work addresses the Huawei/3Dlife Grand challenge proposing a set of audio tools for a virtual dance-teaching assistant. These tools are meant to help the dance student develop a sense of rhythm to correctly synchronize his/her movements and steps to the musical timing of the choreographies to be executed. They consist of three main components, namely a music (beat) analysis module, a source separation and remastering module and a dance step segmentation module. These components enable to create augmented tutorial videos highlighting the rhythmic information using, for instance, a synthetic dance teacher voice, but also videos highlighting the steps executed by a student to help in the evaluation of his/her performance.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125220928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"OpenIMAJ and ImageTerrier: Java libraries and tools for scalable multimedia analysis and indexing of images","authors":"Jonathon S. Hare, Sina Samangooei, D. Dupplaw","doi":"10.1145/2072298.2072421","DOIUrl":"https://doi.org/10.1145/2072298.2072421","url":null,"abstract":"OpenIMAJ and ImageTerrier are recently released open-source libraries and tools for experimentation and development of multimedia applications using Java-compatible programming languages. OpenIMAJ (the Open toolkit for Intelligent Multimedia Analysis in Java) is a collection of libraries for multimedia analysis. The image libraries contain methods for processing images and extracting state-of-the-art features, including SIFT. The video and audio libraries support both cross-platform capture and processing. The clustering and nearest-neighbour libraries contain efficient, multi-threaded implementations of clustering algorithms. The clustering library makes it possible to easily create BoVW representations for images and videos. OpenIMAJ also incorporates a number of tools to enable extremely-large-scale multimedia analysis using distributed computing with Apache Hadoop. ImageTerrier is a scalable, high-performance search engine platform for content-based image retrieval applications using features extracted with the OpenIMAJ library and tools. The ImageTerrier platform provides a comprehensive test-bed for experimenting with image retrieval techniques. The platform incorporates a state-of-the-art implementation of the single-pass indexing technique for constructing inverted indexes and is capable of producing highly compressed index data structures.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121089042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning heterogeneous data for hierarchical web video classification","authors":"Xianming Liu, H. Yao, R. Ji, Pengfei Xu, Xiaoshuai Sun, Q. Tian","doi":"10.1145/2072298.2072355","DOIUrl":"https://doi.org/10.1145/2072298.2072355","url":null,"abstract":"Web videos such as YouTube are hard to obtain sufficient precisely labeled training data and analyze due to the complex ontology. To deal with these problems, we present a hierarchical web video classification framework by learning heterogeneous web data, and construct a bottom-up semantic forest of video concepts by learning from meta-data. The main contributions are two-folds: firstly, analysis about middle-level concepts' distribution is taken based on data collected from web communities, and a concepts redistribution assumption is made to build effective transfer learning algorithm. Furthermore, an AdaBoost-Like transfer learning algorithm is proposed to transfer the knowledge learned from Flickr images to YouTube video domain and thus it facilitates video classification. Secondly, a group of hierarchical taxonomies named Semantic Forest are mined from YouTube and Flickr tags which reflect better user intention on the semantic level. A bottom-up semantic integration is also constructed with the help of semantic forest, in order to analyze video content hierarchically in a novel perspective. A group of experiments are performed on the dataset collected from Flickr and YouTube. Compared with state-of-the-arts, the proposed framework is more robust and tolerant to web noise.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123746301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Ztitch: a mobile phone application for 3D scene creation, navigation, and sharing","authors":"Andrew Au, Jie Liang","doi":"10.1145/2072298.2072460","DOIUrl":"https://doi.org/10.1145/2072298.2072460","url":null,"abstract":"Modern smartphones provide an excellent platform for creating 3D scenes from photos. While there already exists many mobile applications that can stitch a set of photos to create a single, panoramic landscape photo, this paper proposes the creation of panoramic scenes where multiple photos are projected in a 3D space using the pinhole camera model, so that a realistic perspective of the scene is maintained. Our application allows users to automatically create a panoramic 3D scene in real-time using the live images from the phone's camera, and to manually fine-tune each photo's position via the touchscreen if the default position is inaccurate. The application enables scenes to be easily shared to other users, and was developed using the Silverlight framework so that it can run across multiple platforms.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122483274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Keyframe presentation for browsing of user-generated videos on map interfaces","authors":"Jia Hao, Guanfeng Wang, Beomjoo Seo, Roger Zimmermann","doi":"10.1145/2072298.2071926","DOIUrl":"https://doi.org/10.1145/2072298.2071926","url":null,"abstract":"To present user-generated videos that relate to geographic areas for easy access and browsing it is often natural to use maps as interfaces. A common approach is to place thumbnail images of video keyframes in appropriate locations. Here we consider the challenge of determining which keyframes to select and where to place them on the map. Our proposed technique leverages sensor-collected meta-data which are automatically acquired as a continuous stream together with the video. Our approach is able to detect interesting regions and objects (hotspots) and their distances from the camera in a fully automated way. Meaningful keyframes are adaptively selected based on the popularity of the hotspots. Our experiments show very promising results and demonstrate excellent utility for the users.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122611151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ImagiLight: a vision approach to lighting scene setting","authors":"T. Gritti, G. Monaci","doi":"10.1145/2072298.2071995","DOIUrl":"https://doi.org/10.1145/2072298.2071995","url":null,"abstract":"The advent of integrated lighting installations, consisting of individually controllable lamps with advanced rendering capabilities, is fundamentally transforming lighting. This brings a need for an intuitive control capable of fully exploiting the rendering capabilities of the complete lighting infrastructure. In this paper we present a new method to automatically create lighting atmospheres in any type of environment, that allows for an natural interaction with the lighting system and generates unique, suggestive effects. To prove the effectiveness and versatility of the proposed solution, we deploy the system in several application scenarios, and discuss the results.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"449 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122825880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cognitive intervention in autism using multimedia stimulus","authors":"S. Venkatesh, S. Greenhill, Dinh Q. Phung, Brett Adams","doi":"10.1145/2072298.2072448","DOIUrl":"https://doi.org/10.1145/2072298.2072448","url":null,"abstract":"We demonstrate an open multimedia-based system for delivering early intervention therapy for autism. Using flexible multi-touch interfaces together with principled ways to access rich content and tasks, we show how a syllabus can be translated into stimulus sets for early intervention. Media stimuli are able to be presented agnostic to language and media modality due to a semantic network of concepts and relations that are fundamental to language and cognitive development, which enable stimulus complexity to be adjusted to child performance. Being open, the system is able to assemble enough media stimuli to avoid children over-learning, and is able to be customised to a specific child which aids with engagement. Computer-based delivery enables automation of session logging and reporting, a fundamental and time-consuming part of therapy.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"251 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114246815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Expanding the point: automatic enlargement of presentation video elements","authors":"Q. Tung, R. Swaminathan, A. Efrat, Kobus Barnard","doi":"10.1145/2072298.2071913","DOIUrl":"https://doi.org/10.1145/2072298.2071913","url":null,"abstract":"We present a system that assists users in viewing videos of lectures on small screen devices, such as cell phones. It automatically identifies semantic units on the slides, such as bullets, groups of bullets, and images. As the participant views the lecture, the system magnifies the appropriate semantic unit while it is the focus of the discussion. The system makes this decision based on cues from laser pointer gestures and spoken words that are read off the slide. It then magnifies the semantic element using the slide image and the homography between the slide image and the video frame. Experiments suggest that the semantic units of laser-based events identified by our algorithm closely match those identified by humans. In the case of identifying bullets through spoken words, results are more limited but are a good starting point for more complex methods. Finally, we show that this kind of magnification has potential for improving learning of technical content from video lectures when the resolution of the video is limited, such as when being viewed on hand held devices.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"125 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114594219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}