{"title":"OmniViewer: Enabling Multi-modal 3D DASH","authors":"Zhenhuan Gao, Chien-Nan Chen, K. Nahrstedt","doi":"10.1145/2733373.2807971","DOIUrl":"https://doi.org/10.1145/2733373.2807971","url":null,"abstract":"This paper presents OmniViewer, a multi-modal 3D video streaming system based on Dynamic Adaptive Streaming over HTTP (DASH) standard. OmniViewer allows users to view arbitrary side of a performer by choosing the view angle from 0° to 360°. Besides, according to the current available bandwidth, it can also adaptively change the bitrate of rendered 3D video for both smooth and high-quality view rendering. Finally, OmniViewer extends traditional DASH implementation to support multi-modal data streaming besides video and audio.","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116852602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Vocabulary Expansion Using Word Vectors for Video Semantic Indexing","authors":"Nakamasa Inoue, Koichi Shinoda","doi":"10.1145/2733373.2806347","DOIUrl":"https://doi.org/10.1145/2733373.2806347","url":null,"abstract":"We propose vocabulary expansion for video semantic indexing. From many semantic concept detectors obtained by using training data, we make detectors for concepts not included in training data. First, we introduce Mikolov's word vectors to represent a word by a low-dimensional vector. Second, we represent a new concept by a weighted sum of concepts in training data in the word vector space. Finally, we use the same weighting coefficients for combining detectors to make a new detector. In our experiments, we evaluate our methods on the TRECVID Video Semantic Indexing (SIN) Task. We train our models with Google News text documents and ImageNET images to generate new semantic detectors for SIN task. We show that our method performs as well as SVMs trained with 100 TRECVID ex- ample videos.","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116911740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Predicting Image Memorability by Multi-view Adaptive Regression","authors":"Houwen Peng, Kai Li, Bing Li, Haibin Ling, Weihua Xiong, Weiming Hu","doi":"10.1145/2733373.2806303","DOIUrl":"https://doi.org/10.1145/2733373.2806303","url":null,"abstract":"The images we encounter throughout our lives make different impressions on us: Some are remembered at first glance, while others are forgotten. This phenomenon is caused by the intrinsic memorability of images revealed by recent studies [5,6]. In this paper, we address the issue of automatically estimating the memorability of images by proposing a novel multi-view adaptive regression (MAR) model. The MAR model provides an effective mapping of visual features to memorability scores by taking advantage of robust feature selection and multiple feature integration. It consists of three major components: an adaptive loss function, an adaptive regularization and a multi-view modeling strategy. Moreover, we design an alternating direction method (ADM) optimization algorithm to solve the proposed objective function. Experimental results on the MIT benchmark dataset show the superiority of the proposed model compared with existing image memorability prediction methods.","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127061716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"About Events, Objects, and their Relationships: Human-centered Event Understanding from Multimedia","authors":"A. Scherp, V. Mezaris, B. Ionescu, F. D. Natale","doi":"10.1145/2733373.2806413","DOIUrl":"https://doi.org/10.1145/2733373.2806413","url":null,"abstract":"HuEvent'15 is a continuation of previous year's successful workshop on events in multimedia. It focuses on the human-centered aspects of understanding events from multimedia content. This includes the notion of objects and their relation to events. The workshop brings together researchers from the different areas in multimedia and beyond that are interested in understanding the concept of events.","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127130892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning Semantic Correlation of Web Images and Text with Mixture of Local Linear Mappings","authors":"Youtian Du, Kai Yang","doi":"10.1145/2733373.2806331","DOIUrl":"https://doi.org/10.1145/2733373.2806331","url":null,"abstract":"This paper proposes a new approach, called mixture of local linear mappings (MLLM), to the modeling of semantic correlation between web images and text. We consider that close examples generally represent a uniform concept and can be supposed to be locally transformed based on a linear mapping into the feature space of another modality. Thus, we use a mixture of local linear transformations, each local component being constrained by a neighborhood model into a finite local space, instead of a more complex nonlinear one. To handle the sparseness of data representation, we introduce the constraints of sparseness and non-negativeness into the approach. MLLM is with good interpretability due to its explicit closed form and concept-related local components, and it avoids the determination of capacity that is often considered for nonlinear transformations. Experimental results demonstrate the effectiveness of the proposed approach.","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"131 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125052441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"3D Printing and Camera Mapping: Dialectic of Virtual and Reality","authors":"He-Lin Luo, I-Chun Chen, Y. Hung","doi":"10.1145/2733373.2808105","DOIUrl":"https://doi.org/10.1145/2733373.2808105","url":null,"abstract":"Projection Mapping, the superimposing of virtual images upon actual objects, is already extensively used in performance arts. Applications of it are already quite mature, therefore, here we wish to achieve the opposite, or specifically speaking, the superimposing of actual objects into virtual images. This method of reverse superimposition is called \"camera mapping.\" Through cameras, camera mapping captures actual objects, and introduces them into a virtual world. Then using superimposition, this allows for actual objects to be rendered as virtual objects. However, the actual objects here must have refined shapes so that they may be superimposed back into the camera. Through the proliferation of 3D printing, virtual 3D models in computers can be created in reality, thereby providing a framework for the limits and demands of \"camera mapping.\" The new media artwork Digital Buddha combines 3D Printing and camera mapping. This work was created by 3-D deformable modeling through a computer, then transforming the model into a sculpture using 3D printing, and then remapping the materially produced sculpture back into the camera. Finally, it uses the already known algorithm to convert the model back into that of the original non-deformed sculpture. From this creation project, in the real world, audiences will see a deformed, abstract sculpture; and in the virtual world, through camera mapping, they will see a concrete sculpture (Buddha). In its representation, this piece of work pays homage to the work TV Buddha produced by video art master Nam June Paik. Using the influence television possesses over people, this work extends into the most important concepts of the digital era, \"coding\" and \"decoding,\" simultaneously addressing the shock and insecurity people in the digital era feel toward images.","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125086048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Real Time Rolling Shutter","authors":"David S. Monaghan, N. O’Connor, A. Cleary, D. Connolly","doi":"10.1145/2733373.2808110","DOIUrl":"https://doi.org/10.1145/2733373.2808110","url":null,"abstract":"From an early age children are often told either, you are creative you should do art but stay away from science and maths. Or that you are mathematical you should do science but you're not that creative. Compounding this there also exist some traditional barriers of artistic rhetoric that say, \"don't touch, don't think and don't be creative, we've already done that for you, you can just look...\". The Real Time Rolling Shutter is part of a collaborative Art/Science partnership whose core tenets are in complete contrast to this. The Art/Science exhibitions we have created have invited the public to become part of the exhibition by utilising augmented digital mirrors, Kinects, feed-back camera and projector systems and augmented reality perception helmets. The fundamental underlying principles we are trying to adhere to are to foster curiosity, intrigue, wonderment and amazement and we endeavour to draw the audience into the interactive nature of our exhibits and exclaim to everyone that you can be what ever you chose to be, and that everyone can be creative, everyone can be an artist, everyone can be a scientist... all it takes is an inquisitive mind, so come and explore the real-time rolling shutter and be creative.","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125174206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Panel 2","authors":"Yung-Hsiang Lu","doi":"10.1145/3257791","DOIUrl":"https://doi.org/10.1145/3257791","url":null,"abstract":"","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126064765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning Deep Features For MSR-bing Information Retrieval Challenge","authors":"Qiang Song, Sixie Yu, Cong Leng, Jiaxiang Wu, Qinghao Hu, Jian Cheng","doi":"10.1145/2733373.2809928","DOIUrl":"https://doi.org/10.1145/2733373.2809928","url":null,"abstract":"Two tasks have been put forward in the MSR-bing Grand Challenge 2015. To address the information retrieval task, we raise and integrate a series of methods with visual features obtained by convolution neural network (CNN) models. In our experiments, we discover that the ranking strategies of Hierarchical clustering and PageRank methods are mutually complementary. Another task is fine-grained classification. In contrast to basic-level recognition, fine-grained classification aims to distinguish between different breeds or species or product models, and often requires distinctions that must be conditioned on the object pose for reliable identification. Current state-of-the-art techniques rely heavily upon the use of part annotations, while the bing datasets suffer both abundance of part annotations and dirty background. In this paper, we propose a CNN-based feature representation for visual recognition only using image-level information. Our CNN model is pre-trained on a collection of clean datasets and fine-tuned on the bing datasets. Furthermore, a multi-scale training strategy is adopted by simply resizing the input images into different scales and then merging the soft-max posteriors. We then implement our method into a unified visual recognition system on Microsoft cloud service. Finally, our solution achieved top performance in both tasks of the contest","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126071705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"EventBuilder: Real-time Multimedia Event Summarization by Visualizing Social Media","authors":"R. Shah, A. Shaikh, Yi Yu, Wenjing Geng, Roger Zimmermann, Gangshan Wu","doi":"10.1145/2733373.2809932","DOIUrl":"https://doi.org/10.1145/2733373.2809932","url":null,"abstract":"Due to the ubiquitous availability of smartphones and digital cameras, the number of photos/videos online has increased rapidly. Therefore, it is challenging to efficiently browse multimedia content and obtain a summary of an event from a large collection of photos/videos aggregated in social media sharing platforms such as Flickr and Instagram. To this end, this paper presents the EventBuilder system that enables people to automatically generate a summary for a given event in real-time by visualizing different social media such as Wikipedia and Flickr. EventBuilder has two novel characteristics: (i) leveraging Wikipedia as event background knowledge to obtain more contextual information about an input event, and (ii) visualizing an interesting event in real-time with a diverse set of social media activities. According to our initial experiments on the YFCC100M dataset from Flickr, the proposed algorithm efficiently summarizes knowledge structures based on the metadata of photos/videos and Wikipedia articles.","PeriodicalId":427170,"journal":{"name":"Proceedings of the 23rd ACM international conference on Multimedia","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125398358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}