{"title":"Towards Automated Assessment of Public Speaking Skills Using Multimodal Cues","authors":"L. Chen, G. Feng, Jilliam Joe, C. W. Leong, Christopher Kitchen, Chong Min Lee","doi":"10.1145/2663204.2663265","DOIUrl":"https://doi.org/10.1145/2663204.2663265","url":null,"abstract":"Traditional assessments of public speaking skills rely on human scoring. We report an initial study on the development of an automated scoring model for public speaking performances using multimodal technologies. Task design, rubric development, and human rating were conducted according to standards in educational assessment. An initial corpus of 17 speakers with 4 speaking tasks was collected using audio, video, and 3D motion capturing devices. A scoring model based on basic features in the speech content, speech delivery, and hand, body, and head movements significantly predicts human rating, suggesting the feasibility of using multimodal technologies in the assessment of public speaking skills.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116895641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Why We Watch the News: A Dataset for Exploring Sentiment in Broadcast Video News","authors":"Joseph G. Ellis, Brendan Jou, Shih-Fu Chang","doi":"10.1145/2663204.2663237","DOIUrl":"https://doi.org/10.1145/2663204.2663237","url":null,"abstract":"We present a multimodal sentiment study performed on a novel collection of videos mined from broadcast and cable television news programs. To the best of our knowledge, this is the first dataset released for studying sentiment in the domain of broadcast video news. We describe our algorithm for the processing and creation of person-specific segments from news video, yielding 929 sentence-length videos, and are annotated via Amazon Mechanical Turk. The spoken transcript and the video content itself are each annotated for their expression of positive, negative or neutral sentiment. Based on these gathered user annotations, we demonstrate for news video the importance of taking into account multimodal information for sentiment prediction, and in particular, challenging previous text-based approaches that rely solely on available transcripts. We show that as much as 21.54% of the sentiment annotations for transcripts differ from their respective sentiment annotations when the video clip itself is presented. We present audio and visual classification baselines over a three-way sentiment prediction of positive, negative and neutral, as well as person-dependent versus person-independent classification influence on performance. Finally, we release the News Rover Sentiment dataset to the greater research community.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114359928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"UM3I 2014: International Workshop on Understanding and Modeling Multiparty, Multimodal Interactions","authors":"S. Moubayed, D. Bohus, A. Esposito, D. Heylen, Maria Koutsombogera, Haris Papageorgiou, Gabriel Skantze","doi":"10.1145/2663204.2668321","DOIUrl":"https://doi.org/10.1145/2663204.2668321","url":null,"abstract":"In this paper, we present a brief summary of the international workshop on Modeling Multiparty, Multimodal Interactions. The UM3I 2014 workshop is held in conjunction with the ICMI 2014 conference. The workshop will highlight recent developments and adopted methodologies in the analysis and modeling of multiparty and multimodal interactions, the design and implementation principles of related human-machine interfaces, as well as the identification of potential limitations and ways of overcoming them.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"196 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114198710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Exploring a Model of Gaze for Grounding in Multimodal HRI","authors":"Gregor Mehlmann, M. Häring, Kathrin Janowski, Tobias Baur, Patrick Gebhard, E. André","doi":"10.1145/2663204.2663275","DOIUrl":"https://doi.org/10.1145/2663204.2663275","url":null,"abstract":"Grounding is an important process that underlies all human interaction. Hence, it is crucial for building social robots that are expected to collaborate effectively with humans. Gaze behavior plays versatile roles in establishing, maintaining and repairing the common ground. Integrating all these roles in a computational dialog model is a complex task since gaze is generally combined with multiple parallel information modalities and involved in multiple processes for the generation and recognition of behavior. Going beyond related work, we present a modeling approach focusing on these multi-modal, parallel and bi-directional aspects of gaze that need to be considered for grounding and their interleaving with the dialog and task management. We illustrate and discuss the different roles of gaze as well as advantages and drawbacks of our modeling approach based on a first user study with a technically sophisticated shared workspace application with a social humanoid robot.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133585260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Managing Human-Robot Engagement with Forecasts and... um... Hesitations","authors":"D. Bohus, E. Horvitz","doi":"10.1145/2663204.2663241","DOIUrl":"https://doi.org/10.1145/2663204.2663241","url":null,"abstract":"We explore methods for managing conversational engagement in open-world, physically situated dialog systems. We investigate a self-supervised methodology for constructing forecasting models that aim to anticipate when participants are about to terminate their interactions with a situated system. We study how these models can be leveraged to guide a disengagement policy that uses linguistic hesitation actions, such as filled and non-filled pauses, when uncertainty about the continuation of engagement arises. The hesitations allow for additional time for sensing and inference, and convey the system's uncertainty. We report results from a study of the proposed approach with a directions-giving robot deployed in the wild.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129695502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detecting conversing groups with a single worn accelerometer","authors":"H. Hung, G. Englebienne, L. C. Quiros","doi":"10.1145/2663204.2663228","DOIUrl":"https://doi.org/10.1145/2663204.2663228","url":null,"abstract":"In this paper we propose the novel task of detecting groups of conversing people using only a single body-worn accelerometer per person. Our approach estimates each individual's social actions and uses the co-ordination of these social actions between pairs to identify group membership. The aim of such an approach is to be deployed in dense crowded environments. Our work differs significantly from previous approaches, which have tended to rely on audio and/or proximity sensing, often in much less crowded scenarios, for estimating whether people are talking together or who is speaking. Ultimately, we are interested in detecting who is speaking, who is conversing with whom, and from that, to infer socially relevant information about the interaction such as whether people are enjoying themselves, or the quality of their relationship in these extremely dense crowded scenarios. Striving towards this long-term goal, this paper presents a systematic study to understand how to detect groups of people who are conversing together in this setting, where we achieve a $64%$ classification accuracy using a fully automated system.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130423834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Impact of Coordinate Systems on 3D Manipulations in Mobile Augmented Reality","authors":"Philipp Tiefenbacher, Steven Wichert, D. Merget, G. Rigoll","doi":"10.1145/2663204.2663234","DOIUrl":"https://doi.org/10.1145/2663204.2663234","url":null,"abstract":"Mobile touch PCs allow interactions with virtual objects in augmented reality scenes. Manipulations of 3D objects are a common way of such interactions, which can be performed in three different coordinate systems: the camera-, object- and world coordinate systems. The camera coordinate system changes continuously in augmented reality as it depends on the mobile device's pose. The axis orientations of the world coordinate system are steady, whereas the axes of the object coordinates base on previous manipulations. The selection of a coordinate system therefore influences the 3D transformation's orientation independent from the used manipulation type. In this paper, we evaluate the impact of the three possible coordinate systems on rotation and on translation of a 3D item in an augmented reality scenario. A study with 36 participants determines the best coordinates for translation and rotation.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128248552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Non-Visual Navigation Using Combined Audio Music and Haptic Cues","authors":"Emily Fujimoto, M. Turk","doi":"10.1145/2663204.2663243","DOIUrl":"https://doi.org/10.1145/2663204.2663243","url":null,"abstract":"While a great deal of work has been done exploring non-visual navigation interfaces using audio and haptic cues, little is known about the combination of the two. We investigate combining different state-of-the-art interfaces for communicating direction and distance information using vibrotactile and audio music cues, limiting ourselves to interfaces that are possible with current off-the-shelf smartphones. We use experimental logs, subjective task load questionnaires, and user comments to see how users' perceived performance, objective performance, and acceptance of the system varied for different combinations. Users' perceived performance did not differ much between the unimodal and multimodal interfaces, but a few users commented that the multimodal interfaces added some cognitive load. Objective performance showed that some multimodal combinations resulted in significantly less direction or distance error over some of the unimodal ones, especially the purely haptic interface. Based on these findings we propose a few design considerations for multimodal haptic/audio navigation interfaces.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125572565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Combining Multiple Kernel Methods on Riemannian Manifold for Emotion Recognition in the Wild","authors":"Mengyi Liu, Ruiping Wang, Shaoxin Li, S. Shan, Zhiwu Huang, Xilin Chen","doi":"10.1145/2663204.2666274","DOIUrl":"https://doi.org/10.1145/2663204.2666274","url":null,"abstract":"In this paper, we present the method for our submission to the Emotion Recognition in the Wild Challenge (EmotiW 2014). The challenge is to automatically classify the emotions acted by human subjects in video clips under real-world environment. In our method, each video clip can be represented by three types of image set models (i.e. linear subspace, covariance matrix, and Gaussian distribution) respectively, which can all be viewed as points residing on some Riemannian manifolds. Then different Riemannian kernels are employed on these set models correspondingly for similarity/distance measurement. For classification, three types of classifiers, i.e. kernel SVM, logistic regression, and partial least squares, are investigated for comparisons. Finally, an optimal fusion of classifiers learned from different kernels and different modalities (video and audio) is conducted at the decision level for further boosting the performance. We perform an extensive evaluation on the challenge data (including validation set and blind test set), and evaluate the effects of different strategies in our pipeline. The final recognition accuracy achieved 50.4% on test set, with a significant gain of 16.7% above the challenge baseline 33.7%.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122181251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Gaze-Based Proactive User Interface for Pen-Based Systems","authors":"Çagla Çig","doi":"10.1145/2663204.2666287","DOIUrl":"https://doi.org/10.1145/2663204.2666287","url":null,"abstract":"In typical human-computer interaction, users convey their intentions through traditional input devices (e.g. keyboards, mice, joysticks) coupled with standard graphical user interface elements. Recently, pen-based interaction has emerged as a more intuitive alternative to these traditional means. However, existing pen-based systems are limited by the fact that they rely heavily on auxiliary mode switching mechanisms during interaction (e.g. hard or soft modifier keys, buttons, menus). In this paper, I describe the roadmap for my PhD research which aims at using eye gaze movements that naturally occur during pen-based interaction to reduce dependency on explicit mode selection mechanisms in pen-based systems.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122489232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}