{"title":"Session details: Oral Session 1: Dialogue and Social Interaction","authors":"T. Nishida","doi":"10.1145/3246741","DOIUrl":"https://doi.org/10.1145/3246741","url":null,"abstract":"","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128396294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Combining Multimodal Features with Hierarchical Classifier Fusion for Emotion Recognition in the Wild","authors":"Bo Sun, Liandong Li, Tian Zuo, Ying Chen, Guoyan Zhou, Xuewen Wu","doi":"10.1145/2663204.2666272","DOIUrl":"https://doi.org/10.1145/2663204.2666272","url":null,"abstract":"Emotion recognition in the wild is a very challenging task. In this paper, we investigate a variety of different multimodal features from video and audio to evaluate their discriminative ability to human emotion analysis. For each clip, we extract SIFT, LBP-TOP, PHOG, LPQ-TOP and audio features. We train different classifiers for every kind of features on the dataset from EmotiW 2014 Challenge, and we propose a novel hierarchical classifier fusion method for all the extracted features. The final achievement we gained on the test set is 47.17% which is much better than the best baseline recognition rate of 33.7%.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121394935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Speech-Driven Animation Constrained by Appropriate Discourse Functions","authors":"Najmeh Sadoughi, Yang Liu, C. Busso","doi":"10.1145/2663204.2663252","DOIUrl":"https://doi.org/10.1145/2663204.2663252","url":null,"abstract":"Conversational agents provide powerful opportunities to interact and engage with the users. The challenge is how to create naturalistic behaviors that replicate the complex gestures observed during human interactions. Previous studies have used rule-based frameworks or data-driven models to generate appropriate gestures, which are properly synchronized with the underlying discourse functions. Among these methods, speech-driven approaches are especially appealing given the rich information conveyed on speech. It captures emotional cues and prosodic patterns that are important to synthesize behaviors (i.e., modeling the variability and complexity of the timings of the behaviors). The main limitation of these models is that they fail to capture the underlying semantic and discourse functions of the message (e.g., nodding). This study proposes a speech-driven framework that explicitly model discourse functions, bridging the gap between speech-driven and rule-based models. The approach is based on dynamic Bayesian Network (DBN), where an additional node is introduced to constrain the models by specific discourse functions. We implement the approach by synthesizing head and eyebrow motion. We conduct perceptual evaluations to compare the animations generated using the constrained and unconstrained models.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131172284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Digital Reading Support for The Blind by Multimodal Interaction","authors":"Yasmine N. El-Glaly, Francis K. H. Quek","doi":"10.1145/2663204.2663266","DOIUrl":"https://doi.org/10.1145/2663204.2663266","url":null,"abstract":"Slate-type devices allow Individuals with Blindness or Severe Visual Impairment (IBSVI) to read in place with the touch of their fingertip by audio-rendering the words they touch. Such technologies are helpful for spatial cognition while reading. However, users have to move their fingers slowly or they may lose place on screen. Also, IBSVI may wander between lines without realizing they did. In this paper, we address these two interaction problems by introducing dynamic speech-touch interaction model, and intelligent reading support system. With this model, the speed of the speech will dynamically change coping up with the user's finger speed. The proposed model is composed of: 1- Audio Dynamics Model, and 2- Off-line Speech Synthesis Technique. The intelligent reading support system predicts the direction of reading, corrects the reading word if the user drifts, and notifies the user using a sonic gutter to help her from straying off the reading line. We tested the new audio dynamics model, the sonic gutter, and the reading support model in two user studies. The participants' feedback helped us fine-tune the parameters of the two models. Finally, we ran an evaluation study where the reading support system is compared to other VoiceOver technologies. The results showed preponderance to the reading support system with its audio dynamics and intelligent reading support components.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"177 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132646160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ICMI 2014 Workshop on Multimodal, Multi-Party, Real-World Human-Robot Interaction","authors":"M. Foster, M. Giuliani, Ronald P. A. Petrick","doi":"10.1145/2663204.2668319","DOIUrl":"https://doi.org/10.1145/2663204.2668319","url":null,"abstract":"The Workshop on Multimodal, Multi-Party, Real-World Human-Robot Interaction will be held in Istanbul on 16 November 2014, co-located with the 16th International Conference on Multimodal Interaction (ICMI 2014). The workshop objective is to address the challenges that robots face when interacting with humans in real-world scenarios. The workshop brings together researchers from intention and activity recognition, person tracking, robust speech recognition and language processing, multimodal fusion, planning and decision making under uncertainty, and service robot design. The programme consists of two invited talks, three long paper talks, and seven late-breaking abstracts. Information on the workshop and pointers to workshop papers and slides can be found at http://www.macs.hw.ac.uk/~mef3/icmi-2014-workshop-hri/.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"421 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131593887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Natural Communication about Uncertainties in Situated Interaction","authors":"T. Pejsa, D. Bohus, Michael F. Cohen, C. Saw, James Mahoney, E. Horvitz","doi":"10.1145/2663204.2663249","DOIUrl":"https://doi.org/10.1145/2663204.2663249","url":null,"abstract":"Physically situated, multimodal interactive systems must often grapple with uncertainties about properties of the world, people, and their intentions and actions. We present methods for estimating and communicating about different uncertainties in situated interaction, leveraging the affordances of an embodied conversational agent. The approach harnesses a representation that captures both the magnitude and the sources of uncertainty, and a set of policies that select and coordinate the production of nonverbal and verbal behaviors to communicate the system's uncertainties to conversational participants. The methods are designed to enlist participants' help in a natural manner to resolve uncertainties arising during interactions. We report on a preliminary implementation of the proposed methods in a deployed system and illustrate the functionality with a trace from a sample interaction.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134518702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Perceptions of Interpersonal Behavior are Influenced by Gender, Facial Expression Intensity, and Head Pose","authors":"J. Girard","doi":"10.1145/2663204.2667575","DOIUrl":"https://doi.org/10.1145/2663204.2667575","url":null,"abstract":"Across multiple channels, nonverbal behavior communicates information about affective states and interpersonal intentions. Researchers interested in understanding how these nonverbal messages are transmitted and interpreted have examined the relationship between behavior and ratings of interpersonal motives using dimensions such as agency and communion. However, previous work has focused on images of posed behavior and it is unclear how well these results will generalize to more dynamic representations of real-world behavior. The current study proposes to extend the current literature by examining how gender, facial expression intensity, and head pose influence interpersonal ratings in videos of spontaneous nonverbal behavior.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132903185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Orchestration for Group Videoconferencing: An Interactive Demonstrator","authors":"Wolfgang Weiss, Rene Kaiser, Manolis Falelakis","doi":"10.1145/2663204.2669624","DOIUrl":"https://doi.org/10.1145/2663204.2669624","url":null,"abstract":"In this demonstration we invite visitors to join a live videoconferencing session with remote participants across Europe. We demonstrate the behavior of an automatic decision making component in the realm of social video communication. Our approach takes into account several aspects such as the current conversational situation, conversational metrics of the past, and device capabilities, to make decisions on the visual representation of available video streams. The combination of these cues and the application of automatic decision making rules results into commands of how to mix and how to compose the available video streams for each conversation node's screen. The demo's features are another step towards optimally supporting users in communication within various communication contexts and adapting the user interface to the users' needs.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134068466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Keynote Address 3","authors":"O. Aran","doi":"10.1145/3246749","DOIUrl":"https://doi.org/10.1145/3246749","url":null,"abstract":"","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"06 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130280200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards Social Touch Intelligence: Developing a Robust System for Automatic Touch Recognition","authors":"Merel M. Jung","doi":"10.1145/2663204.2666281","DOIUrl":"https://doi.org/10.1145/2663204.2666281","url":null,"abstract":"Touch behavior is of great importance during social interaction. Automatic recognition of social touch is necessary to transfer the touch modality from interpersonal interaction to other areas such as Human-Robot Interaction (HRI). This paper describes a PhD research program on the automatic detection, classification and interpretation of touch in social interaction between humans and artifacts. Progress thus far includes the recording of a Corpus of Social Touch (CoST) consisting of pressure sensor data of 14 different touch gestures and first classification results. Classification of these 14 gestures resulted in an overall accuracy of 53% using Bayesian classifiers. Further work includes the enhancement of the gesture recognition, building an embodied system for real-time classification and testing this system in a possible application scenario.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130704032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}