{"title":"Identification of the Driver's Interest Point using a Head Pose Trajectory for Situated Dialog Systems","authors":"Young-Ho Kim, Teruhisa Misu","doi":"10.1145/2663204.2663230","DOIUrl":"https://doi.org/10.1145/2663204.2663230","url":null,"abstract":"This paper addresses issues existing in situated language understanding in a moving car. Particularly, we propose a method for understanding user queries regarding specific target buildings in their surroundings based on the driver's head pose and speech information. To identify a meaningful head pose motion related to the user query that is among spontaneous motions while driving, we construct a model describing the relationship between sequences of a driver's head pose and the relative direction to an interest point using the Gaussian process regression. We also consider time-varying interest point using kernel density estimation. We collected situated queries from subject drivers by using our research system embedded in a real car. The proposed method achieves an improvement in the target identification rate by 14% in the user-independent training condition and 27% in the user-dependent training condition over the method that uses the head motion at the start-of-speech timing.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122725402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Neural Networks for Emotion Recognition in the Wild","authors":"Michal Grosicki","doi":"10.1145/2663204.2666270","DOIUrl":"https://doi.org/10.1145/2663204.2666270","url":null,"abstract":"In this paper we present neural networks based method for emotion recognition. Proposed model was developed as part of 2014 Emotion Recognition in the Wild Challenge. It is composed of modality specific neural networks, which where trained separately on audio and video data extracted from short video clips taken from various movies. Each network was trained on frame-level data, which in later stages were aggregated by simple averaging of predicted class distributions for each clip. In the next stage various techniques for combining modalities where investigated with the best being support vector machine with RBF kernel. Our method achieved accuracy of 37.84%, which is better than 33.7% obtained by the best baseline model provided by organisers.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"116 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133701945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"MLA'14: Third Multimodal Learning Analytics Workshop and Grand Challenges","authors":"X. Ochoa, M. Worsley, K. Chiluiza, S. Luz","doi":"10.1145/2663204.2668318","DOIUrl":"https://doi.org/10.1145/2663204.2668318","url":null,"abstract":"This paper summarizes the third Multimodal Learning Analytics Workshop and Grand Challenges (MLA'14). This subfield of Learning Analytics focuses on the interpretation of the multimodal interactions that occurs in learning environments, both digital and physical. This is a hybrid event that includes presentations about methods and techniques to analyze and merge the different signals captured from these environments (workshop session) and more concrete results from the application of Multimodal Learning Analytics techniques to predict the performance of students while solving math problems or presenting in the classroom (challenges sessions). A total of eight articles will be presented in this event. The main conclusion from this event is that Multimodal Learning Analytics is a desirable research endeavour that could produce results that can be currently applied to improve the learning process.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133811427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"CrossMotion: Fusing Device and Image Motion for User Identification, Tracking and Device Association","authors":"Andrew D. Wilson, Hrvoje Benko","doi":"10.1145/2663204.2663270","DOIUrl":"https://doi.org/10.1145/2663204.2663270","url":null,"abstract":"Identifying and tracking people and mobile devices indoors has many applications, but is still a challenging problem. We introduce a cross-modal sensor fusion approach to track mobile devices and the users carrying them. The CrossMotion technique matches the acceleration of a mobile device, as measured by an onboard internal measurement unit, to similar acceleration observed in the infrared and depth images of a Microsoft Kinect v2 camera. This matching process is conceptually simple and avoids many of the difficulties typical of more common appearance-based approaches. In particular, CrossMotion does not require a model of the appearance of either the user or the device, nor in many cases a direct line of sight to the device. We demonstrate a real time implementation that can be applied to many ubiquitous computing scenarios. In our experiments, CrossMotion found the person's body 99% of the time, on average within 7cm of a reference device position.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129271619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis of Respiration for Prediction of \"Who Will Be Next Speaker and When?\" in Multi-Party Meetings","authors":"Ryo Ishii, K. Otsuka, Shiro Kumano, Junji Yamato","doi":"10.1145/2663204.2663271","DOIUrl":"https://doi.org/10.1145/2663204.2663271","url":null,"abstract":"To build a model for predicting the next speaker and the start time of the next utterance in multi-party meetings, we performed a fundamental study of how respiration could be effective for the prediction model. The results of the analysis reveal that a speaker inhales more rapidly and quickly right after the end of a unit of utterance in turn-keeping. The next speaker takes a bigger breath toward speaking in turn-changing than listeners who will not become the next speaker. Based on the results of the analysis, we constructed the prediction models to evaluate how effective the parameters are. The results of the evaluation suggest that the speaker's inhalation right after a unit of utterance, such as the start time from the end of the unit of utterance and the slope and duration of the inhalation phase, is effective for predicting whether turn-keeping or turn-changing happen about 350 ms before the start time of the next utterance on average and that listener's inhalation before the next utterance, such as the maximal inspiration and amplitude of the inhalation phase, is effective for predicting the next speaker in turn-changing about 900 ms before the start time of the next utterance on average.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127641867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Multimodal Context-based Approach for Distress Assessment","authors":"Sayan Ghosh, Moitreya Chatterjee, Louis-Philippe Morency","doi":"10.1145/2663204.2663274","DOIUrl":"https://doi.org/10.1145/2663204.2663274","url":null,"abstract":"The increasing prevalence of psychological distress disorders, such as depression and post-traumatic stress, necessitates a serious effort to create new tools and technologies to help with their diagnosis and treatment. In recent years, new computational approaches were proposed to objectively analyze patient non-verbal behaviors over the duration of the entire interaction between the patient and the clinician. In this paper, we go beyond non-verbal behaviors and propose a tri-modal approach which integrates verbal behaviors with acoustic and visual behaviors to analyze psychological distress during the course of the dyadic semi-structured interviews. Our approach exploits the advantages of the dyadic nature of these interactions to contextualize the participant responses based on the affective components (intimacy and polarity levels) of the questions. We validate our approach using one of the largest corpus of semi-structured interviews for distress assessment which consists of 154 multimodal dyadic interactions. Our results show significant improvement on distress prediction performance when integrating verbal behaviors with acoustic and visual behaviors. In addition, our analysis shows that contextualizing the responses improves the prediction performance, most significantly with positive and intimate questions.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"95 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129056559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Keynote Address 4","authors":"J. Cohn","doi":"10.1145/3246752","DOIUrl":"https://doi.org/10.1145/3246752","url":null,"abstract":"","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123987935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rhythmic Body Movements of Laughter","authors":"Radoslaw Niewiadomski, M. Mancini, Yu Ding, C. Pelachaud, G. Volpe","doi":"10.1145/2663204.2663240","DOIUrl":"https://doi.org/10.1145/2663204.2663240","url":null,"abstract":"In this paper we focus on three aspects of multimodal expressions of laughter. First, we propose a procedural method to synthesize rhythmic body movements of laughter based on spectral analysis of laughter episodes. For this purpose, we analyze laughter body motions from motion capture data and we reconstruct them with appropriate harmonics. Then we reduce the parameter space to two dimensions. These are the inputs of the actual model to generate a continuum of laughs rhythmic body movements. In the paper, we also propose a method to integrate rhythmic body movements generated by our model with other synthetized expressive cues of laughter such as facial expressions and additional body movements. Finally, we present a real-time human-virtual character interaction scenario where virtual character applies our model to answer to human's laugh in real-time. ","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129443089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Oral Session 6: Healthcare and Assistive Technologies","authors":"D. Bohus","doi":"10.1145/3246751","DOIUrl":"https://doi.org/10.1145/3246751","url":null,"abstract":"","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126338910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Personal Aesthetics for Soft Biometrics: A Generative Multi-resolution Approach","authors":"Cristina Segalin, A. Perina, M. Cristani","doi":"10.1145/2663204.2663259","DOIUrl":"https://doi.org/10.1145/2663204.2663259","url":null,"abstract":"Are we recognizable by our image preferences? This paper answers affirmatively the question, presenting a soft biometric approach where the preferred images of an individual are used as his personal signature in identification tasks. The approach builds a multi-resolution latent space, formed by multiple Counting Grids, where similar images are mapped nearby. On this space, a set of preferred images of a user produces an ensemble of intensity maps, highlighting in an intuitive way his personal aesthetic preferences. These maps are then used for learning a battery of discriminative classifiers (one for each resolution), which characterizes the user and serves to perform identification. Results are promising: on a dataset of 200 users, and 40K images, using 20 preferred images as biometric template gives 66% of probability of guessing the correct user. This makes the \"personal aesthetics\" a very hot topic for soft biometrics, while its usage in standard biometric applications seems to be far from being effective, as we show in a simple user study.","PeriodicalId":389037,"journal":{"name":"Proceedings of the 16th International Conference on Multimodal Interaction","volume":"293 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124208570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}