Punchline Detection using Context-Aware Hierarchical Multimodal Fusion

Proceedings of the 2020 International Conference on Multimodal Interaction Pub Date : 2020-10-21 DOI:10.1145/3382507.3418891

Akshat Choube, M. Soleymani

引用次数: 8

Abstract

Humor has a history as old as humanity. Humor often induces laughter and elicits amusement and engagement. Humorous behavior involves behavior manifested in different modalities including language, voice tone, and gestures. Thus, automatic understanding of humorous behavior requires multimodal behavior analysis. Humor detection is a well-established problem in Natural Language Processing but its multimodal analysis is less explored. In this paper, we present a context-aware hierarchical fusion network for multimodal punchline detection. The proposed neural architecture first fuses the modalities two by two and then fuses all three modalities. The network also models the context of the punchline using Gated Recurrent Unit(s). The model's performance is evaluated on UR-FUNNY database yielding state-of-the-art performance.

查看原文本刊更多论文

使用上下文感知分层多模态融合的笑点检测

幽默的历史和人类一样古老。幽默常常引起笑声，使人感到愉快和愉快。幽默行为包括不同形式的行为，包括语言、语调和手势。因此，对幽默行为的自动理解需要多模态行为分析。幽默检测是自然语言处理中一个公认的问题，但其多模态分析研究较少。在本文中，我们提出了一个上下文感知的分层融合网络，用于多模态笑点检测。提出的神经结构首先对两个模态进行融合，然后对所有三个模态进行融合。该网络还使用门控循环单元对笑点的上下文进行建模。该模型的性能在UR-FUNNY数据库上进行评估，产生最先进的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2020 International Conference on Multimodal Interaction

自引率

0.00%

发文量