Multimodal sentiment analysis based on multi-head attention mechanism

Proceedings of the 4th International Conference on Machine Learning and Soft Computing Pub Date : 2020-01-17 DOI:10.1145/3380688.3380693

Chen Xi, G. Lu, Jingjie Yan

引用次数: 34

Abstract

Multimodal sentiment analysis is still a promising area of research, which has many issues needed to be addressed. Among them, extracting reasonable unimodal features and designing a robust multimodal sentiment analysis model is the most basic problem. This paper presents some novel ways of extracting sentiment features from visual, audio and text, furthermore use these features to verify the multimodal sentiment analysis model based on multi-head attention mechanism. The proposed model is evaluated on Multimodal Opinion Utterances Dataset (MOUD) corpus and CMU Multi-modal Opinion-level Sentiment Intensity (CMU-MOSI) corpus for multimodal sentiment analysis. Experimental results prove the effectiveness of the proposed approach. The accuracy of the MOUD and MOSI datasets is 90.43% and 82.71%, respectively. Compared to the state-of-the-art models, the improvement of the performance are approximately 2 and 0.4 points.

查看原文本刊更多论文

基于多头注意机制的多模态情感分析

多模态情感分析仍然是一个很有前途的研究领域，有许多问题需要解决。其中，提取合理的单模态特征，设计稳健的多模态情感分析模型是最基本的问题。本文提出了从视觉、音频和文本中提取情感特征的新方法，并利用这些特征验证了基于多头注意机制的多模态情感分析模型。在多模态意见话语数据集(mod)语料库和CMU多模态意见级情感强度(CMU- mosi)语料库上对该模型进行了评估，用于多模态情感分析。实验结果证明了该方法的有效性。mod和MOSI数据集的精度分别为90.43%和82.71%。与最先进的车型相比，性能提高了约2分和0.4分。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 4th International Conference on Machine Learning and Soft Computing

自引率

0.00%

发文量