A Method with Universal Transformer for Multimodal Sentiment Analysis
Hao Ai, Ying Liu, Jie Fang, Sheikh Faisal Rashid
Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition, published 2022-09-23
DOI: 10.1145/3573942.3573968
Citations: 0
Abstract
Multimodal sentiment analysis uses computers to identify the emotions people express, based on sentiment features extracted from multiple modalities, and it plays a significant role in human-computer interaction and financial market prediction. Most existing approaches model contextual information. While this effectively captures the contextual connections within each modality, it often overlooks the correlations between modalities, which are also critical to the final recognition result. This paper therefore proposes a multimodal sentiment analysis approach based on the universal transformer: a framework that uses the universal transformer to model the connections between modalities, while employing effective feature-extraction methods to capture the contextual connections within each individual modality. We evaluated the proposed method on two benchmark multimodal sentiment analysis datasets, CMU-MOSI and CMU-MOSEI, where it outperformed other methods of the same type.
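The abstract does not give the architecture's details, so the following is only an illustrative sketch of the core idea it names: a universal transformer applies one weight-shared transformer step recurrently over a fused sequence of modality features, so cross-modal correlations are refined iteratively at every depth step. All dimensions, the depth of 3, and the token counts per modality are assumptions for the example, not values from the paper.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

class UniversalTransformerBlock:
    """One transformer step with SHARED weights, applied recurrently.

    Unlike a standard stack (new parameters per layer), the universal
    transformer reuses the same parameters at every depth step.
    """
    def __init__(self, d_model, seed=0):
        rng = np.random.default_rng(seed)
        s = 1.0 / np.sqrt(d_model)
        self.Wq = rng.normal(0, s, (d_model, d_model))
        self.Wk = rng.normal(0, s, (d_model, d_model))
        self.Wv = rng.normal(0, s, (d_model, d_model))
        self.W1 = rng.normal(0, s, (d_model, d_model))
        self.W2 = rng.normal(0, s, (d_model, d_model))
        self.d = d_model

    def step(self, x):
        # self-attention over the fused sequence: every token (from any
        # modality) can attend to every other, capturing cross-modal links
        q, k, v = x @ self.Wq, x @ self.Wk, x @ self.Wv
        attn = softmax(q @ k.T / np.sqrt(self.d)) @ v
        h = x + attn                                      # residual
        return h + np.maximum(h @ self.W1, 0) @ self.W2   # ReLU FFN + residual

    def forward(self, x, depth=3):
        # recurrence over depth with shared weights = "universal" part
        for _ in range(depth):
            x = self.step(x)
        return x

# hypothetical per-modality features (text, audio, visual), already
# projected to a common d=16 and concatenated along the sequence axis
rng = np.random.default_rng(1)
text, audio, visual = (rng.normal(size=(n, 16)) for n in (4, 3, 3))
fused = np.concatenate([text, audio, visual], axis=0)     # shape (10, 16)
out = UniversalTransformerBlock(16).forward(fused, depth=3)
print(out.shape)                                          # (10, 16)
```

The sketch omits layer normalization, multiple heads, and the adaptive-computation-time halting mechanism of the original universal transformer; it is only meant to show weight sharing across depth over a concatenated multimodal sequence.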