Multimodal Sentiment Analysis: Review, Application Domains and Future Directions

2021 IEEE Pune Section International Conference (PuneCon) Pub Date : 2021-12-16 DOI:10.1109/punecon52575.2021.9686504

Ankita Gandhi, K. Adhvaryu, Vidhi Khanduja

{"title":"Multimodal Sentiment Analysis: Review, Application Domains and Future Directions","authors":"Ankita Gandhi, K. Adhvaryu, Vidhi Khanduja","doi":"10.1109/punecon52575.2021.9686504","DOIUrl":null,"url":null,"abstract":"In this information age, opinion mining which is also known as sentiment analysis turns up to be the most important task in the field of natural language processing. Previous literature in area of sentiment analysis which mostly focused on single modality that is on textual data. Almost all the latest advancement in the sentiment analysis are using textual dataset and resources only. With the invent of internet which increases the use of social media, people are using vlogs, videos, pictures, audios, emojis and microblogs to represent their opinions on different web platforms. In this new media age, every day 720k hours of videos are uploaded on alone Youtube only. We have number of such platforms like YouTube. In the classical methods other modalities’ expressiveness is overlooked and thus these methods fail to generate accurate results. Numerous commercial applications used the aggregation of sentiments and opinions of individuals by anticipating large population. Thus, it is highly necessary that the diverse modalities from the raw data available from the internet should be utilized to mine opinions and identify sentiments. Varied data (i.e., text, speech, visual and code-mixed data) available over internet is integrated by Multimodal Sentiment Analysis. Multimodality refers to more than one modality like bimodal which uses any two modalities or trimodal which uses all the three modalities. Each modality offers its own exclusive features and can be collectively used to mine their positive or negative sentiments, opinions or responses about the entity. The latest development in multimodal sentiment analysis is that the diverse modalities i.e., audio, visual and textual are fused to generate better accuracy. Also, language and culture independent and speaker independent models can be generated. In this survey, we have defined various fusion techniques for sentiment analysis using multiple modalities, characteristics, features for multimodal sentiment analysis. This paper gives an outline of latest approaches used for multimodal sentiment analysis and various application domains in the field of multimodal Sentiment analysis using traditional methods as well as various deep learning methods. It also describes emerging areas of research in sentiment analysis using multimodal data.","PeriodicalId":154406,"journal":{"name":"2021 IEEE Pune Section International Conference (PuneCon)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Pune Section International Conference (PuneCon)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/punecon52575.2021.9686504","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

In this information age, opinion mining which is also known as sentiment analysis turns up to be the most important task in the field of natural language processing. Previous literature in area of sentiment analysis which mostly focused on single modality that is on textual data. Almost all the latest advancement in the sentiment analysis are using textual dataset and resources only. With the invent of internet which increases the use of social media, people are using vlogs, videos, pictures, audios, emojis and microblogs to represent their opinions on different web platforms. In this new media age, every day 720k hours of videos are uploaded on alone Youtube only. We have number of such platforms like YouTube. In the classical methods other modalities’ expressiveness is overlooked and thus these methods fail to generate accurate results. Numerous commercial applications used the aggregation of sentiments and opinions of individuals by anticipating large population. Thus, it is highly necessary that the diverse modalities from the raw data available from the internet should be utilized to mine opinions and identify sentiments. Varied data (i.e., text, speech, visual and code-mixed data) available over internet is integrated by Multimodal Sentiment Analysis. Multimodality refers to more than one modality like bimodal which uses any two modalities or trimodal which uses all the three modalities. Each modality offers its own exclusive features and can be collectively used to mine their positive or negative sentiments, opinions or responses about the entity. The latest development in multimodal sentiment analysis is that the diverse modalities i.e., audio, visual and textual are fused to generate better accuracy. Also, language and culture independent and speaker independent models can be generated. In this survey, we have defined various fusion techniques for sentiment analysis using multiple modalities, characteristics, features for multimodal sentiment analysis. This paper gives an outline of latest approaches used for multimodal sentiment analysis and various application domains in the field of multimodal Sentiment analysis using traditional methods as well as various deep learning methods. It also describes emerging areas of research in sentiment analysis using multimodal data.

查看原文本刊更多论文

多模态情感分析:综述、应用领域和未来方向

在这个信息时代，观点挖掘也被称为情感分析，成为自然语言处理领域最重要的任务。以往的情感分析文献主要集中在文本数据的单一情态上。几乎所有情感分析的最新进展都只使用文本数据集和资源。随着互联网的发明增加了社交媒体的使用，人们正在使用vlog，视频，图片，音频，表情符号和微博来表达他们在不同网络平台上的观点。在这个新媒体时代，每天仅在Youtube上就上传了72万小时的视频。我们有很多这样的平台，比如YouTube。在经典方法中，其他模态的表达性被忽略，因此这些方法无法产生准确的结果。许多商业应用程序通过预测大量人口来汇总个人的情绪和意见。因此，利用来自互联网的原始数据的多种模式来挖掘意见和识别情绪是非常必要的。多种数据(即文本、语音、视觉和代码混合数据)通过多模态情感分析集成在互联网上。多模态指的是一种以上的模态，如使用任意两种模态的双峰态或使用所有三种模态的三模态。每种情态都有自己独特的特点，可以共同用来挖掘他们对实体的积极或消极的情绪、意见或反应。多模态情感分析的最新发展是将不同的模态，即音频、视觉和文本相融合，以产生更好的准确性。此外，还可以生成独立于语言和文化的模型和独立于说话者的模型。在这项调查中，我们定义了各种融合技术的情感分析使用多模态，特征，特征的多模态情感分析。本文概述了多模态情感分析的最新方法，以及传统方法和各种深度学习方法在多模态情感分析领域的应用领域。它还描述了使用多模态数据进行情感分析的新兴研究领域。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE Pune Section International Conference (PuneCon)

自引率

0.00%

发文量