{"title":"AV-ITN: A Method of Multimodal Video Emotional Content Analysis","authors":"L. Fu, Qiang Zhang, Rui Wang","doi":"10.1109/TOCS56154.2022.10016083","DOIUrl":null,"url":null,"abstract":"With the rapid development of Internet technology, social media has also developed rapidly with the support of the Internet. Social media data has grown exponentially. A large amount of data needs to be reviewed urgently. The method of machine review requires video emotional content analysis technology (Affective Video Content Analysis Technology). Analysis, AVCA) support. A large number of studies have shown that the use of deep learning methods to achieve emotional content analysis is currently the most effective method, and its effect is better than traditional algorithms and manual methods. Based on this, this paper proposes a multi-modal video sentiment analysis algorithm AVITN that utilizes both audio and video modal information to promote sentiment analysis in videos. AV-ITN achieves a high accuracy rate of 83.66% on the IEMOCAP benchmark. The multimodal video sentiment analysis algorithm proposed in this paper can obtain more emotional information contained in the video and effectively improve the accuracy of video sentiment analysis.","PeriodicalId":227449,"journal":{"name":"2022 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TOCS56154.2022.10016083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the rapid development of Internet technology, social media has also developed rapidly with the support of the Internet. Social media data has grown exponentially. A large amount of data needs to be reviewed urgently. The method of machine review requires video emotional content analysis technology (Affective Video Content Analysis Technology). Analysis, AVCA) support. A large number of studies have shown that the use of deep learning methods to achieve emotional content analysis is currently the most effective method, and its effect is better than traditional algorithms and manual methods. Based on this, this paper proposes a multi-modal video sentiment analysis algorithm AVITN that utilizes both audio and video modal information to promote sentiment analysis in videos. AV-ITN achieves a high accuracy rate of 83.66% on the IEMOCAP benchmark. The multimodal video sentiment analysis algorithm proposed in this paper can obtain more emotional information contained in the video and effectively improve the accuracy of video sentiment analysis.
随着互联网技术的飞速发展,社交媒体也在互联网的支持下迅速发展起来。社交媒体数据呈指数级增长。大量的数据需要紧急审查。机器审查的方法需要视频情感内容分析技术(Affective video content analysis technology)。分析,AVCA)支持。大量研究表明,利用深度学习方法实现情感内容分析是目前最有效的方法,其效果优于传统算法和人工方法。在此基础上,本文提出了一种多模态视频情感分析算法AVITN,该算法利用音频和视频模态信息促进视频情感分析。AV-ITN在IEMOCAP基准上的准确率高达83.66%。本文提出的多模态视频情感分析算法可以获取视频中包含的更多情感信息,有效提高视频情感分析的准确性。