Haiwei Xue, Xueming Yan, Shengyi Jiang, Helang Lai
{"title":"Multi-Tensor Fusion Network with Hybrid Attention for Multimodal Sentiment Analysis","authors":"Haiwei Xue, Xueming Yan, Shengyi Jiang, Helang Lai","doi":"10.1109/ICMLC51923.2020.9469572","DOIUrl":null,"url":null,"abstract":"Multimodal sentiment analysis is a highly sought-after topic in natural language processing. In this paper, a multi-tensor fusion network with hybrid attention architecture for multimodal sentiment analysis is proposed. Firstly, Bi-LSTM is applied to encode contextual representation in different modalities. Following this, modalities features are extracted and interacted with by the hybrid attention mechanism. Finally, multi-tensor fusion approach is used to further enhance the effectiveness of fusing interaction features in different modalities. The proposed approach outperforms the existing advanced approaches on two benchmarks through a series of regression experiments for sentiment intensity prediction, as it improves F1-score by 3.4% and 2.1% points respectively. Our architecture would be open-sourced on Github1 for researchers to use.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC51923.2020.9469572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Multimodal sentiment analysis is a highly sought-after topic in natural language processing. In this paper, a multi-tensor fusion network with hybrid attention architecture for multimodal sentiment analysis is proposed. Firstly, Bi-LSTM is applied to encode contextual representation in different modalities. Following this, modalities features are extracted and interacted with by the hybrid attention mechanism. Finally, multi-tensor fusion approach is used to further enhance the effectiveness of fusing interaction features in different modalities. The proposed approach outperforms the existing advanced approaches on two benchmarks through a series of regression experiments for sentiment intensity prediction, as it improves F1-score by 3.4% and 2.1% points respectively. Our architecture would be open-sourced on Github1 for researchers to use.