Haiwei Xue, Xueming Yan, Shengyi Jiang, Helang Lai
{"title":"基于混合关注的多张量融合网络多模态情感分析","authors":"Haiwei Xue, Xueming Yan, Shengyi Jiang, Helang Lai","doi":"10.1109/ICMLC51923.2020.9469572","DOIUrl":null,"url":null,"abstract":"Multimodal sentiment analysis is a highly sought-after topic in natural language processing. In this paper, a multi-tensor fusion network with hybrid attention architecture for multimodal sentiment analysis is proposed. Firstly, Bi-LSTM is applied to encode contextual representation in different modalities. Following this, modalities features are extracted and interacted with by the hybrid attention mechanism. Finally, multi-tensor fusion approach is used to further enhance the effectiveness of fusing interaction features in different modalities. The proposed approach outperforms the existing advanced approaches on two benchmarks through a series of regression experiments for sentiment intensity prediction, as it improves F1-score by 3.4% and 2.1% points respectively. Our architecture would be open-sourced on Github1 for researchers to use.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Multi-Tensor Fusion Network with Hybrid Attention for Multimodal Sentiment Analysis\",\"authors\":\"Haiwei Xue, Xueming Yan, Shengyi Jiang, Helang Lai\",\"doi\":\"10.1109/ICMLC51923.2020.9469572\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multimodal sentiment analysis is a highly sought-after topic in natural language processing. In this paper, a multi-tensor fusion network with hybrid attention architecture for multimodal sentiment analysis is proposed. Firstly, Bi-LSTM is applied to encode contextual representation in different modalities. Following this, modalities features are extracted and interacted with by the hybrid attention mechanism. Finally, multi-tensor fusion approach is used to further enhance the effectiveness of fusing interaction features in different modalities. The proposed approach outperforms the existing advanced approaches on two benchmarks through a series of regression experiments for sentiment intensity prediction, as it improves F1-score by 3.4% and 2.1% points respectively. Our architecture would be open-sourced on Github1 for researchers to use.\",\"PeriodicalId\":170815,\"journal\":{\"name\":\"2020 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Machine Learning and Cybernetics (ICMLC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLC51923.2020.9469572\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC51923.2020.9469572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-Tensor Fusion Network with Hybrid Attention for Multimodal Sentiment Analysis
Multimodal sentiment analysis is a highly sought-after topic in natural language processing. In this paper, a multi-tensor fusion network with hybrid attention architecture for multimodal sentiment analysis is proposed. Firstly, Bi-LSTM is applied to encode contextual representation in different modalities. Following this, modalities features are extracted and interacted with by the hybrid attention mechanism. Finally, multi-tensor fusion approach is used to further enhance the effectiveness of fusing interaction features in different modalities. The proposed approach outperforms the existing advanced approaches on two benchmarks through a series of regression experiments for sentiment intensity prediction, as it improves F1-score by 3.4% and 2.1% points respectively. Our architecture would be open-sourced on Github1 for researchers to use.