Xingang Wang , Mengyi Wang , Hai Cui , Yijia Zhang
{"title":"Manifold knowledge-guided feature fusion network for multimodal sentiment analysis","authors":"Xingang Wang , Mengyi Wang , Hai Cui , Yijia Zhang","doi":"10.1016/j.eswa.2025.127537","DOIUrl":null,"url":null,"abstract":"<div><div>With the continuous progress of multimedia and information technology, multimodal sentiment analysis (MSA) has become one of the most advanced and challenging research directions in the field of artificial intelligence. Multimodal data, including text, visual and audio information, provides additional perspectives for sentiment analysis. However, extraneous information in non-verbal modalities affects the accuracy of sentiment analysis, as sentiment-related features are mainly concentrated in changes in mouth movements and pitch changes, which poses a challenge for accurate sentiment analysis. To solve this problem, we propose a manifold knowledge-guided feature fusion network (MKGN). MKGN uses manifold knowledge generated by manifold learning algorithms to guide neural networks to extract effective non-verbal features and establish associations between multiple features while reducing dimensionality. In addition, in order to improve the quality of knowledge, we propose two knowledge enhancement methods: knowledge filter (KF) and knowledge contrastive learning (CL). Among them, KF is used to filter out unreliable knowledge, and CL further strengthens retained knowledge by changing the distance between knowledge. Importantly, the proposed MKGN achieves excellent performance on three datasets compared to state-of-the-art models. On the MOSI dataset, the accuracy is improved by 2% and 1%, respectively. On the MOSEI dataset, the accuracy improved by 3.8% and 1.8%, respectively. On the UR-FUNNY dataset, the accuracy improved by 0.4%.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"280 ","pages":"Article 127537"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425011595","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
With the continuous progress of multimedia and information technology, multimodal sentiment analysis (MSA) has become one of the most advanced and challenging research directions in the field of artificial intelligence. Multimodal data, including text, visual and audio information, provides additional perspectives for sentiment analysis. However, extraneous information in non-verbal modalities affects the accuracy of sentiment analysis, as sentiment-related features are mainly concentrated in changes in mouth movements and pitch changes, which poses a challenge for accurate sentiment analysis. To solve this problem, we propose a manifold knowledge-guided feature fusion network (MKGN). MKGN uses manifold knowledge generated by manifold learning algorithms to guide neural networks to extract effective non-verbal features and establish associations between multiple features while reducing dimensionality. In addition, in order to improve the quality of knowledge, we propose two knowledge enhancement methods: knowledge filter (KF) and knowledge contrastive learning (CL). Among them, KF is used to filter out unreliable knowledge, and CL further strengthens retained knowledge by changing the distance between knowledge. Importantly, the proposed MKGN achieves excellent performance on three datasets compared to state-of-the-art models. On the MOSI dataset, the accuracy is improved by 2% and 1%, respectively. On the MOSEI dataset, the accuracy improved by 3.8% and 1.8%, respectively. On the UR-FUNNY dataset, the accuracy improved by 0.4%.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.