{"title":"有害模因检测的情感感知跨模态语义交互模型","authors":"Yuxiao Duan, Xiang Zhao, Hao Guo","doi":"10.1016/j.dss.2025.114509","DOIUrl":null,"url":null,"abstract":"<div><div>The increasing proliferation of harmful memes has a serious negative impact on society, rendering the detection of such memes a formidable challenge. Prior research has predominantly concentrated on the modal and semantic attributes of memes while neglecting the significance of cross-modal interactions and detailed semantic information. Although some approaches have incorporated large language models, they often have the problem of harmful avoidance due to ethical constraints. To address these issues, we propose a novel sentiment-aware cross-modal semantic interaction detector, which delves into the profound implications through three principal dimensions: semantic extraction, modal interaction, and sentiment polarity assessment. In the semantic extraction module, Visual Question-Answering is utilized to incorporate detailed knowledge and descriptions. For modal interaction, the positional relationships between meme objects and texts are investigated, and a distance-based attentional multimodal detector is established. In the sentiment polarity module, the sentiment polarity of the text is judged. These components are integrated to form a cohesive joint detection system. Extensive experiments across three benchmark datasets demonstrate SSID significantly outperforms state-of-the-art baselines, enhancing detection accuracy and exhibiting robustness.</div></div>","PeriodicalId":55181,"journal":{"name":"Decision Support Systems","volume":"196 ","pages":"Article 114509"},"PeriodicalIF":6.8000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sentiment-aware cross-modal semantic interaction model for harmful meme detection\",\"authors\":\"Yuxiao Duan, Xiang Zhao, Hao Guo\",\"doi\":\"10.1016/j.dss.2025.114509\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The increasing proliferation of harmful memes has a serious negative impact on society, rendering the detection of such memes a formidable challenge. Prior research has predominantly concentrated on the modal and semantic attributes of memes while neglecting the significance of cross-modal interactions and detailed semantic information. Although some approaches have incorporated large language models, they often have the problem of harmful avoidance due to ethical constraints. To address these issues, we propose a novel sentiment-aware cross-modal semantic interaction detector, which delves into the profound implications through three principal dimensions: semantic extraction, modal interaction, and sentiment polarity assessment. In the semantic extraction module, Visual Question-Answering is utilized to incorporate detailed knowledge and descriptions. For modal interaction, the positional relationships between meme objects and texts are investigated, and a distance-based attentional multimodal detector is established. In the sentiment polarity module, the sentiment polarity of the text is judged. These components are integrated to form a cohesive joint detection system. Extensive experiments across three benchmark datasets demonstrate SSID significantly outperforms state-of-the-art baselines, enhancing detection accuracy and exhibiting robustness.</div></div>\",\"PeriodicalId\":55181,\"journal\":{\"name\":\"Decision Support Systems\",\"volume\":\"196 \",\"pages\":\"Article 114509\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Decision Support Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167923625001101\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Support Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167923625001101","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Sentiment-aware cross-modal semantic interaction model for harmful meme detection
The increasing proliferation of harmful memes has a serious negative impact on society, rendering the detection of such memes a formidable challenge. Prior research has predominantly concentrated on the modal and semantic attributes of memes while neglecting the significance of cross-modal interactions and detailed semantic information. Although some approaches have incorporated large language models, they often have the problem of harmful avoidance due to ethical constraints. To address these issues, we propose a novel sentiment-aware cross-modal semantic interaction detector, which delves into the profound implications through three principal dimensions: semantic extraction, modal interaction, and sentiment polarity assessment. In the semantic extraction module, Visual Question-Answering is utilized to incorporate detailed knowledge and descriptions. For modal interaction, the positional relationships between meme objects and texts are investigated, and a distance-based attentional multimodal detector is established. In the sentiment polarity module, the sentiment polarity of the text is judged. These components are integrated to form a cohesive joint detection system. Extensive experiments across three benchmark datasets demonstrate SSID significantly outperforms state-of-the-art baselines, enhancing detection accuracy and exhibiting robustness.
期刊介绍:
The common thread of articles published in Decision Support Systems is their relevance to theoretical and technical issues in the support of enhanced decision making. The areas addressed may include foundations, functionality, interfaces, implementation, impacts, and evaluation of decision support systems (DSSs).