有害模因检测的情感感知跨模态语义交互模型

IF 6.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Decision Support Systems Pub Date : 2025-07-25 DOI:10.1016/j.dss.2025.114509

Yuxiao Duan, Xiang Zhao, Hao Guo

{"title":"有害模因检测的情感感知跨模态语义交互模型","authors":"Yuxiao Duan, Xiang Zhao, Hao Guo","doi":"10.1016/j.dss.2025.114509","DOIUrl":null,"url":null,"abstract":"<div><div>The increasing proliferation of harmful memes has a serious negative impact on society, rendering the detection of such memes a formidable challenge. Prior research has predominantly concentrated on the modal and semantic attributes of memes while neglecting the significance of cross-modal interactions and detailed semantic information. Although some approaches have incorporated large language models, they often have the problem of harmful avoidance due to ethical constraints. To address these issues, we propose a novel sentiment-aware cross-modal semantic interaction detector, which delves into the profound implications through three principal dimensions: semantic extraction, modal interaction, and sentiment polarity assessment. In the semantic extraction module, Visual Question-Answering is utilized to incorporate detailed knowledge and descriptions. For modal interaction, the positional relationships between meme objects and texts are investigated, and a distance-based attentional multimodal detector is established. In the sentiment polarity module, the sentiment polarity of the text is judged. These components are integrated to form a cohesive joint detection system. Extensive experiments across three benchmark datasets demonstrate SSID significantly outperforms state-of-the-art baselines, enhancing detection accuracy and exhibiting robustness.</div></div>","PeriodicalId":55181,"journal":{"name":"Decision Support Systems","volume":"196 ","pages":"Article 114509"},"PeriodicalIF":6.8000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sentiment-aware cross-modal semantic interaction model for harmful meme detection\",\"authors\":\"Yuxiao Duan, Xiang Zhao, Hao Guo\",\"doi\":\"10.1016/j.dss.2025.114509\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The increasing proliferation of harmful memes has a serious negative impact on society, rendering the detection of such memes a formidable challenge. Prior research has predominantly concentrated on the modal and semantic attributes of memes while neglecting the significance of cross-modal interactions and detailed semantic information. Although some approaches have incorporated large language models, they often have the problem of harmful avoidance due to ethical constraints. To address these issues, we propose a novel sentiment-aware cross-modal semantic interaction detector, which delves into the profound implications through three principal dimensions: semantic extraction, modal interaction, and sentiment polarity assessment. In the semantic extraction module, Visual Question-Answering is utilized to incorporate detailed knowledge and descriptions. For modal interaction, the positional relationships between meme objects and texts are investigated, and a distance-based attentional multimodal detector is established. In the sentiment polarity module, the sentiment polarity of the text is judged. These components are integrated to form a cohesive joint detection system. Extensive experiments across three benchmark datasets demonstrate SSID significantly outperforms state-of-the-art baselines, enhancing detection accuracy and exhibiting robustness.</div></div>\",\"PeriodicalId\":55181,\"journal\":{\"name\":\"Decision Support Systems\",\"volume\":\"196 \",\"pages\":\"Article 114509\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2025-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Decision Support Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167923625001101\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Support Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167923625001101","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

有害模因的日益泛滥对社会产生了严重的负面影响，对这些模因的检测是一项艰巨的挑战。以往的研究主要集中在模因的模态和语义属性上，而忽视了模因跨模态交互作用和详细语义信息的重要性。尽管一些方法结合了大型语言模型，但由于伦理约束，它们往往存在有害回避的问题。为了解决这些问题，我们提出了一种新的情感感知跨模态语义交互检测器，该检测器通过三个主要维度：语义提取、模态交互和情感极性评估来深入研究其深远影响。在语义提取模块中，采用可视化问答的方式，将详细的知识和描述融合在一起。对于模态交互，研究模因对象与文本之间的位置关系，建立基于距离的注意多模态检测器。在情感极性模块中，判断文本的情感极性。这些组件被整合成一个有凝聚力的联合检测系统。在三个基准数据集上进行的广泛实验表明，SSID显著优于最先进的基线，提高了检测精度并表现出鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Sentiment-aware cross-modal semantic interaction model for harmful meme detection

The increasing proliferation of harmful memes has a serious negative impact on society, rendering the detection of such memes a formidable challenge. Prior research has predominantly concentrated on the modal and semantic attributes of memes while neglecting the significance of cross-modal interactions and detailed semantic information. Although some approaches have incorporated large language models, they often have the problem of harmful avoidance due to ethical constraints. To address these issues, we propose a novel sentiment-aware cross-modal semantic interaction detector, which delves into the profound implications through three principal dimensions: semantic extraction, modal interaction, and sentiment polarity assessment. In the semantic extraction module, Visual Question-Answering is utilized to incorporate detailed knowledge and descriptions. For modal interaction, the positional relationships between meme objects and texts are investigated, and a distance-based attentional multimodal detector is established. In the sentiment polarity module, the sentiment polarity of the text is judged. These components are integrated to form a cohesive joint detection system. Extensive experiments across three benchmark datasets demonstrate SSID significantly outperforms state-of-the-art baselines, enhancing detection accuracy and exhibiting robustness.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Decision Support Systems 工程技术-计算机：人工智能

CiteScore

14.70

自引率

6.70%

发文量

119

审稿时长

13 months

期刊介绍： The common thread of articles published in Decision Support Systems is their relevance to theoretical and technical issues in the support of enhanced decision making. The areas addressed may include foundations, functionality, interfaces, implementation, impacts, and evaluation of decision support systems (DSSs).