Sentiment-aware cross-modal semantic interaction model for harmful meme detection

IF 6.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yuxiao Duan, Xiang Zhao, Hao Guo
{"title":"Sentiment-aware cross-modal semantic interaction model for harmful meme detection","authors":"Yuxiao Duan,&nbsp;Xiang Zhao,&nbsp;Hao Guo","doi":"10.1016/j.dss.2025.114509","DOIUrl":null,"url":null,"abstract":"<div><div>The increasing proliferation of harmful memes has a serious negative impact on society, rendering the detection of such memes a formidable challenge. Prior research has predominantly concentrated on the modal and semantic attributes of memes while neglecting the significance of cross-modal interactions and detailed semantic information. Although some approaches have incorporated large language models, they often have the problem of harmful avoidance due to ethical constraints. To address these issues, we propose a novel sentiment-aware cross-modal semantic interaction detector, which delves into the profound implications through three principal dimensions: semantic extraction, modal interaction, and sentiment polarity assessment. In the semantic extraction module, Visual Question-Answering is utilized to incorporate detailed knowledge and descriptions. For modal interaction, the positional relationships between meme objects and texts are investigated, and a distance-based attentional multimodal detector is established. In the sentiment polarity module, the sentiment polarity of the text is judged. These components are integrated to form a cohesive joint detection system. Extensive experiments across three benchmark datasets demonstrate SSID significantly outperforms state-of-the-art baselines, enhancing detection accuracy and exhibiting robustness.</div></div>","PeriodicalId":55181,"journal":{"name":"Decision Support Systems","volume":"196 ","pages":"Article 114509"},"PeriodicalIF":6.8000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Support Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167923625001101","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

The increasing proliferation of harmful memes has a serious negative impact on society, rendering the detection of such memes a formidable challenge. Prior research has predominantly concentrated on the modal and semantic attributes of memes while neglecting the significance of cross-modal interactions and detailed semantic information. Although some approaches have incorporated large language models, they often have the problem of harmful avoidance due to ethical constraints. To address these issues, we propose a novel sentiment-aware cross-modal semantic interaction detector, which delves into the profound implications through three principal dimensions: semantic extraction, modal interaction, and sentiment polarity assessment. In the semantic extraction module, Visual Question-Answering is utilized to incorporate detailed knowledge and descriptions. For modal interaction, the positional relationships between meme objects and texts are investigated, and a distance-based attentional multimodal detector is established. In the sentiment polarity module, the sentiment polarity of the text is judged. These components are integrated to form a cohesive joint detection system. Extensive experiments across three benchmark datasets demonstrate SSID significantly outperforms state-of-the-art baselines, enhancing detection accuracy and exhibiting robustness.
有害模因检测的情感感知跨模态语义交互模型
有害模因的日益泛滥对社会产生了严重的负面影响,对这些模因的检测是一项艰巨的挑战。以往的研究主要集中在模因的模态和语义属性上,而忽视了模因跨模态交互作用和详细语义信息的重要性。尽管一些方法结合了大型语言模型,但由于伦理约束,它们往往存在有害回避的问题。为了解决这些问题,我们提出了一种新的情感感知跨模态语义交互检测器,该检测器通过三个主要维度:语义提取、模态交互和情感极性评估来深入研究其深远影响。在语义提取模块中,采用可视化问答的方式,将详细的知识和描述融合在一起。对于模态交互,研究模因对象与文本之间的位置关系,建立基于距离的注意多模态检测器。在情感极性模块中,判断文本的情感极性。这些组件被整合成一个有凝聚力的联合检测系统。在三个基准数据集上进行的广泛实验表明,SSID显著优于最先进的基线,提高了检测精度并表现出鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Decision Support Systems
Decision Support Systems 工程技术-计算机:人工智能
CiteScore
14.70
自引率
6.70%
发文量
119
审稿时长
13 months
期刊介绍: The common thread of articles published in Decision Support Systems is their relevance to theoretical and technical issues in the support of enhanced decision making. The areas addressed may include foundations, functionality, interfaces, implementation, impacts, and evaluation of decision support systems (DSSs).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信