基于llm的错误信息检测多任务联合学习模型

IF 7.4 1区管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Processing & Management Pub Date : 2025-07-17 DOI:10.1016/j.ipm.2025.104305

Gang Ren , Li Jiang , Tingting Huang , Ying Yang , Ruida Xie

{"title":"基于llm的错误信息检测多任务联合学习模型","authors":"Gang Ren , Li Jiang , Tingting Huang , Ying Yang , Ruida Xie","doi":"10.1016/j.ipm.2025.104305","DOIUrl":null,"url":null,"abstract":"<div><div>The coexistence of Human-Generated Content (HGC) and Artificial Intelligence-Generated Content (AIGC) versions of the same event on social media presents significant challenges for governmental governance and information regulation. In this study, we propose a Large Language Model-enhanced Multi-Task Joint Learning Model for Misinformation Detection (LMTMD) to address the challenge of mixed HGC and AIGC on social media. We design a two-stage instruction, leveraging large language models (LLMs) for data augmentation to generate AIGC versions of events. Furthermore, a novel unsupervised joint learning strategy is proposed, which incorporates content consistency contrastive learning and difference consistency learning. The strategy aims to preserve both the consistency of event content and the heterogeneity between AIGC and HGC. Extensive experiments conducted on real-world datasets, including Weibo and GossipCop, demonstrate that the proposed model outperforms state-of-the-art baselines, achieving a Consistent Match Accuracy (CM-Acc) of 77.21% on the Weibo dataset and 78.13% on the GossipCop dataset. Additionally, the model achieves AIGC detection accuracy rates of 90.58% on the Weibo dataset and 90.95% on the GossipCop dataset, thereby validating the effectiveness of both the model and the joint learning strategy. Our model can effectively adapt to the emerging scenario of mixed HGC and AIGC versions of events on social platforms and enriches the research perspective of misinformation detection.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 6","pages":"Article 104305"},"PeriodicalIF":7.4000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LLM-Enhanced Multi-Task Joint Learning Model for Misinformation Detection\",\"authors\":\"Gang Ren , Li Jiang , Tingting Huang , Ying Yang , Ruida Xie\",\"doi\":\"10.1016/j.ipm.2025.104305\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The coexistence of Human-Generated Content (HGC) and Artificial Intelligence-Generated Content (AIGC) versions of the same event on social media presents significant challenges for governmental governance and information regulation. In this study, we propose a Large Language Model-enhanced Multi-Task Joint Learning Model for Misinformation Detection (LMTMD) to address the challenge of mixed HGC and AIGC on social media. We design a two-stage instruction, leveraging large language models (LLMs) for data augmentation to generate AIGC versions of events. Furthermore, a novel unsupervised joint learning strategy is proposed, which incorporates content consistency contrastive learning and difference consistency learning. The strategy aims to preserve both the consistency of event content and the heterogeneity between AIGC and HGC. Extensive experiments conducted on real-world datasets, including Weibo and GossipCop, demonstrate that the proposed model outperforms state-of-the-art baselines, achieving a Consistent Match Accuracy (CM-Acc) of 77.21% on the Weibo dataset and 78.13% on the GossipCop dataset. Additionally, the model achieves AIGC detection accuracy rates of 90.58% on the Weibo dataset and 90.95% on the GossipCop dataset, thereby validating the effectiveness of both the model and the joint learning strategy. Our model can effectively adapt to the emerging scenario of mixed HGC and AIGC versions of events on social platforms and enriches the research perspective of misinformation detection.</div></div>\",\"PeriodicalId\":50365,\"journal\":{\"name\":\"Information Processing & Management\",\"volume\":\"62 6\",\"pages\":\"Article 104305\"},\"PeriodicalIF\":7.4000,\"publicationDate\":\"2025-07-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing & Management\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306457325002468\",\"RegionNum\":1,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325002468","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

社交媒体上同一事件的人工生成内容（HGC）和人工智能生成内容（AIGC）版本共存，给政府治理和信息监管带来了重大挑战。在这项研究中，我们提出了一个大语言模型增强的多任务联合学习模型用于错误信息检测（LMTMD），以解决社交媒体上混合HGC和AIGC的挑战。我们设计了一个两阶段指令，利用大型语言模型（llm）进行数据增强，以生成事件的AIGC版本。在此基础上，提出了一种融合内容一致性、对比学习和差异一致性学习的无监督联合学习策略。该策略旨在既保持事件内容的一致性，又保持AIGC和HGC之间的异质性。在包括微博和八卦cop在内的真实数据集上进行的大量实验表明，所提出的模型优于最先进的基线，在微博数据集上实现了77.21%的一致匹配精度（CM-Acc），在八卦cop数据集上实现了78.13%。此外，该模型在微博数据集和GossipCop数据集上的AIGC检测准确率分别达到90.58%和90.95%，从而验证了该模型和联合学习策略的有效性。我们的模型可以有效适应社交平台上事件HGC和AIGC混合版本的新兴场景，丰富了错误信息检测的研究视角。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

LLM-Enhanced Multi-Task Joint Learning Model for Misinformation Detection

The coexistence of Human-Generated Content (HGC) and Artificial Intelligence-Generated Content (AIGC) versions of the same event on social media presents significant challenges for governmental governance and information regulation. In this study, we propose a Large Language Model-enhanced Multi-Task Joint Learning Model for Misinformation Detection (LMTMD) to address the challenge of mixed HGC and AIGC on social media. We design a two-stage instruction, leveraging large language models (LLMs) for data augmentation to generate AIGC versions of events. Furthermore, a novel unsupervised joint learning strategy is proposed, which incorporates content consistency contrastive learning and difference consistency learning. The strategy aims to preserve both the consistency of event content and the heterogeneity between AIGC and HGC. Extensive experiments conducted on real-world datasets, including Weibo and GossipCop, demonstrate that the proposed model outperforms state-of-the-art baselines, achieving a Consistent Match Accuracy (CM-Acc) of 77.21% on the Weibo dataset and 78.13% on the GossipCop dataset. Additionally, the model achieves AIGC detection accuracy rates of 90.58% on the Weibo dataset and 90.95% on the GossipCop dataset, thereby validating the effectiveness of both the model and the joint learning strategy. Our model can effectively adapt to the emerging scenario of mixed HGC and AIGC versions of events on social platforms and enriches the research perspective of misinformation detection.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Information Processing & Management 工程技术-计算机：信息系统

CiteScore

17.00

自引率

11.60%

发文量

276

审稿时长

39 days

期刊介绍： Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing. We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.