Information Processing & Management: Latest Articles

DocTER: Evaluating document-based knowledge editing
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 63, No. 1, Article 104299. Pub Date: 2025-07-23. DOI: 10.1016/j.ipm.2025.104299
Suhang Wu, Ante Wang, Minlong Peng, Yujie Lin, Wenbo Li, Mingming Sun, Jinsong Su
Abstract: Knowledge editing aims to correct outdated or inaccurate knowledge in neural networks. In this paper, we explore knowledge editing using easily accessible documents instead of the manually labeled factual triples employed in earlier research. To advance this field, we establish the first evaluation benchmark, DocTER, featuring Documents containing counTERfactual knowledge for editing. A comprehensive four-perspective evaluation is introduced: Edit Success, Locality, Reasoning, and Cross-lingual Transfer, comprising 2,000, 2,000, 583, and 1,000 test cases, respectively. To adapt conventional triplet-based knowledge editing methods for this task, we develop an Extract-then-Edit pipeline that extracts triples from documents before applying existing methods. Experiments on popular knowledge editing methods demonstrate that editing with documents presents significantly greater challenges than using triples. In document-based scenarios, even the best-performing in-context editing approach still lags behind by 10 points in editing success when compared to using gold triples. This observation also holds for both reasoning and cross-lingual test sets. We further analyze key factors influencing task performance, including the quality of extracted triples, the frequency and position of edited knowledge in documents, various methods for enhancing reasoning, and performance differences across various directions in cross-lingual knowledge editing, which provide valuable insights for future research.
Citations: 0
SMR-agents: Synergistic medical reasoning agents for zero-shot medical visual question answering with MLLMs
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 63, No. 1, Article 104297. Pub Date: 2025-07-22. DOI: 10.1016/j.ipm.2025.104297
Dujuan Wang, Tao Cheng, Sutong Wang, Youhua (Frank) Chen, Yunqiang Yin
Abstract: Existing medical visual question answering (Med-VQA) systems often lack transparent reasoning and robustness, limiting their clinical reliability. This study proposes the Synergistic Medical Reasoning Agents (SMR-Agents) framework to address these limitations by simulating collaborative consultation among multidisciplinary medical expert agents, thereby enhancing interpretability and diagnostic reliability. SMR-Agents first constructs a structured medical scene graph from the input image and question to identify and highlight relevant visual features. A pre-trained large language model then acts as a general practitioner, automatically selecting and coordinating a team of specialized medical expert agents based on this scene graph. The recruited experts engage in iterative reasoning: domain-specific diagnostic agents generate initial answer hypotheses, and a group of consulting experts conducts a peer-review discussion to refine the answer and formulate its explanatory rationale. The entire process operates in a zero-shot manner without task-specific training of the models. Evaluation on three public Med-VQA datasets and a private colorectal image dataset demonstrates that SMR-Agents achieves state-of-the-art performance across all benchmarks. Notably, it yields significant improvements in accuracy for open-ended questions and produces more interpretable reasoning compared to existing methods. These results demonstrate that combining structured scene understanding with iterative multi-expert collaboration substantially enhances both the accuracy and transparency of Med-VQA systems. The SMR-Agents framework thus provides a robust, interpretable approach to AI-assisted medical diagnosis, aligning machine reasoning with the expert-driven consultation processes used in clinical practice.
Citations: 0
A three-way efficacy prediction method fusing temporal composite rough set and hybrid machine learning models on multigranulation temporal hybrid attribute information system
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 63, No. 1, Article 104300. Pub Date: 2025-07-22. DOI: 10.1016/j.ipm.2025.104300
Xixuan Zhao, Bingzhen Sun, Jin Ye, Jiqian Liu, Xinfang Zhang, Haoran Sun, Xiaoli Chu
Abstract: Efficacy prediction is a key research topic in clinical management practice. To address the general efficacy prediction problem characterized by multigranularity, temporality, and incompleteness, this study proposes a three-way efficacy prediction method that integrates a temporal composite rough set and a hybrid machine learning model (TCRS-HML). First, a multigranulation temporal hybrid attribute information system (MTHAIS) is constructed to handle hybrid attributes exhibiting these characteristics, and the data in MTHAIS is preprocessed using random forest and bag-of-words models. Next, the concept of temporal order is introduced into classical composite rough sets, and temporal equivalence, temporal neighborhood, and temporal similarity relationships are established based on the temporal hybrid attribute matrices of the objects. Subsequently, the definitions of a temporal composite rough set and its attribute reduction method are presented, along with a discussion of their mathematical properties. Finally, efficacy prediction results are obtained by building a hybrid machine learning model pool and selecting the optimal model. Experimental results, based on 493 real temporal medical records from 120 rheumatoid arthritis (RA) patients at the Guangdong Hospital of Traditional Chinese Medicine (2018–2023), show that the accuracy, precision, recall, and F1 score of the proposed method are 0.8012, 0.8241, 0.8012, and 0.7885, respectively. These results outperform those of 17 comparative methods, demonstrating the scientific validity and feasibility of the proposed approach. Furthermore, sensitivity analyses and statistical tests confirm the robustness and generalizability of the method. This study provides a new methodological reference for management science problems such as clinical efficacy prediction and contributes to the integration of rough set theory and machine learning in management and decision sciences.
Citations: 0
Large language models for scholarly ontology generation: An extensive analysis in the engineering field
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 63, No. 1, Article 104262. Pub Date: 2025-07-21. DOI: 10.1016/j.ipm.2025.104262
Tanay Aggarwal, Angelo Salatino, Francesco Osborne, Enrico Motta
Abstract: Ontologies of research topics are crucial for structuring scientific knowledge, enabling scientists to navigate vast amounts of research, and forming the backbone of intelligent systems such as search engines and recommendation systems. However, manual creation of these ontologies is expensive, slow, and often results in outdated and overly general representations. As a solution, researchers have been investigating ways to automate or semi-automate the process of generating these ontologies. One of the key challenges in this domain is accurately assessing the semantic relationships between pairs of research topics. This paper presents an analysis of the capabilities of large language models (LLMs) in identifying such relationships, with a specific focus on the field of engineering. To this end, we introduce a novel benchmark based on the IEEE Thesaurus for evaluating the task of identifying three types of semantic relations between pairs of topics: broader, narrower, and same-as. Our study evaluates the performance of seventeen LLMs, which differ in scale, accessibility (open vs. proprietary), and model type (full vs. quantised), while also assessing four zero-shot reasoning strategies. Several models with varying architectures and sizes have achieved excellent results on this task, including Mixtral-8×7B, Dolphin-Mistral-7B, and Claude 3 Sonnet, with F1-scores of 0.847, 0.920, and 0.967, respectively. Furthermore, our findings demonstrate that smaller, quantised models, when optimised through prompt engineering, can achieve strong performance while requiring very limited computational resources.
Citations: 0
Enhancing knowledge graph interactions: A comprehensive Text-to-Cypher pipeline with large language models
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 63, No. 1, Article 104280. Pub Date: 2025-07-21. DOI: 10.1016/j.ipm.2025.104280
Chao Yang, Changyi Li, Xiaodu Hu, Hao Yu, Jinzhi Lu
Abstract: Knowledge Graphs (KGs) store structured information but typically require specialized query languages, such as Cypher for Neo4j, creating accessibility challenges for users unfamiliar with graph syntax. Large Language Models (LLMs) offer a solution by translating natural language into Cypher queries. However, existing models, including large-scale LLMs (e.g., ChatGPT) and smaller open-source models (e.g., Llama-7B, 8B), often struggle with accurately generating domain-specific queries due to inadequate alignment with KG schemas and limited domain-specific training data. To address these limitations, we propose a training pipeline tailored specifically for domain-aligned Cypher query generation, emphasizing usability for smaller-scale models. Our method integrates template-based synthetic data generation for diverse, high-quality training samples. We combine supervised fine-tuning with preference learning to enhance domain knowledge and Cypher syntax understanding. Additionally, our approach includes a context-aware retrieval mechanism that dynamically incorporates relevant schema elements at inference, improving alignment with domain-specific knowledge. We evaluated our method on the Hetionet biomedical KG using a benchmark dataset of 240 queries across three complexity levels. Our results show that our context-aware prompting achieves a substantial improvement, increasing component matching accuracy by 23.6% for ChatGPT-4o over the vanilla prompt baseline. When applying our full training pipeline to smaller-scale models, CodeLlama-13B* achieves an execution accuracy of 69.2%, nearly matching ChatGPT-4o’s 72.1%. Importantly, our approach significantly narrows the performance gap, enabling smaller models to effectively manage complex, domain-specific tasks previously dominated by larger models. These findings demonstrate that our method is scalable, computationally efficient, and robust for practical Cypher query generation applications.
Citations: 0
Restaurant recommendations under multimodal online reviews: A novel method based on image captioning and text analysis with multi-criteria decision-making
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 63, No. 1, Article 104308. Pub Date: 2025-07-19. DOI: 10.1016/j.ipm.2025.104308
Ziyu Chen, Naijie Chai, Jianqiang Wang, Xiaokang Wang
Abstract: Restaurant selection has become a complex decision-making process for consumers, driven by an overwhelming volume of online reviews. While text and numerical reviews provide valuable insights, the increasing use of visual content further enriches consumer evaluations. However, existing research lacks effective methods for integrating multimodal reviews to facilitate informed decision-making. To address this gap, this paper proposes a novel approach for restaurant selection based on multimodal online reviews. Its main contributions are to: (i) employ image captioning techniques to convert image reviews into textual descriptions, bridging the gap between image and text, (ii) apply text analysis methods to extract relevant evaluation criteria from both text and image-generated descriptions, and (iii) integrate insights from both modalities by assessing the object and content consistency between image and text, ensuring the reliability of reviews. The method is applied to Yelp, using a dataset of 31,412 reviews from 10 restaurants. Eight evaluation criteria are extracted from both text and image reviews. The results show that compared with single-modal and dual-modal review-based recommendation methods, the proposed multimodal approach uncovers more comprehensive evaluation criteria and generates more realistic ranking results. Additionally, the proposed information fusion method outperforms traditional fusion methods in effectively integrating multimodal information.
Citations: 0
PVSTrans: Patch-view-shape progressive interaction transformer for 3D shape recognition
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 63, No. 1, Article 104279. Pub Date: 2025-07-19. DOI: 10.1016/j.ipm.2025.104279
Xiangyu Ma, Jing Bai, Zenghui Su, Yubin Wang
Abstract: 3D shape recognition has made substantial progress due to its wide-ranging applications and increasing research interest. Existing studies have investigated the paradigm of aggregating 3D shape descriptors derived from independently extracted view features. However, this stepwise approach has not fully capitalized on the intrinsic correlations between local regions of varying granularity and global shapes. To address this gap, we propose the Patch-View-Shape Progressive Interaction Transformer (PVSTrans), which enhances shape-patch interactions through progressive view-patch and shape-view interactions, effectively capturing essential dependencies among intra-view features, inter-view features, and global 3D shape features. Furthermore, by utilizing the byproducts of the progressive interaction process, specifically the attention weights of views and intra-view patches, we introduce a Shape-Guided Patch Selection strategy to dynamically identify significant patches in each view, which, in conjunction with the multi-view features, form a more informative 3D shape descriptor for final classification. Experimental results across diverse datasets, including ModelNet40, ScanObjectNN, FG3D, and ShapeNet Core55, demonstrate the effectiveness and generalizability of PVSTrans in 3D shape recognition tasks. Additionally, comprehensive experiments involving various views with differing quantities and spatial relations highlight the robustness of PVSTrans in handling incomplete views and irregular spatial configurations, showcasing its substantial potential for application in complex real-world scenarios. The code is available at https://github.com/Oli-lab-nun/PVSTrans.
Citations: 0
Network dismantling with community-based edge percolation
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 63, No. 1, Article 104295. Pub Date: 2025-07-18. DOI: 10.1016/j.ipm.2025.104295
Min Wu, Bitao Dai, Wu Shi, Jianhong Mou, Suoyi Tan, Stefano Boccaletti, Xin Lu
Abstract: Traditional node-based dismantling strategies, which remove nodes along with their associated edges, tend to incur high costs. In contrast, edge-based strategies are more cost-effective but often suffer from low efficiency due to the large number of edges in most networks. To address these challenges, we propose a divide-and-conquer framework that reinterprets network-level dismantling as cluster-level dismantling. Specifically, we integrate community detection with explosive percolation to develop the Community-based Edge Percolation (CEP) algorithm, which targets critical edges whose removal effectively breaks the network into subcritical components, thereby optimizing dismantling efficiency while minimizing costs. Experiments on 38 synthetic networks derived from four different models, as well as on nine empirical networks, show that CEP consistently outperforms state-of-the-art (SOTA) algorithms across nearly all datasets, yielding improvements of up to 30.611% in f_c and 67.108% in Schneider R. Further analysis indicates that the sets of removed edges identified by CEP have a low correlation with those identified by other benchmarks, underlining its novelty and superior capability in identifying critical edges. Overall, we propose a universal and efficient edge dismantling framework that exhibits substantial advantages in large-scale empirical networks, offering valuable insights into network robustness.
Citations: 0
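The network dismantling abstract above reports gains in Schneider R without defining it. As a rough illustration only, the sketch below computes one common edge-removal form of that robustness score: the average, over all removal steps, of the fraction of nodes left in the largest connected component. The function names and the normalization by the number of removed edges are assumptions made here for illustration; the paper's exact definition may differ, and this is not the CEP algorithm itself.

```python
from collections import defaultdict


def largest_component_fraction(n, edges):
    """Fraction of the n nodes (labelled 0..n-1) in the largest connected component."""
    adj = defaultdict(list)
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    seen = set()
    best = 0
    for start in range(n):
        if start in seen:
            continue
        # Depth-first traversal of one component, counting its nodes.
        seen.add(start)
        stack = [start]
        size = 0
        while stack:
            node = stack.pop()
            size += 1
            for nb in adj[node]:
                if nb not in seen:
                    seen.add(nb)
                    stack.append(nb)
        best = max(best, size)
    return best / n


def schneider_r(n, edges, removal_order):
    """Assumed edge-removal variant of Schneider R: mean largest-component
    fraction measured after each edge in removal_order is removed."""
    remaining = list(edges)
    total = 0.0
    for edge in removal_order:
        remaining.remove(edge)
        total += largest_component_fraction(n, remaining)
    return total / len(removal_order)


# Toy path graph 0-1-2-3: cutting the middle edge first fragments it faster.
path_edges = [(0, 1), (1, 2), (2, 3)]
r_middle_first = schneider_r(4, path_edges, [(1, 2), (0, 1), (2, 3)])
r_end_first = schneider_r(4, path_edges, [(0, 1), (1, 2), (2, 3)])
```

Under this definition a lower R means the removal order fragments the network sooner, which is why removing the middle edge of the toy path first scores lower than starting from an end.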
Automated rhetorical move and step recognition in fact-checking articles with neural models
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 62, No. 6, Article 104293. Pub Date: 2025-07-17. DOI: 10.1016/j.ipm.2025.104293
Xinxue Liu, Ningyuan Song, Kejun Chen, Ye Chen, Lei Pei
Abstract: As online misinformation has drawn social concerns, extensive efforts have been dedicated to fact-checking, which can help contain the spread of misinformation. Among them, persuasive fact-checking articles play a fundamental role, but little work has focused on their discourse structures, which are important for understanding how they work. Rhetorical moves and steps, common in genre analysis, can be used to figure out text structures and communicative goals. Based on existing literature, this research first summarizes a rhetorical structure comprising five moves and six steps for fact-checking articles, which describes how they are organized to achieve the persuasive purpose. We then produce a corpus including 420 articles with annotations of our structures. For automated recognition, we propose our BiLSTM with Hierarchical Attention model, which achieves micro-F1 scores of 70% and 61.5% for moves and steps, respectively. The performance and subsequent ablation study demonstrate the effectiveness of our model. Utilizing it, an analysis of the distribution and patterns of moves and steps is conducted on an expanded set of 3800 fact-checking articles. Accordingly, we find that the distributions of rhetorical structures in articles have common characteristics and unique differences, reflecting the strategies used when writing. We further conducted sequence mining, and the obtained frequent sequences can help improve fact-checking writing and provide new ideas for studying the relationship between fact-checking texts and their persuasive effects. Generally, the fact-checking rhetorical structures and the automated model proposed in this work have the potential to help leverage the fact-checking corpus and finally contribute to rebutting misinformation.
Citations: 0
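The fact-checking abstract above reports micro-F1 for move and step recognition. As a reminder of what that metric pools, here is a minimal sketch that sums true positives, false positives, and false negatives over all labels before computing F1. The label names ("M1", "M2", "M3") and the choice to represent each text unit as a set of labels are assumptions for illustration; the abstract does not say how units or labels are encoded.

```python
def micro_f1(gold, pred):
    """Micro-averaged F1 over parallel lists of label sets (one set per text unit)."""
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        tp += len(g & p)   # labels predicted and correct
        fp += len(p - g)   # labels predicted but not in the gold set
        fn += len(g - p)   # gold labels that were missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical move labels for four sentences; the third one is mislabelled.
gold_moves = [{"M1"}, {"M2"}, {"M3"}, {"M1"}]
pred_moves = [{"M1"}, {"M2"}, {"M1"}, {"M1"}]
score = micro_f1(gold_moves, pred_moves)
```

When every unit carries exactly one gold label and one predicted label, as in this toy input, micro-F1 coincides with plain accuracy.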
LLM-Enhanced Multi-Task Joint Learning Model for Misinformation Detection
IF 7.4, Zone 1, Management
Information Processing & Management, Vol. 62, No. 6, Article 104305. Pub Date: 2025-07-17. DOI: 10.1016/j.ipm.2025.104305
Gang Ren, Li Jiang, Tingting Huang, Ying Yang, Ruida Xie
Abstract: The coexistence of Human-Generated Content (HGC) and Artificial Intelligence-Generated Content (AIGC) versions of the same event on social media presents significant challenges for governmental governance and information regulation. In this study, we propose a Large Language Model-enhanced Multi-Task Joint Learning Model for Misinformation Detection (LMTMD) to address the challenge of mixed HGC and AIGC on social media. We design a two-stage instruction, leveraging large language models (LLMs) for data augmentation to generate AIGC versions of events. Furthermore, a novel unsupervised joint learning strategy is proposed, which incorporates content consistency contrastive learning and difference consistency learning. The strategy aims to preserve both the consistency of event content and the heterogeneity between AIGC and HGC. Extensive experiments conducted on real-world datasets, including Weibo and GossipCop, demonstrate that the proposed model outperforms state-of-the-art baselines, achieving a Consistent Match Accuracy (CM-Acc) of 77.21% on the Weibo dataset and 78.13% on the GossipCop dataset. Additionally, the model achieves AIGC detection accuracy rates of 90.58% on the Weibo dataset and 90.95% on the GossipCop dataset, thereby validating the effectiveness of both the model and the joint learning strategy. Our model can effectively adapt to the emerging scenario of mixed HGC and AIGC versions of events on social platforms and enriches the research perspective of misinformation detection.
Citations: 0