FoRKER：具有知识编辑和自我反思功能的专注推理者

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2025-09-19 DOI:10.1016/j.eswa.2025.129771

Chunbai Zhang , Haoyang Li , Chao Wang , Yang Zhou , Yan Peng

{"title":"FoRKER：具有知识编辑和自我反思功能的专注推理者","authors":"Chunbai Zhang , Haoyang Li , Chao Wang , Yang Zhou , Yan Peng","doi":"10.1016/j.eswa.2025.129771","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-hop question answering (MHQA) is a complex question answering (QA) benchmark that requires agents to integrate information from diverse sources and utilize cross-referencing reasoning to answer intricate questions. Existing MHQA-handling frameworks typically employ a retrieve-read paradigm. However, these efforts rooted in the retrieve-read paradigm are still constrained by: 1) unstable document retrieval performance, 2) weak knowledge refinement capabilities, and 3) the absence of a reflection mechanism for error awareness. To address these limitations, we propose FoRKER (Focused Reasoner with Knowledge Editing and Self-Reflection), which is a plug-and-play framework. Specifically, we develop a novel progressive focusing mechanism to pinpoint highly relevant document resources and introduce knowledge editing techniques to further eliminate noise interference within textual information. Additionally, we design a novel prompting method, named Chain-of-Evidence (CoE), which is designed to augment the reasoning capabilities of FoRKER. Notably, the integration of Self-Reflection technology further endows FoRKER with the ability to learn and improve from its mistakes. Extensive experiments on widely-used datasets demonstrate that FoRKER achieves new state-of-the-art results in information retrieval and reading comprehension, while also exhibiting effective generalization. Exhilaratingly, on the MusiqueQA dataset, FoRKER demonstrates a 20 % improvement in Answering scores compared to the advanced competitors.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"298 ","pages":"Article 129771"},"PeriodicalIF":7.5000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FoRKER: Focused reasoner with knowledge editing and self-reflection\",\"authors\":\"Chunbai Zhang , Haoyang Li , Chao Wang , Yang Zhou , Yan Peng\",\"doi\":\"10.1016/j.eswa.2025.129771\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-hop question answering (MHQA) is a complex question answering (QA) benchmark that requires agents to integrate information from diverse sources and utilize cross-referencing reasoning to answer intricate questions. Existing MHQA-handling frameworks typically employ a retrieve-read paradigm. However, these efforts rooted in the retrieve-read paradigm are still constrained by: 1) unstable document retrieval performance, 2) weak knowledge refinement capabilities, and 3) the absence of a reflection mechanism for error awareness. To address these limitations, we propose FoRKER (Focused Reasoner with Knowledge Editing and Self-Reflection), which is a plug-and-play framework. Specifically, we develop a novel progressive focusing mechanism to pinpoint highly relevant document resources and introduce knowledge editing techniques to further eliminate noise interference within textual information. Additionally, we design a novel prompting method, named Chain-of-Evidence (CoE), which is designed to augment the reasoning capabilities of FoRKER. Notably, the integration of Self-Reflection technology further endows FoRKER with the ability to learn and improve from its mistakes. Extensive experiments on widely-used datasets demonstrate that FoRKER achieves new state-of-the-art results in information retrieval and reading comprehension, while also exhibiting effective generalization. Exhilaratingly, on the MusiqueQA dataset, FoRKER demonstrates a 20 % improvement in Answering scores compared to the advanced competitors.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"298 \",\"pages\":\"Article 129771\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S095741742503386X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095741742503386X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

多跳问答（Multi-hop question answer， MHQA）是一种复杂的问答（QA）基准，它要求智能体集成来自不同来源的信息，并利用交叉引用推理来回答复杂的问题。现有的mhqa处理框架通常采用检索-读取范式。然而，这些基于检索-读取范式的努力仍然受到以下方面的限制：1)不稳定的文档检索性能，2)较弱的知识细化能力，以及3)缺乏错误感知的反映机制。为了解决这些限制，我们提出了FoRKER（具有知识编辑和自我反思的集中推理器），这是一个即插即用框架。具体来说，我们开发了一种新的渐进式聚焦机制来精确定位高度相关的文档资源，并引入知识编辑技术来进一步消除文本信息中的噪声干扰。此外，我们设计了一种新的提示方法，称为证据链（CoE），旨在增强FoRKER的推理能力。值得注意的是，Self-Reflection技术的集成进一步赋予了FoRKER从错误中学习和改进的能力。在广泛使用的数据集上进行的大量实验表明，FoRKER在信息检索和阅读理解方面取得了最新的成果，同时也表现出了有效的泛化。令人振奋的是，在MusiqueQA数据集上，与先进的竞争对手相比，FoRKER的回答分数提高了20%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

FoRKER: Focused reasoner with knowledge editing and self-reflection

Multi-hop question answering (MHQA) is a complex question answering (QA) benchmark that requires agents to integrate information from diverse sources and utilize cross-referencing reasoning to answer intricate questions. Existing MHQA-handling frameworks typically employ a retrieve-read paradigm. However, these efforts rooted in the retrieve-read paradigm are still constrained by: 1) unstable document retrieval performance, 2) weak knowledge refinement capabilities, and 3) the absence of a reflection mechanism for error awareness. To address these limitations, we propose FoRKER (Focused Reasoner with Knowledge Editing and Self-Reflection), which is a plug-and-play framework. Specifically, we develop a novel progressive focusing mechanism to pinpoint highly relevant document resources and introduce knowledge editing techniques to further eliminate noise interference within textual information. Additionally, we design a novel prompting method, named Chain-of-Evidence (CoE), which is designed to augment the reasoning capabilities of FoRKER. Notably, the integration of Self-Reflection technology further endows FoRKER with the ability to learn and improve from its mistakes. Extensive experiments on widely-used datasets demonstrate that FoRKER achieves new state-of-the-art results in information retrieval and reading comprehension, while also exhibiting effective generalization. Exhilaratingly, on the MusiqueQA dataset, FoRKER demonstrates a 20 % improvement in Answering scores compared to the advanced competitors.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.