FoRKER:具有知识编辑和自我反思功能的专注推理者

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Chunbai Zhang , Haoyang Li , Chao Wang , Yang Zhou , Yan Peng
{"title":"FoRKER:具有知识编辑和自我反思功能的专注推理者","authors":"Chunbai Zhang ,&nbsp;Haoyang Li ,&nbsp;Chao Wang ,&nbsp;Yang Zhou ,&nbsp;Yan Peng","doi":"10.1016/j.eswa.2025.129771","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-hop question answering (MHQA) is a complex question answering (QA) benchmark that requires agents to integrate information from diverse sources and utilize cross-referencing reasoning to answer intricate questions. Existing MHQA-handling frameworks typically employ a <em>retrieve-read</em> paradigm. However, these efforts rooted in the retrieve-read paradigm are still constrained by: 1) <em>unstable document retrieval performance</em>, 2) <em>weak knowledge refinement capabilities</em>, and 3) <em>the absence of a reflection mechanism for error awareness</em>. To address these limitations, we propose <span>FoRKER</span> (<strong>Fo</strong>cused <strong>R</strong>easoner with <strong>K</strong>nowledge <strong>E</strong>diting and Self-<strong>R</strong>eflection), which is a plug-and-play framework. Specifically, we develop a novel progressive focusing mechanism to pinpoint highly relevant document resources and introduce knowledge editing techniques to further eliminate noise interference within textual information. Additionally, we design a novel prompting method, named Chain-of-Evidence (CoE), which is designed to augment the reasoning capabilities of <span>FoRKER</span>. Notably, the integration of Self-Reflection technology further endows <span>FoRKER</span> with the ability to learn and improve from its mistakes. Extensive experiments on widely-used datasets demonstrate that <span>FoRKER</span> achieves new state-of-the-art results in information retrieval and reading comprehension, while also exhibiting effective generalization. Exhilaratingly, on the MusiqueQA dataset, <span>FoRKER</span> demonstrates a 20 % improvement in Answering scores compared to the advanced competitors.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"298 ","pages":"Article 129771"},"PeriodicalIF":7.5000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FoRKER: Focused reasoner with knowledge editing and self-reflection\",\"authors\":\"Chunbai Zhang ,&nbsp;Haoyang Li ,&nbsp;Chao Wang ,&nbsp;Yang Zhou ,&nbsp;Yan Peng\",\"doi\":\"10.1016/j.eswa.2025.129771\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-hop question answering (MHQA) is a complex question answering (QA) benchmark that requires agents to integrate information from diverse sources and utilize cross-referencing reasoning to answer intricate questions. Existing MHQA-handling frameworks typically employ a <em>retrieve-read</em> paradigm. However, these efforts rooted in the retrieve-read paradigm are still constrained by: 1) <em>unstable document retrieval performance</em>, 2) <em>weak knowledge refinement capabilities</em>, and 3) <em>the absence of a reflection mechanism for error awareness</em>. To address these limitations, we propose <span>FoRKER</span> (<strong>Fo</strong>cused <strong>R</strong>easoner with <strong>K</strong>nowledge <strong>E</strong>diting and Self-<strong>R</strong>eflection), which is a plug-and-play framework. Specifically, we develop a novel progressive focusing mechanism to pinpoint highly relevant document resources and introduce knowledge editing techniques to further eliminate noise interference within textual information. Additionally, we design a novel prompting method, named Chain-of-Evidence (CoE), which is designed to augment the reasoning capabilities of <span>FoRKER</span>. Notably, the integration of Self-Reflection technology further endows <span>FoRKER</span> with the ability to learn and improve from its mistakes. Extensive experiments on widely-used datasets demonstrate that <span>FoRKER</span> achieves new state-of-the-art results in information retrieval and reading comprehension, while also exhibiting effective generalization. Exhilaratingly, on the MusiqueQA dataset, <span>FoRKER</span> demonstrates a 20 % improvement in Answering scores compared to the advanced competitors.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"298 \",\"pages\":\"Article 129771\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S095741742503386X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095741742503386X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

多跳问答(Multi-hop question answer, MHQA)是一种复杂的问答(QA)基准,它要求智能体集成来自不同来源的信息,并利用交叉引用推理来回答复杂的问题。现有的mhqa处理框架通常采用检索-读取范式。然而,这些基于检索-读取范式的努力仍然受到以下方面的限制:1)不稳定的文档检索性能,2)较弱的知识细化能力,以及3)缺乏错误感知的反映机制。为了解决这些限制,我们提出了FoRKER(具有知识编辑和自我反思的集中推理器),这是一个即插即用框架。具体来说,我们开发了一种新的渐进式聚焦机制来精确定位高度相关的文档资源,并引入知识编辑技术来进一步消除文本信息中的噪声干扰。此外,我们设计了一种新的提示方法,称为证据链(CoE),旨在增强FoRKER的推理能力。值得注意的是,Self-Reflection技术的集成进一步赋予了FoRKER从错误中学习和改进的能力。在广泛使用的数据集上进行的大量实验表明,FoRKER在信息检索和阅读理解方面取得了最新的成果,同时也表现出了有效的泛化。令人振奋的是,在MusiqueQA数据集上,与先进的竞争对手相比,FoRKER的回答分数提高了20%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
FoRKER: Focused reasoner with knowledge editing and self-reflection
Multi-hop question answering (MHQA) is a complex question answering (QA) benchmark that requires agents to integrate information from diverse sources and utilize cross-referencing reasoning to answer intricate questions. Existing MHQA-handling frameworks typically employ a retrieve-read paradigm. However, these efforts rooted in the retrieve-read paradigm are still constrained by: 1) unstable document retrieval performance, 2) weak knowledge refinement capabilities, and 3) the absence of a reflection mechanism for error awareness. To address these limitations, we propose FoRKER (Focused Reasoner with Knowledge Editing and Self-Reflection), which is a plug-and-play framework. Specifically, we develop a novel progressive focusing mechanism to pinpoint highly relevant document resources and introduce knowledge editing techniques to further eliminate noise interference within textual information. Additionally, we design a novel prompting method, named Chain-of-Evidence (CoE), which is designed to augment the reasoning capabilities of FoRKER. Notably, the integration of Self-Reflection technology further endows FoRKER with the ability to learn and improve from its mistakes. Extensive experiments on widely-used datasets demonstrate that FoRKER achieves new state-of-the-art results in information retrieval and reading comprehension, while also exhibiting effective generalization. Exhilaratingly, on the MusiqueQA dataset, FoRKER demonstrates a 20 % improvement in Answering scores compared to the advanced competitors.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信