超越分解:多文档问答中的分层依赖管理

IF 2.8 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS
Xiaoyan Zheng, Zhi Li, Qianglong Chen, Yin Zhang
{"title":"超越分解:多文档问答中的分层依赖管理","authors":"Xiaoyan Zheng,&nbsp;Zhi Li,&nbsp;Qianglong Chen,&nbsp;Yin Zhang","doi":"10.1002/asi.24971","DOIUrl":null,"url":null,"abstract":"<p>When using retrieval-augmented generation (RAG) to handle multi-document question answering (MDQA) tasks, it is beneficial to decompose complex queries into multiple simpler ones to enhance retrieval results. However, previous strategies always employ a one-shot approach of question decomposition, overlooking subquestions dependency problem and failing to ensure that the derived subqueries are single-hop. To overcome this challenge, we introduce a novel framework called DSRC-QCS. Decompose-solve-renewal-cycle (DSRC) is an iterative multi-hop question processing module. The key idea of DSRC involves using a unique symbol to achieve hierarchical dependency management and employing a cyclical process of question decomposition, solving, and renewal to continuously generate and resolve all single-hop subquestions. Query-chain selector (QCS) functions as a voting mechanism that effectively utilizes the reasoning process of DSRC to assess and select solutions. We compare DSRC-QCS against five RAG approaches across three datasets and three LLMs. DSRC-QCS demonstrates superior performance. Compared to the Direct Retrieval method, DSRC-QCS improves the average F1 score by 17.36% with Alpaca-7b, 10.83% with LLaMa2-Chat-7b, and 11.88% with GPT-3.5-Turbo. We also conduct ablation studies to validate the performance of both DSRC and QCS and explore factors influencing the effectiveness of DSRC. We have included all prompts in the Appendix.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 5","pages":"770-789"},"PeriodicalIF":2.8000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Beyond decomposition: Hierarchical dependency management in multi-document question answering\",\"authors\":\"Xiaoyan Zheng,&nbsp;Zhi Li,&nbsp;Qianglong Chen,&nbsp;Yin Zhang\",\"doi\":\"10.1002/asi.24971\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>When using retrieval-augmented generation (RAG) to handle multi-document question answering (MDQA) tasks, it is beneficial to decompose complex queries into multiple simpler ones to enhance retrieval results. However, previous strategies always employ a one-shot approach of question decomposition, overlooking subquestions dependency problem and failing to ensure that the derived subqueries are single-hop. To overcome this challenge, we introduce a novel framework called DSRC-QCS. Decompose-solve-renewal-cycle (DSRC) is an iterative multi-hop question processing module. The key idea of DSRC involves using a unique symbol to achieve hierarchical dependency management and employing a cyclical process of question decomposition, solving, and renewal to continuously generate and resolve all single-hop subquestions. Query-chain selector (QCS) functions as a voting mechanism that effectively utilizes the reasoning process of DSRC to assess and select solutions. We compare DSRC-QCS against five RAG approaches across three datasets and three LLMs. DSRC-QCS demonstrates superior performance. Compared to the Direct Retrieval method, DSRC-QCS improves the average F1 score by 17.36% with Alpaca-7b, 10.83% with LLaMa2-Chat-7b, and 11.88% with GPT-3.5-Turbo. We also conduct ablation studies to validate the performance of both DSRC and QCS and explore factors influencing the effectiveness of DSRC. We have included all prompts in the Appendix.</p>\",\"PeriodicalId\":48810,\"journal\":{\"name\":\"Journal of the Association for Information Science and Technology\",\"volume\":\"76 5\",\"pages\":\"770-789\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2024-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Association for Information Science and Technology\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/asi.24971\",\"RegionNum\":2,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Association for Information Science and Technology","FirstCategoryId":"91","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/asi.24971","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

在使用检索增强生成(RAG)处理多文档问答(MDQA)任务时,将复杂的查询分解为多个更简单的查询有助于提高检索结果。然而,以前的策略总是采用一次性的问题分解方法,忽略了子问题的依赖性问题,不能确保派生的子查询是单跳的。为了克服这一挑战,我们引入了一个名为DSRC-QCS的新框架。分解-求解-更新循环(DSRC)是一个迭代的多跳问题处理模块。DSRC的核心思想是使用唯一的符号来实现分层依赖管理,并采用问题分解、求解和更新的循环过程来连续地生成和解决所有单跳子问题。查询链选择器(Query-chain selector, QCS)作为一种投票机制,有效地利用DSRC的推理过程来评估和选择解决方案。我们在三个数据集和三个llm中比较了DSRC-QCS与五种RAG方法。DSRC-QCS性能优越。与直接检索方法相比,DSRC-QCS对Alpaca-7b、LLaMa2-Chat-7b和GPT-3.5-Turbo的平均F1分数分别提高了17.36%、10.83%和11.88%。我们还进行了消融研究,以验证DSRC和QCS的性能,并探讨影响DSRC有效性的因素。我们在附录中包含了所有提示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Beyond decomposition: Hierarchical dependency management in multi-document question answering

When using retrieval-augmented generation (RAG) to handle multi-document question answering (MDQA) tasks, it is beneficial to decompose complex queries into multiple simpler ones to enhance retrieval results. However, previous strategies always employ a one-shot approach of question decomposition, overlooking subquestions dependency problem and failing to ensure that the derived subqueries are single-hop. To overcome this challenge, we introduce a novel framework called DSRC-QCS. Decompose-solve-renewal-cycle (DSRC) is an iterative multi-hop question processing module. The key idea of DSRC involves using a unique symbol to achieve hierarchical dependency management and employing a cyclical process of question decomposition, solving, and renewal to continuously generate and resolve all single-hop subquestions. Query-chain selector (QCS) functions as a voting mechanism that effectively utilizes the reasoning process of DSRC to assess and select solutions. We compare DSRC-QCS against five RAG approaches across three datasets and three LLMs. DSRC-QCS demonstrates superior performance. Compared to the Direct Retrieval method, DSRC-QCS improves the average F1 score by 17.36% with Alpaca-7b, 10.83% with LLaMa2-Chat-7b, and 11.88% with GPT-3.5-Turbo. We also conduct ablation studies to validate the performance of both DSRC and QCS and explore factors influencing the effectiveness of DSRC. We have included all prompts in the Appendix.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
8.30
自引率
8.60%
发文量
115
期刊介绍: The Journal of the Association for Information Science and Technology (JASIST) is a leading international forum for peer-reviewed research in information science. For more than half a century, JASIST has provided intellectual leadership by publishing original research that focuses on the production, discovery, recording, storage, representation, retrieval, presentation, manipulation, dissemination, use, and evaluation of information and on the tools and techniques associated with these processes. The Journal welcomes rigorous work of an empirical, experimental, ethnographic, conceptual, historical, socio-technical, policy-analytic, or critical-theoretical nature. JASIST also commissions in-depth review articles (“Advances in Information Science”) and reviews of print and other media.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信