ExaRanker: Synthetic Explanations Improve Neural Rankers

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval Pub Date : 2023-07-18 DOI:10.1145/3539618.3592067

Fernando Ferraretto, Thiago Laitz, R. Lotufo, Rodrigo Nogueira

引用次数: 0

Abstract

Recent work has shown that incorporating explanations into the output generated by large language models (LLMs) can significantly enhance performance on a broad spectrum of reasoning tasks. Our study extends these findings by demonstrating the benefits of explanations for neural rankers. By utilizing LLMs such as GPT-3.5 to enrich retrieval datasets with explanations, we trained a sequence-to-sequence ranking model, dubbed ExaRanker, to generate relevance labels and explanations for query-document pairs. The ExaRanker model, finetuned on a limited number of examples and synthetic explanations, exhibits performance comparable to models finetuned on three times more examples, but without explanations. Moreover, incorporating explanations imposes no additional computational overhead into the reranking step and allows for on-demand explanation generation. The codebase and datasets used in this study will be available at https://github.com/unicamp-dl/ExaRanker

查看原文本刊更多论文

ExaRanker:综合解释提高神经排序器

最近的研究表明，将解释合并到大型语言模型(llm)生成的输出中可以显著提高广泛推理任务的性能。我们的研究扩展了这些发现，证明了解释神经排名的好处。通过使用GPT-3.5等llm来丰富检索数据集的解释，我们训练了一个序列到序列排序模型，称为ExaRanker，以生成查询文档对的相关标签和解释。ExaRanker模型在有限数量的示例和综合解释上进行了微调，其性能可与在三倍以上的示例上进行微调的模型相媲美，但没有解释。此外，合并解释不会给重新排序步骤带来额外的计算开销，并允许按需生成解释。本研究中使用的代码库和数据集可在https://github.com/unicamp-dl/ExaRanker上获得

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

自引率

0.00%

发文量