Language-Agnostic Transformers and Assessing ChatGPT-Based Query Rewriting for Multilingual Document-Grounded QA

Proceedings of the Third DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering Pub Date : 1900-01-01 DOI:10.18653/v1/2023.dialdoc-1.11

Srinivas Gowriraj, Soham Dinesh Tiwari, Mitali Potnis, Srijan Bansal, T. Mitamura, Eric Nyberg

引用次数: 1

Abstract

The DialDoc 2023 shared task has expanded the document-grounded dialogue task to encompass multiple languages, despite having limited annotated data. This paper assesses the effectiveness of both language-agnostic and language-aware paradigms for multilingual pre-trained transformer models in a bi-encoder-based dense passage retriever (DPR), concluding that the language-agnostic approach is superior. Additionally, the study investigates the impact of query rewriting techniques using large language models, such as ChatGPT, on multilingual, document-grounded question-answering systems. The experiments conducted demonstrate that, for the examples examined, query rewriting does not enhance performance compared to the original queries. This failure is due to topic switching in final dialogue turns and irrelevant topics being considered for query rewriting.

查看原文本刊更多论文

基于chatgpt的多语言文档QA查询重写的语言不可知转换和评估

DialDoc 2023共享任务扩展了基于文档的对话任务，使其包含多种语言，尽管注释数据有限。本文在基于双编码器的密集段落检索器(DPR)中评估了语言不可知论范式和语言感知范式对多语言预训练转换模型的有效性，结论是语言不可知论方法更优越。此外，该研究还调查了使用大型语言模型(如ChatGPT)的查询重写技术对多语言、基于文档的问答系统的影响。所进行的实验表明，对于所检查的示例，与原始查询相比，查询重写并没有提高性能。这种失败是由于在最后的对话回合中切换主题，以及在查询重写时考虑不相关的主题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Third DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering

自引率

0.00%

发文量