What GPT Knows About Who is Who

First Workshop on Insights from Negative Results in NLP. Pub Date: 2022-05-16. DOI: 10.48550/arXiv.2205.07407

Xiaohan Yang, Eduardo Peynetti, Vasco Meerman, Christy Tanner

Citations: 6

Abstract

Coreference resolution – which is a crucial task for understanding discourse and language at large – has yet to witness widespread benefits from large language models (LLMs). Moreover, coreference resolution systems largely rely on supervised labels, which are highly expensive and difficult to annotate, thus making it ripe for prompt engineering. In this paper, we introduce a QA-based prompt-engineering method and discern generative, pre-trained LLMs’ abilities and limitations toward the task of coreference resolution. Our experiments show that GPT-2 and GPT-Neo can return valid answers, but that their capabilities to identify coreferent mentions are limited and prompt-sensitive, leading to inconsistent results.
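To make the idea of a QA-based prompt concrete, here is a minimal sketch of how a coreference query could be phrased as a question and how a model's free-form output could be checked for validity. The template wording, the yes/no answer format, and the function names (`format_example`, `build_prompt`, `parse_answer`) are all assumptions made for illustration; the abstract does not specify the prompts the authors used. No model is called here; the resulting string would be passed to a generative LM such as GPT-2 or GPT-Neo.

```python
# Hypothetical QA-style prompt construction for coreference resolution.
# The template and the yes/no answer format are illustrative assumptions,
# not the prompts used in the paper.

def format_example(passage, mention, candidate, answer=None):
    """Render one query; `answer` is filled in only for few-shot demonstrations."""
    question = f'In the passage above, does "{mention}" refer to "{candidate}"?'
    text = f"Passage: {passage}\nQuestion: {question}\nAnswer:"
    if answer is not None:
        text += f" {answer}"
    return text


def build_prompt(demos, query):
    """Concatenate answered demonstrations, then the unanswered query."""
    blocks = [format_example(*demo) for demo in demos]
    blocks.append(format_example(*query))
    return "\n\n".join(blocks)


def parse_answer(generated):
    """Map model output to True (coreferent), False (not), or None (invalid)."""
    words = generated.strip().lower().split()
    if not words:
        return None
    first = words[0].strip(".,!?:;\"'")
    if first == "yes":
        return True
    if first == "no":
        return False
    return None


demos = [("Anna said she was tired.", "she", "Anna", "Yes")]
query = ("Tom met Bob after he left.", "he", "Bob")
prompt = build_prompt(demos, query)
```

Separating "valid" from "correct" in `parse_answer` mirrors the distinction the abstract draws: a model may return a well-formed answer without reliably identifying coreferent mentions. Rewording the question template in `format_example` is one simple way to probe the prompt sensitivity the abstract reports.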

