Measuring Attribution in Natural Language Generation Models

IF 5.3 2区计算机科学

Computational Linguistics Pub Date : 2023-07-07 DOI:10.1162/coli_a_00490

Hannah Rashkin, Vitaly Nikolaev, Matthew Lamm, Lora Aroyo, Michael Collins, Dipanjan Das, Slav Petrov, Gaurav Singh Tomar, Iulia Turc, David Reitter

引用次数: 0

Abstract

Large neural models have brought a new challenge to natural language generation (NLG): it has become imperative to ensure the safety and reliability of the output of models that generate freely. To this end, we present an evaluation framework, Attributable to Identified Sources (AIS), stipulating that NLG output pertaining to the external world is to be verified against an independent, provided source. We define AIS and a two-stage annotation pipeline for allowing annotators to evaluate model output according to annotation guidelines. We successfully validate this approach on generation datasets spanning three tasks (two conversational QA datasets, a summarization dataset, and a table-to-text dataset). We provide full annotation guidelines in the appendices and publicly release the annotated data at https://github.com/google-research-datasets/AIS.

查看原文本刊更多论文

自然语言生成模型中的归因测量

大型神经模型给自然语言生成(NLG)带来了新的挑战:确保自由生成的模型输出的安全性和可靠性已成为当务之急。为此，我们提出了一个评估框架，归属于已识别的来源(AIS)，规定与外部世界有关的NLG输出将根据一个独立的、提供的来源进行验证。我们定义了AIS和一个两阶段的注释管道，允许注释者根据注释指南评估模型输出。我们成功地在跨越三个任务的生成数据集上验证了这种方法(两个会话QA数据集，一个摘要数据集和一个表到文本数据集)。我们在附录中提供了完整的注释指南，并在https://github.com/google-research-datasets/AIS上公开发布了注释数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computational Linguistics Computer Science-Artificial Intelligence

自引率

0.00%

发文量

期刊介绍： Computational Linguistics is the longest-running publication devoted exclusively to the computational and mathematical properties of language and the design and analysis of natural language processing systems. This highly regarded quarterly offers university and industry linguists, computational linguists, artificial intelligence and machine learning investigators, cognitive scientists, speech specialists, and philosophers the latest information about the computational aspects of all the facets of research on language.