Document Summarization with Latent Queries

IF 4.2 1区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Transactions of the Association for Computational Linguistics Pub Date : 2022-05-01 DOI:10.1162/tacl_a_00480

Yumo Xu, Mirella Lapata

引用次数: 15

Abstract

The availability of large-scale datasets has driven the development of neural models that create generic summaries for single or multiple documents. For query-focused summarization (QFS), labeled training data in the form of queries, documents, and summaries is not readily available. We provide a unified modeling framework for any kind of summarization, under the assumption that all summaries are a response to a query, which is observed in the case of QFS and latent in the case of generic summarization. We model queries as discrete latent variables over document tokens, and learn representations compatible with observed and unobserved query verbalizations. Our framework formulates summarization as a generative process, and jointly optimizes a latent query model and a conditional language model. Despite learning from generic summarization data only, our approach outperforms strong comparison systems across benchmarks, query types, document settings, and target domains.1

查看原文本刊更多论文

具有潜在查询的文档摘要

大规模数据集的可用性推动了神经模型的发展，该模型为单个或多个文档创建通用摘要。对于以查询为中心的摘要（QFS），以查询、文档和摘要形式标记的训练数据并不容易获得。我们为任何类型的摘要提供了一个统一的建模框架，假设所有摘要都是对查询的响应，这在QFS的情况下是观察到的，在通用摘要的情况下则是潜在的。我们将查询建模为文档标记上的离散潜在变量，并学习与观察到和未观察到的查询语句兼容的表示。我们的框架将摘要表述为一个生成过程，并联合优化了一个潜在查询模型和一个条件语言模型。尽管我们只从通用摘要数据中学习，但我们的方法在基准测试、查询类型、文档设置和目标域方面都优于强大的比较系统。1

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Transactions of the Association for Computational Linguistics Multiple-

CiteScore

32.60

自引率

4.60%

发文量

审稿时长

8 weeks

期刊介绍： The highly regarded quarterly journal Computational Linguistics has a companion journal called Transactions of the Association for Computational Linguistics. This open access journal publishes articles in all areas of natural language processing and is an important resource for academic and industry computational linguists, natural language processing experts, artificial intelligence and machine learning investigators, cognitive scientists, speech specialists, as well as linguists and philosophers. The journal disseminates work of vital relevance to these professionals on an annual basis.