Faithful Abstractive Summarization via Fact-aware Consistency-constrained Transformer

Proceedings of the 31st ACM International Conference on Information & Knowledge Management Pub Date : 2022-10-17 DOI:10.1145/3511808.3557319

Yuanjie Lyu, Chen Zhu, Tong Xu, Zikai Yin, Enhong Chen

{"title":"Faithful Abstractive Summarization via Fact-aware Consistency-constrained Transformer","authors":"Yuanjie Lyu, Chen Zhu, Tong Xu, Zikai Yin, Enhong Chen","doi":"10.1145/3511808.3557319","DOIUrl":null,"url":null,"abstract":"Abstractive summarization is a classic task in Natural Language Generation (NLG), which aims to produce a concise summary of the original document. Recently, great efforts have been made on sequence-to-sequence neural networks to generate abstractive sum- maries with a high level of fluency. However, prior arts mainly focus on the optimization of token-level likelihood, while the rich semantic information in documents has been largely ignored. In this way, the summarization results could be vulnerable to hallucinations, i.e., the semantic-level inconsistency between a summary and corresponding original document. To deal with this challenge, in this paper, we propose a novel fact-aware abstractive summarization model, named Entity-Relation Pointer Generator Network (ERPGN). Specially, we attempt to formalize the facts in original document as a factual knowledge graph, and then generate the high-quality summary via directly modeling consistency between summary and the factual knowledge graph. To that end, we first leverage two pointer net- work structures to capture the fact in original documents. Then, to enhance the traditional token-level likelihood loss, we design two extra semantic-level losses to measure the disagreement between a summary and facts from its original document. Extensive experi- ments on public datasets demonstrate that our ERPGN framework could outperform both classic abstractive summarization models and the state-of-the-art fact-aware baseline methods, with significant improvement in terms of faithfulness.","PeriodicalId":389624,"journal":{"name":"Proceedings of the 31st ACM International Conference on Information & Knowledge Management","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 31st ACM International Conference on Information & Knowledge Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3511808.3557319","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Abstractive summarization is a classic task in Natural Language Generation (NLG), which aims to produce a concise summary of the original document. Recently, great efforts have been made on sequence-to-sequence neural networks to generate abstractive sum- maries with a high level of fluency. However, prior arts mainly focus on the optimization of token-level likelihood, while the rich semantic information in documents has been largely ignored. In this way, the summarization results could be vulnerable to hallucinations, i.e., the semantic-level inconsistency between a summary and corresponding original document. To deal with this challenge, in this paper, we propose a novel fact-aware abstractive summarization model, named Entity-Relation Pointer Generator Network (ERPGN). Specially, we attempt to formalize the facts in original document as a factual knowledge graph, and then generate the high-quality summary via directly modeling consistency between summary and the factual knowledge graph. To that end, we first leverage two pointer net- work structures to capture the fact in original documents. Then, to enhance the traditional token-level likelihood loss, we design two extra semantic-level losses to measure the disagreement between a summary and facts from its original document. Extensive experi- ments on public datasets demonstrate that our ERPGN framework could outperform both classic abstractive summarization models and the state-of-the-art fact-aware baseline methods, with significant improvement in terms of faithfulness.

查看原文本刊更多论文

基于事实感知的一致性约束转换器的忠实抽象摘要

摘要是自然语言生成(NLG)中的一项经典任务，其目的是生成原始文档的简明摘要。近年来，人们在序列到序列的神经网络上做出了很大的努力，以生成具有高流畅性的抽象求和。然而，现有技术主要集中在标记级似然的优化上，而忽略了文档中丰富的语义信息。这样，摘要结果就容易产生幻觉，即摘要与相应的原始文档在语义层面上不一致。为了应对这一挑战，本文提出了一种新的事实感知抽象摘要模型，称为实体-关系指针生成器网络(ERPGN)。特别地，我们尝试将原始文档中的事实形式化为事实知识图，然后通过直接建模摘要与事实知识图之间的一致性来生成高质量的摘要。为此，我们首先利用两个指针网络结构来捕获原始文档中的事实。然后，为了增强传统的标记级似然损失，我们设计了两个额外的语义级损失来度量摘要与原始文档的事实之间的不一致。在公共数据集上进行的大量实验表明，我们的ERPGN框架可以优于经典的抽象摘要模型和最先进的事实感知基线方法，在可信度方面有显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 31st ACM International Conference on Information & Knowledge Management

自引率

0.00%

发文量