Pointer-Generator Abstractive Text Summarization Model with Part of Speech Features

2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS) Pub Date : 2019-10-01 DOI:10.1109/ICSESS47205.2019.9040715

Shuxia Ren, Zheming Zhang

{"title":"Pointer-Generator Abstractive Text Summarization Model with Part of Speech Features","authors":"Shuxia Ren, Zheming Zhang","doi":"10.1109/ICSESS47205.2019.9040715","DOIUrl":null,"url":null,"abstract":"The typical sequence-to-sequence with attention mechanism models have achieved good results in the task of abstractive text summarization. However, this kind of models always have some shortcomings: they have out-of-vocabulary (OOV) problems, sometimes may repeat themselves and are always of low quality. In order to solve these problems, we propose a pointer-generator text summarization model with part of speech features. First, we use the word vector and prat of speech vector as the input of the model, and then improve the quality of generated abstracts by combining convolutional neural network (CNN) and bi-directional LSTM. Second, we use pointergenerator network to control whether generating or copying words to solve the problem of OOV. Finally, we use coverage mechanism to monitor the abstract we have generated to avoid duplication problems. Compared with the classic pointergenerator network, the ROUGE scores of our model have greatly improved and the performance on LCSTS dataset is better than the state-of-the-art model at present.","PeriodicalId":203944,"journal":{"name":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS47205.2019.9040715","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The typical sequence-to-sequence with attention mechanism models have achieved good results in the task of abstractive text summarization. However, this kind of models always have some shortcomings: they have out-of-vocabulary (OOV) problems, sometimes may repeat themselves and are always of low quality. In order to solve these problems, we propose a pointer-generator text summarization model with part of speech features. First, we use the word vector and prat of speech vector as the input of the model, and then improve the quality of generated abstracts by combining convolutional neural network (CNN) and bi-directional LSTM. Second, we use pointergenerator network to control whether generating or copying words to solve the problem of OOV. Finally, we use coverage mechanism to monitor the abstract we have generated to avoid duplication problems. Compared with the classic pointergenerator network, the ROUGE scores of our model have greatly improved and the performance on LCSTS dataset is better than the state-of-the-art model at present.

查看原文本刊更多论文

具有词性特征的指针生成器抽象文本摘要模型

典型的带有注意机制的序列到序列模型在抽象文本摘要任务中取得了较好的效果。然而，这种模型总是有一些缺点:它们有词汇外(OOV)问题，有时可能会重复，并且质量总是很低。为了解决这些问题，我们提出了一种具有词性特征的指针生成器文本摘要模型。首先，我们使用单词向量和部分语音向量作为模型的输入，然后将卷积神经网络(CNN)和双向LSTM相结合，提高生成摘要的质量。其次，我们使用点间生成器网络来控制是否生成或复制单词，以解决OOV问题。最后，我们使用覆盖机制来监控我们生成的摘要，以避免重复问题。与经典的指针生成器网络相比，我们的模型的ROUGE分数有了很大的提高，并且在LCSTS数据集上的性能优于目前最先进的模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)

自引率

0.00%

发文量