Pointer-Generator Abstractive Text Summarization Model with Part of Speech Features

Shuxia Ren, Zheming Zhang
{"title":"Pointer-Generator Abstractive Text Summarization Model with Part of Speech Features","authors":"Shuxia Ren, Zheming Zhang","doi":"10.1109/ICSESS47205.2019.9040715","DOIUrl":null,"url":null,"abstract":"The typical sequence-to-sequence with attention mechanism models have achieved good results in the task of abstractive text summarization. However, this kind of models always have some shortcomings: they have out-of-vocabulary (OOV) problems, sometimes may repeat themselves and are always of low quality. In order to solve these problems, we propose a pointer-generator text summarization model with part of speech features. First, we use the word vector and prat of speech vector as the input of the model, and then improve the quality of generated abstracts by combining convolutional neural network (CNN) and bi-directional LSTM. Second, we use pointergenerator network to control whether generating or copying words to solve the problem of OOV. Finally, we use coverage mechanism to monitor the abstract we have generated to avoid duplication problems. Compared with the classic pointergenerator network, the ROUGE scores of our model have greatly improved and the performance on LCSTS dataset is better than the state-of-the-art model at present.","PeriodicalId":203944,"journal":{"name":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS47205.2019.9040715","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The typical sequence-to-sequence with attention mechanism models have achieved good results in the task of abstractive text summarization. However, this kind of models always have some shortcomings: they have out-of-vocabulary (OOV) problems, sometimes may repeat themselves and are always of low quality. In order to solve these problems, we propose a pointer-generator text summarization model with part of speech features. First, we use the word vector and prat of speech vector as the input of the model, and then improve the quality of generated abstracts by combining convolutional neural network (CNN) and bi-directional LSTM. Second, we use pointergenerator network to control whether generating or copying words to solve the problem of OOV. Finally, we use coverage mechanism to monitor the abstract we have generated to avoid duplication problems. Compared with the classic pointergenerator network, the ROUGE scores of our model have greatly improved and the performance on LCSTS dataset is better than the state-of-the-art model at present.
具有词性特征的指针生成器抽象文本摘要模型
典型的带有注意机制的序列到序列模型在抽象文本摘要任务中取得了较好的效果。然而,这种模型总是有一些缺点:它们有词汇外(OOV)问题,有时可能会重复,并且质量总是很低。为了解决这些问题,我们提出了一种具有词性特征的指针生成器文本摘要模型。首先,我们使用单词向量和部分语音向量作为模型的输入,然后将卷积神经网络(CNN)和双向LSTM相结合,提高生成摘要的质量。其次,我们使用点间生成器网络来控制是否生成或复制单词,以解决OOV问题。最后,我们使用覆盖机制来监控我们生成的摘要,以避免重复问题。与经典的指针生成器网络相比,我们的模型的ROUGE分数有了很大的提高,并且在LCSTS数据集上的性能优于目前最先进的模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信