Extractive Document Summarization Using a Supervised Learning Approach

2018 IEEE Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER) Pub Date : 2018-08-01 DOI:10.1109/DISCOVER.2018.8674133

S. Charitha, Nagaratna B. Chittaragi, S. Koolagudi

引用次数: 8

Abstract

In this paper, we present a model for extractive multi-document text summarization using a supervised learning approach. The model uses a convolutional neural networks (CNN) which is capable of learning sentence features on its own for sentence ranking. This approach has been used in order to avoid the overhead of extracting features from sentences manually. Integer linear programming (ILP) approach has been adopted for selecting sentences to generate the summary based on sentence ranks. This ILP model minimizes the redundancy in the generated summary. We have evaluated our proposed approach on the DUC 2007 dataset and its performance is found to be competitive or better in comparison with state-of-the-art systems.

查看原文本刊更多论文

使用监督学习方法提取文档摘要

在本文中，我们提出了一个使用监督学习方法提取多文档文本摘要的模型。该模型使用卷积神经网络(CNN)，该网络能够自行学习句子特征进行句子排序。使用这种方法是为了避免手动从句子中提取特征的开销。采用整数线性规划(ILP)方法选择基于句子排名的句子生成摘要。这个ILP模型最小化了生成摘要中的冗余。我们在DUC 2007数据集上评估了我们提出的方法，发现其性能与最先进的系统相比具有竞争力或更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)

自引率

0.00%

发文量