Word concept extraction using HOSVD for automatic text summarization

2015 AI & Robotics (IRANOPEN) Pub Date : 2015-04-12 DOI:10.1109/RIOS.2015.7270733

Atiyeh Biyabangard, M. S. Abadeh

引用次数: 1

Abstract

Computers understand little about the meaning of human language. Vector space models of semantics are beginning to overcome these limits. In this regard, one of the modern issues is using high dimensional data, which is formulated as tensors. Also, due to the increased information and texts, automatic text summarization has become one of the most important issues in information retrieval and natural language processing. In this paper, we propose a new method, using higher-order singular value decomposition (HOSVD) for extracting the concept of the words from word-document-time three-dimensional tensor and then select important sentences with more cosine similarity to this concept. In the following, we measure WordNet-based semantic similarity between sentences and remove redundancy sentences with less importance. The evaluation of the proposed method is done using the ROUGE evaluation on the DUC 2007 standard data set that the obtained results indicate the predominance of our method over many dominant systems.

查看原文本刊更多论文

使用HOSVD进行自动文本摘要的词概念提取

计算机对人类语言的含义理解得很少。语义的向量空间模型开始克服这些限制。在这方面，现代问题之一是使用高维数据，它被表述为张量。此外，由于信息和文本的增加，自动文本摘要已成为信息检索和自然语言处理的重要问题之一。本文提出了一种新的方法，利用高阶奇异值分解(HOSVD)从word-document-time三维张量中提取词的概念，然后选择与该概念余弦相似度较高的重要句子。在下面，我们测量基于wordnet的句子之间的语义相似度，并删除不太重要的冗余句子。对所提出的方法的评估是使用DUC 2007标准数据集上的ROUGE评估完成的，所获得的结果表明我们的方法优于许多优势系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 AI & Robotics (IRANOPEN)

自引率

0.00%

发文量