Dependency Graphs for Summarization and Keyphrase Extraction: We present a real-time algorithm for long-document summarization and keyphrase extraction that uses a unified dependency graph.

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval. Pub Date: 2022-12-16. DOI: 10.1145/3582768.3582792

Yifan Guo, David Brock, Alicia Lin, Tam Doan, Ali Khan, Paul Tarau

Citations: 0

Abstract

We introduce a graph-based summarization and keyphrase extraction system that uses dependency trees as inputs for building a document graph. The document graph is built by connecting nodes containing lemmas and sentence identifiers after redirecting dependency links to emphasize semantically important entities. After applying a ranking algorithm to the document graph, we extract the highest ranked sentences as the summary. At the same time, the highest ranked lemmas are aggregated into keyphrases using their context in the dependency graph. Our algorithm specializes in handling long documents, including scientific, technical, legal, and medical documents.
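The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the ranking algorithm, so a plain PageRank power iteration is swapped in; the toy `SENTENCES` data stands in for the output of a real dependency parser; and the edge directions (dependent to head, lemma to and from sentence node) are a simplified guess at the link-redirection step. The function names `build_document_graph`, `pagerank`, and `summarize` are hypothetical. Aggregating top lemmas into multi-word keyphrases is omitted, so the sketch returns single lemmas.

```python
from collections import defaultdict

# Toy input (hypothetical): each sentence is a list of
# (dependent_lemma, head_lemma) pairs, as a dependency parser might produce.
SENTENCES = [
    [("graph", "build"), ("system", "build"), ("document", "graph")],
    [("graph", "rank"), ("algorithm", "rank")],
    [("weather", "be"), ("nice", "be")],
]


def build_document_graph(sentences):
    """Build one graph whose nodes are lemmas (str) and sentence ids (("S", i))."""
    edges = defaultdict(set)
    for sid, deps in enumerate(sentences):
        sent_node = ("S", sid)
        for dep, head in deps:
            # Assumption: a dependent "recommends" its syntactic head.
            edges[dep].add(head)
            # Assumption: lemmas and the sentences containing them
            # recommend each other.
            for lemma in (dep, head):
                edges[lemma].add(sent_node)
                edges[sent_node].add(lemma)
    return dict(edges)


def pagerank(edges, damping=0.85, iterations=50):
    """Plain PageRank by power iteration (a stand-in ranking algorithm)."""
    nodes = list(edges)
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        new = {n: (1.0 - damping) / len(nodes) for n in nodes}
        for n in nodes:
            # A node without out-links spreads its rank over all nodes.
            targets = edges[n] or nodes
            share = damping * rank[n] / len(targets)
            for t in targets:
                new[t] += share
        rank = new
    return rank


def summarize(sentences, k=1, n_keywords=3):
    """Return (ids of the k top-ranked sentences, the n top-ranked lemmas)."""
    rank = pagerank(build_document_graph(sentences))
    sent_nodes = sorted((n for n in rank if isinstance(n, tuple)),
                        key=rank.get, reverse=True)
    lemmas = sorted((n for n in rank if isinstance(n, str)),
                    key=rank.get, reverse=True)
    return [sid for _, sid in sent_nodes[:k]], lemmas[:n_keywords]


if __name__ == "__main__":
    top_sentences, top_lemmas = summarize(SENTENCES, k=1, n_keywords=3)
    print("summary sentence ids:", top_sentences)
    print("top lemmas:", top_lemmas)
```

Because sentences and lemmas live in the same graph, a single ranking pass scores both, which is one plausible reading of the "unified dependency graph" in the title.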


