A Toolbox for the Nearly-Unsupervised Construction of Digital Library Knowledge Graphs

2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL) Pub Date : 2021-09-01 DOI:10.1109/JCDL52503.2021.00014

H. Kroll, Jan Pirklbauer, Wolf-Tilo Balke

引用次数: 7

Abstract

Knowledge graphs are essential for digital libraries to store entity-centric knowledge. The applications of knowledge graphs range from summarizing entity information over answering complex queries to inferring new knowledge. Yet, building knowledge graphs means either relying on manual curation or designing supervised extraction processes to harvest knowledge from unstructured text. Obviously, both approaches are cost-intensive. Yet, the question is whether we can minimize the efforts to build a knowledge graph. And indeed, we propose a toolbox that provides methods to extract knowledge from arbitrary text. Our toolkit bypasses the need for supervision nearly completely and includes a novel algorithm to close the missing gaps. As a practical demonstration, we analyze our toolbox on established biomedical benchmarks. As far as we know, we are the first who propose, analyze and share a nearly unsupervised and complete toolbox for building knowledge graphs from text.

查看原文本刊更多论文

数字图书馆知识图谱近乎无监督构建工具箱

知识图谱是数字图书馆存储以实体为中心的知识所必需的。知识图的应用范围从总结实体信息到回答复杂的查询，再到推断新知识。然而，构建知识图谱意味着要么依靠人工管理，要么设计有监督的提取过程，从非结构化文本中获取知识。显然，这两种方法的成本都很高。然而，问题是我们是否可以最小化构建知识图谱的努力。事实上，我们提出了一个工具箱，提供了从任意文本中提取知识的方法。我们的工具包几乎完全绕过了监督的需要，并包括一种新颖的算法来弥补缺失的差距。作为一个实际的演示，我们在已建立的生物医学基准上分析我们的工具箱。据我们所知，我们是第一个提出、分析和分享一个几乎不受监督的完整工具箱，用于从文本构建知识图谱的人。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)

自引率

0.00%

发文量