LLMs for knowledge graph construction and reasoning: recent capabilities and future opportunities

World Wide Web Pub Date : 2024-08-21 DOI:10.1007/s11280-024-01297-w

Yuqi Zhu, Xiaohan Wang, Jing Chen, Shuofei Qiao, Yixin Ou, Yunzhi Yao, Shumin Deng, Huajun Chen, Ningyu Zhang

{"title":"LLMs for knowledge graph construction and reasoning: recent capabilities and future opportunities","authors":"Yuqi Zhu, Xiaohan Wang, Jing Chen, Shuofei Qiao, Yixin Ou, Yunzhi Yao, Shumin Deng, Huajun Chen, Ningyu Zhang","doi":"10.1007/s11280-024-01297-w","DOIUrl":null,"url":null,"abstract":"<p>This paper presents an exhaustive quantitative and qualitative evaluation of Large Language Models (LLMs) for Knowledge Graph (KG) construction and reasoning. We engage in experiments across eight diverse datasets, focusing on four representative tasks encompassing entity and relation extraction, event extraction, link prediction, and question-answering, thereby thoroughly exploring LLMs’ performance in the domain of construction and inference. Empirically, our findings suggest that LLMs, represented by GPT-4, are more suited as inference assistants rather than few-shot information extractors. Specifically, while GPT-4 exhibits good performance in tasks related to KG construction, it excels further in reasoning tasks, surpassing fine-tuned models in certain cases. Moreover, our investigation extends to the potential generalization ability of LLMs for information extraction, leading to the proposition of a Virtual Knowledge Extraction task and the development of the corresponding VINE dataset. Based on these empirical findings, we further propose <b>AutoKG</b>, a multi-agent-based approach employing LLMs and external sources for KG construction and reasoning. We anticipate that this research can provide invaluable insights for future undertakings in the field of knowledge graphs.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":"44 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Wide Web","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11280-024-01297-w","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This paper presents an exhaustive quantitative and qualitative evaluation of Large Language Models (LLMs) for Knowledge Graph (KG) construction and reasoning. We engage in experiments across eight diverse datasets, focusing on four representative tasks encompassing entity and relation extraction, event extraction, link prediction, and question-answering, thereby thoroughly exploring LLMs’ performance in the domain of construction and inference. Empirically, our findings suggest that LLMs, represented by GPT-4, are more suited as inference assistants rather than few-shot information extractors. Specifically, while GPT-4 exhibits good performance in tasks related to KG construction, it excels further in reasoning tasks, surpassing fine-tuned models in certain cases. Moreover, our investigation extends to the potential generalization ability of LLMs for information extraction, leading to the proposition of a Virtual Knowledge Extraction task and the development of the corresponding VINE dataset. Based on these empirical findings, we further propose AutoKG, a multi-agent-based approach employing LLMs and external sources for KG construction and reasoning. We anticipate that this research can provide invaluable insights for future undertakings in the field of knowledge graphs.

Abstract Image

查看原文本刊更多论文

用于知识图谱构建和推理的 LLM：最新能力和未来机遇

本文对用于知识图谱（KG）构建和推理的大型语言模型（LLM）进行了详尽的定量和定性评估。我们在八个不同的数据集上进行了实验，重点关注四个具有代表性的任务，包括实体和关系提取、事件提取、链接预测和问题解答，从而全面探索 LLM 在构建和推理领域的性能。实证研究结果表明，以 GPT-4 为代表的 LLM 更适合作为推理助手，而不是少量信息提取器。具体来说，虽然 GPT-4 在与 KG 构建相关的任务中表现出色，但在推理任务中却更胜一筹，在某些情况下甚至超过了微调模型。此外，我们的研究还扩展到了 LLM 在信息提取方面的潜在泛化能力，从而提出了虚拟知识提取任务，并开发了相应的 VINE 数据集。基于这些实证研究结果，我们进一步提出了 AutoKG，这是一种基于多机器人的方法，利用 LLMs 和外部资源进行 KG 构建和推理。我们期待这项研究能为知识图谱领域未来的工作提供宝贵的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

World Wide Web

自引率

0.00%

发文量