Semantic-aware entity alignment for low resource language knowledge graph

IF 4.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science Pub Date : 2023-12-18 DOI:10.1007/s11704-023-2542-x

Junfei Tang, Ran Song, Yuxin Huang, Shengxiang Gao, Zhengtao Yu

{"title":"Semantic-aware entity alignment for low resource language knowledge graph","authors":"Junfei Tang, Ran Song, Yuxin Huang, Shengxiang Gao, Zhengtao Yu","doi":"10.1007/s11704-023-2542-x","DOIUrl":null,"url":null,"abstract":"<p>Entity alignment (EA) is an important technique aiming to find the same real entity between two different source knowledge graphs (KGs). Current methods typically learn the embedding of entities for EA from the structure of KGs for EA. Most EA models are designed for rich-resource languages, requiring sufficient resources such as a parallel corpus and pre-trained language models. However, low-resource language KGs have received less attention, and current models demonstrate poor performance on those low-resource KGs. Recently, researchers have fused relation information and attributes for entity representations to enhance the entity alignment performance, but the relation semantics are often ignored. To address these issues, we propose a novel Semantic-aware Graph Neural Network (SGNN) for entity alignment. First, we generate pseudo sentences according to the relation triples and produce representations using pre-trained models. Second, our approach explores semantic information from the connected relations by a graph neural network. Our model captures expanded feature information from KGs. Experimental results using three low-resource languages demonstrate that our proposed SGNN approach out performs better than state-of-the-art alignment methods on three proposed datasets and three public datasets.</p>","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"108 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers of Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11704-023-2542-x","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Entity alignment (EA) is an important technique aiming to find the same real entity between two different source knowledge graphs (KGs). Current methods typically learn the embedding of entities for EA from the structure of KGs for EA. Most EA models are designed for rich-resource languages, requiring sufficient resources such as a parallel corpus and pre-trained language models. However, low-resource language KGs have received less attention, and current models demonstrate poor performance on those low-resource KGs. Recently, researchers have fused relation information and attributes for entity representations to enhance the entity alignment performance, but the relation semantics are often ignored. To address these issues, we propose a novel Semantic-aware Graph Neural Network (SGNN) for entity alignment. First, we generate pseudo sentences according to the relation triples and produce representations using pre-trained models. Second, our approach explores semantic information from the connected relations by a graph neural network. Our model captures expanded feature information from KGs. Experimental results using three low-resource languages demonstrate that our proposed SGNN approach out performs better than state-of-the-art alignment methods on three proposed datasets and three public datasets.

查看原文本刊更多论文

针对低资源语言知识图谱的语义感知实体对齐

实体对齐（EA）是一项重要技术，旨在从两个不同的源知识图谱（KG）中找到相同的真实实体。目前的方法通常是从用于 EA 的知识图谱结构中学习用于 EA 的实体嵌入。大多数 EA 模型都是为资源丰富的语言设计的，需要足够的资源，如并行语料库和预训练的语言模型。然而，低资源语言的 KGs 受到的关注较少，而且当前的模型在这些低资源 KGs 上表现不佳。最近，研究人员融合了关系信息和实体表征的属性，以提高实体配准性能，但关系语义往往被忽视。为了解决这些问题，我们提出了一种用于实体配准的新型语义感知图神经网络（SGNN）。首先，我们根据关系三元组生成伪句子，并使用预训练模型生成表示。其次，我们的方法通过图神经网络从连接关系中挖掘语义信息。我们的模型可以捕捉 KG 中的扩展特征信息。使用三种低资源语言的实验结果表明，我们提出的 SGNN 方法在三个拟议数据集和三个公开数据集上的表现优于最先进的对齐方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Frontiers of Computer Science COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

8.60

自引率

2.40%

发文量

799

审稿时长

6-12 weeks

期刊介绍： Frontiers of Computer Science aims to provide a forum for the publication of peer-reviewed papers to promote rapid communication and exchange between computer scientists. The journal publishes research papers and review articles in a wide range of topics, including: architecture, software, artificial intelligence, theoretical computer science, networks and communication, information systems, multimedia and graphics, information security, interdisciplinary, etc. The journal especially encourages papers from new emerging and multidisciplinary areas, as well as papers reflecting the international trends of research and development and on special topics reporting progress made by Chinese computer scientists.