Knowledge Fragment Cleaning in a Genealogy Knowledge Graph

Guliu Liu, Lei Li
{"title":"Knowledge Fragment Cleaning in a Genealogy Knowledge Graph","authors":"Guliu Liu, Lei Li","doi":"10.1109/ICBK50248.2020.00079","DOIUrl":null,"url":null,"abstract":"As an important topic of artificial intelligence, knowledge graphs have a wide range of applications such as semantic search, intelligent question answering, and visual decision support. Among them, the genealogy knowledge graph, as a kind of domain knowledge graph, has important application value in genetic disease analysis, population behavior analysis, etc. In the case of multiple data sources and multi-person collaboration, the construction of a genealogy knowledge graph involves the techniques of knowledge representation, knowledge acquisition, and knowledge fusion. In the knowledge fusion process, there are many situations such as the lack and chaos of a relationship, redundant entities, the isolation of some entities and knowledge fragments. How to effectively detect and process these problematic knowledge fragments is significant to the construction of a genealogy knowledge graph. In response to this problem, we propose a method for cleaning the problematic knowledge fragments in a genealogy knowledge graph. The method consists of three phases. In phase 1, we propose a method for detecting and analyzing the problematic knowledge fragments. In phase 2, we design a method for supplementing the entity-relationship of people for different error patterns and a method fusion method for the aligned entity. In phase 3, for the cleaning of isolated knowledge fragments, we propose an entity alignment method based on the father-son relationship and people’s names and a connection method of isolated knowledge fragments. Finally, we conduct experiments on a family tree dataset of the Huapu System, and the experimental results indicate the effectiveness and practicality of the method.","PeriodicalId":432857,"journal":{"name":"2020 IEEE International Conference on Knowledge Graph (ICKG)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Knowledge Graph (ICKG)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBK50248.2020.00079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

As an important topic of artificial intelligence, knowledge graphs have a wide range of applications such as semantic search, intelligent question answering, and visual decision support. Among them, the genealogy knowledge graph, as a kind of domain knowledge graph, has important application value in genetic disease analysis, population behavior analysis, etc. In the case of multiple data sources and multi-person collaboration, the construction of a genealogy knowledge graph involves the techniques of knowledge representation, knowledge acquisition, and knowledge fusion. In the knowledge fusion process, there are many situations such as the lack and chaos of a relationship, redundant entities, the isolation of some entities and knowledge fragments. How to effectively detect and process these problematic knowledge fragments is significant to the construction of a genealogy knowledge graph. In response to this problem, we propose a method for cleaning the problematic knowledge fragments in a genealogy knowledge graph. The method consists of three phases. In phase 1, we propose a method for detecting and analyzing the problematic knowledge fragments. In phase 2, we design a method for supplementing the entity-relationship of people for different error patterns and a method fusion method for the aligned entity. In phase 3, for the cleaning of isolated knowledge fragments, we propose an entity alignment method based on the father-son relationship and people’s names and a connection method of isolated knowledge fragments. Finally, we conduct experiments on a family tree dataset of the Huapu System, and the experimental results indicate the effectiveness and practicality of the method.
宗谱知识图中的知识片段清理
知识图作为人工智能的一个重要课题,在语义搜索、智能问答、视觉决策支持等方面有着广泛的应用。其中,家谱知识图作为一种领域知识图,在遗传病分析、群体行为分析等方面具有重要的应用价值。在多数据源、多人协作的情况下,家谱知识图谱的构建涉及到知识表示、知识获取和知识融合技术。在知识融合过程中,存在着关系缺失和混乱、实体冗余、部分实体孤立、知识碎片等多种情况。如何有效地检测和处理这些有问题的知识片段,对构建系谱知识图谱具有重要意义。针对这一问题,提出了一种清除谱系知识图中有问题的知识片段的方法。该方法包括三个阶段。在第一阶段,我们提出了一种检测和分析问题知识片段的方法。在第二阶段,我们设计了一种针对不同错误模式的人的实体关系的补充方法和一种针对对齐实体的方法融合方法。第三阶段,针对孤立知识片段的清理,提出了基于父子关系和人名的实体对齐方法和孤立知识片段的连接方法。最后,在华普系统的家谱数据集上进行了实验,实验结果表明了该方法的有效性和实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信