{"title":"Automatic Construction of a Domain-specific Knowledge Graph for Chinese Patent Based on Information Extraction","authors":"Mingye Wang, Xiaohui Hu, Pan Xie, Yao Du","doi":"10.1109/ICMSSE53595.2021.00008","DOIUrl":null,"url":null,"abstract":"Knowledge graph has been proved as an effective tool in diverse domains including patent service. However, despite partial structure of data, most patent information lies in unstructured data like abstract text, from which it is relatively difficult to build a knowledge graph, since traditional methods rely on predefined human-annotated resources of entities and their relationships. Furthermore, the difficulty in Chinese language processing worsens the problem. This paper proposed an unsupervised method to automatically construct a Chinese patent knowledge graph without pre-built dataset. This research first builds a basic graph frame using structured patent data. Then the proposed method mainly utilizes keyphrase extraction algorithm to find patent properties and semantic role labeling method to dig deeper relations. The experimental result proves the effectiveness of the proposed method.","PeriodicalId":331570,"journal":{"name":"2021 International Conference on Management Science and Software Engineering (ICMSSE)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Management Science and Software Engineering (ICMSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMSSE53595.2021.00008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Knowledge graph has been proved as an effective tool in diverse domains including patent service. However, despite partial structure of data, most patent information lies in unstructured data like abstract text, from which it is relatively difficult to build a knowledge graph, since traditional methods rely on predefined human-annotated resources of entities and their relationships. Furthermore, the difficulty in Chinese language processing worsens the problem. This paper proposed an unsupervised method to automatically construct a Chinese patent knowledge graph without pre-built dataset. This research first builds a basic graph frame using structured patent data. Then the proposed method mainly utilizes keyphrase extraction algorithm to find patent properties and semantic role labeling method to dig deeper relations. The experimental result proves the effectiveness of the proposed method.