基于句子简化过程和实体信息的关系提取

Proceedings. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Pub Date : 2021-12-14 DOI:10.1145/3486622.3494010

M. Parniani, M. Reformat

{"title":"基于句子简化过程和实体信息的关系提取","authors":"M. Parniani, M. Reformat","doi":"10.1145/3486622.3494010","DOIUrl":null,"url":null,"abstract":"Graph-based Knowledge Bases (KBs) are composed of relational facts that can be perceived as two entities, called head and tail, linked via a relation. Processes of constructing KBs, i.e., populating them with such facts, as well as revising and updating them are of special interest. These should be performed automatically, especially in the case when the main sources of facts are textual documents. For this reason, a task of Relation Extraction (RE), i.e., predicting a relation that links two entities mentioned in a sentence, is one of the most important activities. Using RE processes, new relational facts can be extracted, and KBs can be built and updated using unstructured information. In this paper, we propose a novel procedure for RE. It is based on a sentence distilling technique that works on dependency trees and removes noisy tokens from sentences while preserving the most relevant and useful ones. In addition, the proposed procedure utilizes information about types of linked entities, it means types of relations’ heads and tails. Our neural network model using processed and new input information is evaluated on the widely used NYT dataset and compared to other state-of-the-art RE methods. Experimental results show the effectiveness of the proposed procedure against other methods.","PeriodicalId":89230,"journal":{"name":"Proceedings. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":"22 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Relation Extraction with Sentence Simplification Process and Entity Information\",\"authors\":\"M. Parniani, M. Reformat\",\"doi\":\"10.1145/3486622.3494010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Graph-based Knowledge Bases (KBs) are composed of relational facts that can be perceived as two entities, called head and tail, linked via a relation. Processes of constructing KBs, i.e., populating them with such facts, as well as revising and updating them are of special interest. These should be performed automatically, especially in the case when the main sources of facts are textual documents. For this reason, a task of Relation Extraction (RE), i.e., predicting a relation that links two entities mentioned in a sentence, is one of the most important activities. Using RE processes, new relational facts can be extracted, and KBs can be built and updated using unstructured information. In this paper, we propose a novel procedure for RE. It is based on a sentence distilling technique that works on dependency trees and removes noisy tokens from sentences while preserving the most relevant and useful ones. In addition, the proposed procedure utilizes information about types of linked entities, it means types of relations’ heads and tails. Our neural network model using processed and new input information is evaluated on the widely used NYT dataset and compared to other state-of-the-art RE methods. Experimental results show the effectiveness of the proposed procedure against other methods.\",\"PeriodicalId\":89230,\"journal\":{\"name\":\"Proceedings. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology\",\"volume\":\"22 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3486622.3494010\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3486622.3494010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

基于图的知识库(KBs)由关系事实组成，可以将其视为两个实体，称为头和尾，通过关系连接起来。构建KBs的过程，即用这些事实填充它们，以及修改和更新它们的过程是特别有趣的。这些应自动进行，特别是当事实的主要来源是文字文件时。因此，关系提取(RE)任务，即预测句子中提到的两个实体之间的关系，是最重要的活动之一。使用RE过程，可以提取新的关系事实，并且可以使用非结构化信息构建和更新知识库。在本文中，我们提出了一种新的正则化过程。它基于一种基于依赖树的句子提取技术，在保留最相关和最有用的标记的同时，从句子中去除嘈杂的标记。此外，所建议的过程利用了有关链接实体类型的信息，这意味着关系的头和尾的类型。我们的神经网络模型使用处理过的和新的输入信息，在广泛使用的纽约时报数据集上进行评估，并与其他最先进的RE方法进行比较。实验结果表明，该方法与其他方法相比是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Relation Extraction with Sentence Simplification Process and Entity Information

Graph-based Knowledge Bases (KBs) are composed of relational facts that can be perceived as two entities, called head and tail, linked via a relation. Processes of constructing KBs, i.e., populating them with such facts, as well as revising and updating them are of special interest. These should be performed automatically, especially in the case when the main sources of facts are textual documents. For this reason, a task of Relation Extraction (RE), i.e., predicting a relation that links two entities mentioned in a sentence, is one of the most important activities. Using RE processes, new relational facts can be extracted, and KBs can be built and updated using unstructured information. In this paper, we propose a novel procedure for RE. It is based on a sentence distilling technique that works on dependency trees and removes noisy tokens from sentences while preserving the most relevant and useful ones. In addition, the proposed procedure utilizes information about types of linked entities, it means types of relations’ heads and tails. Our neural network model using processed and new input information is evaluated on the widely used NYT dataset and compared to other state-of-the-art RE methods. Experimental results show the effectiveness of the proposed procedure against other methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology

自引率

0.00%

发文量