{"title":"Effective Piecewise CNN with Attention Mechanism for Distant Supervision on Relation Extraction Task","authors":"Yuming Li, Pin Ni, Gangmin Li, Victor I. Chang","doi":"10.5220/0009582700530060","DOIUrl":null,"url":null,"abstract":"Relation Extraction is an important sub-task in the field of information extraction. Its goal is to identify entities from text and extract semantic relationships between entities. However, the current Relationship Extraction task based on deep learning methods generally have practical problems such as insufficient amount of manually labeled data, so training under weak supervision has become a big challenge. Distant Supervision is a novel idea that can automatically annotate a large number of unlabeled data based on a small amount of labeled data. Based on this idea, this paper proposes a method combining the Piecewise Convolutional Neural Networks and Attention mechanism for automatically annotating the data of Relation Extraction task. The experiments proved that the proposed method achieved the highest precision is 76.24% on NYT-FB (New York Times Freebase) dataset (top 100 relation categories). The results show that the proposed method performed better than CNN-based models in most cases.","PeriodicalId":414016,"journal":{"name":"International Conference on Complex Information Systems","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Complex Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0009582700530060","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Relation Extraction is an important sub-task in the field of information extraction. Its goal is to identify entities from text and extract semantic relationships between entities. However, the current Relationship Extraction task based on deep learning methods generally have practical problems such as insufficient amount of manually labeled data, so training under weak supervision has become a big challenge. Distant Supervision is a novel idea that can automatically annotate a large number of unlabeled data based on a small amount of labeled data. Based on this idea, this paper proposes a method combining the Piecewise Convolutional Neural Networks and Attention mechanism for automatically annotating the data of Relation Extraction task. The experiments proved that the proposed method achieved the highest precision is 76.24% on NYT-FB (New York Times Freebase) dataset (top 100 relation categories). The results show that the proposed method performed better than CNN-based models in most cases.
关系抽取是信息抽取领域的一个重要子任务。它的目标是从文本中识别实体,并提取实体之间的语义关系。然而,目前基于深度学习方法的关系抽取任务普遍存在人工标记数据量不足等实际问题,因此弱监督下的训练成为一个很大的挑战。远程监督是一种基于少量标注数据自动标注大量未标注数据的新思路。基于这一思想,本文提出了一种结合分段卷积神经网络和注意机制的关系抽取任务数据自动标注方法。实验证明,该方法在NYT-FB (New York Times Freebase)数据集(前100个关系类别)上达到了76.24%的最高准确率。结果表明,在大多数情况下,该方法优于基于cnn的模型。