Generative adversarial networks for open information extraction

Advances in Computational Intelligence, Pub Date: 2021-08-02, DOI: 10.1007/s43674-021-00006-8

Jiabao Han, Hongzhi Wang

Citations: 2

Abstract

Open information extraction (Open IE) is a core task of natural language processing (NLP). Although many efforts have been made in this area, many problems remain to be tackled. First, conventional Open IE approaches rely on a set of handcrafted patterns to extract relational tuples from the corpus. Second, they employ many NLP tools in their pipelines and therefore suffer from error propagation. To address these problems, and inspired by the recent success of Generative Adversarial Networks (GANs), we employ an adversarial training architecture and name it Adversarial-OIE. In Adversarial-OIE, the training of the Open IE model is assisted by a discriminator, which is a Convolutional Neural Network (CNN) model. The goal of the discriminator is to distinguish the extraction results generated by the Open IE model from the training data. The goal of the Open IE model is to produce high-quality triples that fool the discriminator. A policy gradient method is used to co-train the Open IE model and the discriminator. In particular, an insufficiently trained discriminator usually makes GAN training unstable. To solve this problem, we use distant supervision to generate training data for the Adversarial-OIE model. An empirical study on two large benchmark datasets shows that our approach significantly outperforms many existing baselines.
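As a rough illustration of the co-training loop described in the abstract, the sketch below alternates a discriminator update with a policy gradient (REINFORCE) update of the generator. It is not the authors' implementation. The candidate triples, the tabular generator and discriminator, the learning rates, and the constant 0.5 baseline are all assumptions made for this example; in the paper the discriminator is a CNN and the generator is a neural Open IE model.

```python
import math
import random

# Hypothetical candidate triples for one sentence; index 0 plays the role of
# the distantly supervised "gold" extraction. None of these come from the paper.
CANDIDATES = [
    ("Marie Curie", "was born in", "Warsaw"),
    ("Marie Curie", "born", "in"),
    ("Warsaw", "was born in", "Marie Curie"),
    ("Marie Curie", "was", "Warsaw"),
]
GOLD = 0


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train(steps=2000, lr_g=0.5, lr_d=0.5, seed=0):
    """Co-train a toy generator and discriminator; return (probs, scores)."""
    rng = random.Random(seed)
    n = len(CANDIDATES)
    theta = [0.0] * n  # generator logits (toy stand-in for the Open IE model)
    w = [0.0] * n      # discriminator logits (toy stand-in for the CNN)
    for _ in range(steps):
        probs = softmax(theta)
        # The generator samples one triple from its current policy.
        fake = rng.choices(range(n), weights=probs)[0]

        # Discriminator step: gradient ascent on log D(real) + log(1 - D(fake)).
        d_real, d_fake = sigmoid(w[GOLD]), sigmoid(w[fake])
        w[GOLD] += lr_d * (1.0 - d_real)
        w[fake] -= lr_d * d_fake

        # Generator step (REINFORCE): the reward is the discriminator's score
        # for the sampled triple, minus a constant 0.5 baseline.
        advantage = sigmoid(w[fake]) - 0.5
        for j in range(n):
            grad_log_p = (1.0 if j == fake else 0.0) - probs[j]
            theta[j] += lr_g * advantage * grad_log_p
    return softmax(theta), [sigmoid(v) for v in w]


if __name__ == "__main__":
    probs, scores = train()
    for triple, p, s in zip(CANDIDATES, probs, scores):
        print(triple, round(p, 3), round(s, 3))
```

In this toy setting the discriminator learns to score non-gold candidates below 0.5, so the generator is penalized whenever it samples them and gradually shifts probability toward the gold triple. The sampling step is why a policy gradient is needed: choosing a discrete triple is not differentiable, so the discriminator's score is fed back as a reward rather than backpropagated through the sample.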
