{"title":"一种用于N-Ary文档级关系提取的强化学习框架","authors":"Chenhan Yuan;Ryan Rossi;Andrew Katz;Hoda Eldardiry","doi":"10.1109/TBDATA.2024.3410099","DOIUrl":null,"url":null,"abstract":"Knowledge Bases (KBs) have become more complex because some facts in KBs include more than two entities. The construction and completion of these KBs require a new relation extraction task to retrieve complex facts from the text. To address this issue, we present a new N-ary Document-Level relation extraction task that involves extracting relations that 1) include an arbitrary number of entities, and 2) can span multiple sentences within a document. This new task requires inferring relation labels and entity completeness, i.e., whether the entities in the document are (insufficient to describe the relation. We propose a reinforcement learning-based relation classifier training framework that can adapt most existing binary document-level relation extractors to this task. Extensive experimental evaluation demonstrates that our proposed framework is effective in reducing the impact of noise introduced by distant supervision or unrelated sentences in the document.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"11 2","pages":"512-523"},"PeriodicalIF":7.5000,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Reinforcement Learning Framework for N-Ary Document-Level Relation Extraction\",\"authors\":\"Chenhan Yuan;Ryan Rossi;Andrew Katz;Hoda Eldardiry\",\"doi\":\"10.1109/TBDATA.2024.3410099\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Knowledge Bases (KBs) have become more complex because some facts in KBs include more than two entities. The construction and completion of these KBs require a new relation extraction task to retrieve complex facts from the text. To address this issue, we present a new N-ary Document-Level relation extraction task that involves extracting relations that 1) include an arbitrary number of entities, and 2) can span multiple sentences within a document. This new task requires inferring relation labels and entity completeness, i.e., whether the entities in the document are (insufficient to describe the relation. We propose a reinforcement learning-based relation classifier training framework that can adapt most existing binary document-level relation extractors to this task. Extensive experimental evaluation demonstrates that our proposed framework is effective in reducing the impact of noise introduced by distant supervision or unrelated sentences in the document.\",\"PeriodicalId\":13106,\"journal\":{\"name\":\"IEEE Transactions on Big Data\",\"volume\":\"11 2\",\"pages\":\"512-523\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2024-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Big Data\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10549760/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Big Data","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10549760/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
A Reinforcement Learning Framework for N-Ary Document-Level Relation Extraction
Knowledge Bases (KBs) have become more complex because some facts in KBs include more than two entities. The construction and completion of these KBs require a new relation extraction task to retrieve complex facts from the text. To address this issue, we present a new N-ary Document-Level relation extraction task that involves extracting relations that 1) include an arbitrary number of entities, and 2) can span multiple sentences within a document. This new task requires inferring relation labels and entity completeness, i.e., whether the entities in the document are (insufficient to describe the relation. We propose a reinforcement learning-based relation classifier training framework that can adapt most existing binary document-level relation extractors to this task. Extensive experimental evaluation demonstrates that our proposed framework is effective in reducing the impact of noise introduced by distant supervision or unrelated sentences in the document.
期刊介绍:
The IEEE Transactions on Big Data publishes peer-reviewed articles focusing on big data. These articles present innovative research ideas and application results across disciplines, including novel theories, algorithms, and applications. Research areas cover a wide range, such as big data analytics, visualization, curation, management, semantics, infrastructure, standards, performance analysis, intelligence extraction, scientific discovery, security, privacy, and legal issues specific to big data. The journal also prioritizes applications of big data in fields generating massive datasets.