Relation Extraction Based on Prompt Information and Feature Reuse

IF 1.3 3区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Intelligence Pub Date : 2023-02-22 DOI:10.1162/dint_a_00192

Ping Feng, Xin Zhang, Jian Zhao, Yingying Wang, Biao Huang

{"title":"Relation Extraction Based on Prompt Information and Feature Reuse","authors":"Ping Feng, Xin Zhang, Jian Zhao, Yingying Wang, Biao Huang","doi":"10.1162/dint_a_00192","DOIUrl":null,"url":null,"abstract":"ABSTRACT To alleviate the problem of under-utilization features of sentence-level relation extraction, which leads to insufficient performance of the pre-trained language model and underutilization of the feature vector, a sentence-level relation extraction method based on adding prompt information and feature reuse is proposed. At first, in addition to the pair of nominals and sentence information, a piece of prompt information is added, and the overall feature information consists of sentence information, entity pair information, and prompt information, and then the features are encoded by the pre-trained language model ROBERTA. Moreover, in the pre-trained language model, BIGRU is also introduced in the composition of the neural network to extract information, and the feature information is passed through the neural network to form several sets of feature vectors. After that, these feature vectors are reused in different combinations to form multiple outputs, and the outputs are aggregated using ensemble-learning soft voting to perform relation classification. In addition to this, the sum of cross-entropy, KL divergence, and negative log-likelihood loss is used as the final loss function in this paper. In the comparison experiments, the model based on adding prompt information and feature reuse achieved higher results of the SemEval-2010 task 8 relational dataset.","PeriodicalId":34023,"journal":{"name":"Data Intelligence","volume":"5 1","pages":"824-840"},"PeriodicalIF":1.3000,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1162/dint_a_00192","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 1

Abstract

ABSTRACT To alleviate the problem of under-utilization features of sentence-level relation extraction, which leads to insufficient performance of the pre-trained language model and underutilization of the feature vector, a sentence-level relation extraction method based on adding prompt information and feature reuse is proposed. At first, in addition to the pair of nominals and sentence information, a piece of prompt information is added, and the overall feature information consists of sentence information, entity pair information, and prompt information, and then the features are encoded by the pre-trained language model ROBERTA. Moreover, in the pre-trained language model, BIGRU is also introduced in the composition of the neural network to extract information, and the feature information is passed through the neural network to form several sets of feature vectors. After that, these feature vectors are reused in different combinations to form multiple outputs, and the outputs are aggregated using ensemble-learning soft voting to perform relation classification. In addition to this, the sum of cross-entropy, KL divergence, and negative log-likelihood loss is used as the final loss function in this paper. In the comparison experiments, the model based on adding prompt information and feature reuse achieved higher results of the SemEval-2010 task 8 relational dataset.

查看原文本刊更多论文

基于提示信息和特征重用的关系提取

摘要为了解决句子级关系提取中特征利用不足，导致预先训练的语言模型性能不足和特征向量利用不足的问题，提出了一种基于添加提示信息和特征重用的句子级关系抽取方法。首先，除了名词对和句子信息外，还添加了一条提示信息，整体特征信息由句子信息、实体对信息和提示信息组成，然后通过预先训练的语言模型ROBERTA对特征进行编码。此外，在预训练的语言模型中，在神经网络的组成中也引入了BIGRU来提取信息，并将特征信息通过神经网络形成多组特征向量。之后，将这些特征向量以不同的组合重复使用，以形成多个输出，并使用集成学习软投票对输出进行聚合，以执行关系分类。除此之外，本文还使用交叉熵、KL散度和负对数似然损失之和作为最终损失函数。在对比实验中，基于添加提示信息和特征重用的模型在SemEval-2010任务8关系数据集中获得了更高的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊