{"title":"Document level Relationship Extraction based on context feature enhancement","authors":"Nan Zhang, Ziming Cui, Qiang Cai","doi":"10.1016/j.patrec.2025.07.006","DOIUrl":null,"url":null,"abstract":"<div><div>Document level Relationship Extraction (DocRE) tasks aim to extract relationships between multiple entities from long texts. However, obtaining feature representations for entity pairs that span multiple sentences is a challenge. Additionally, the feature information for triplets depends on both intra-document and inter-sentence information. To address this issue, this paper proposes a model named Plus-DocRE for DocRE(PDRE). Firstly, we introduce entity segmentation based on spans to increase the potential number of entities and improve negative sample recognition. Secondly, we utilize the BERT pre-trained model to obtain paragraph and local context information, enriching the features of entity pairs. Finally, through linear layers and self-attention mechanisms, we fuse the features of local and paragraph context for multi-label relationship classification, enabling entity relationship extraction. Meanwhile, we introduce a new data mechanism, C-DocRE, to simulate a more realistic scenario with annotation errors. Experimental results show that the PDRE model outperforms other baseline models in performance, achieving an F1 score of 53.6.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"197 ","pages":"Pages 24-30"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525002582","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Document level Relationship Extraction (DocRE) tasks aim to extract relationships between multiple entities from long texts. However, obtaining feature representations for entity pairs that span multiple sentences is a challenge. Additionally, the feature information for triplets depends on both intra-document and inter-sentence information. To address this issue, this paper proposes a model named Plus-DocRE for DocRE(PDRE). Firstly, we introduce entity segmentation based on spans to increase the potential number of entities and improve negative sample recognition. Secondly, we utilize the BERT pre-trained model to obtain paragraph and local context information, enriching the features of entity pairs. Finally, through linear layers and self-attention mechanisms, we fuse the features of local and paragraph context for multi-label relationship classification, enabling entity relationship extraction. Meanwhile, we introduce a new data mechanism, C-DocRE, to simulate a more realistic scenario with annotation errors. Experimental results show that the PDRE model outperforms other baseline models in performance, achieving an F1 score of 53.6.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.