{"title":"Position-aware end-to-end cross-document event coreference resolution for Dutch","authors":"Loic De Langhe, Orphée De Clercq, Veronique Hoste","doi":"10.1016/j.nlp.2025.100184","DOIUrl":null,"url":null,"abstract":"<div><div>Natural language understanding entails the ability to comprehend the relations between various people, objects or events throughout one, or multiple, text(s). Event coreference resolution (ECR) is a discourse-based natural language processing (NLP) task which aims to link those textual events, be they real or fictional, that refer to the same conceptual event. In this paper, we introduce a novel end-to-end approach for cross-document ECR which combines expert-level positional knowledge and graph-based representations in order to create a memory-efficient and accurate system meant for the detection and resolution of events in large document collections. We make three fundamental architectural changes to a current state-of-the-art cross-document ECR system and show that our approach outperforms this earlier model (+ 4% CONLL F1) on a large Dutch ECR dataset. Moreover, we show through in-depth qualitative and quantitative analysis that our proposed approach consistently detects more relevant events and suffers notably less from the typical issues models exhibit when predicting coreference chains.</div></div>","PeriodicalId":100944,"journal":{"name":"Natural Language Processing Journal","volume":"13 ","pages":"Article 100184"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Language Processing Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949719125000603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Natural language understanding entails the ability to comprehend the relations between various people, objects or events throughout one, or multiple, text(s). Event coreference resolution (ECR) is a discourse-based natural language processing (NLP) task which aims to link those textual events, be they real or fictional, that refer to the same conceptual event. In this paper, we introduce a novel end-to-end approach for cross-document ECR which combines expert-level positional knowledge and graph-based representations in order to create a memory-efficient and accurate system meant for the detection and resolution of events in large document collections. We make three fundamental architectural changes to a current state-of-the-art cross-document ECR system and show that our approach outperforms this earlier model (+ 4% CONLL F1) on a large Dutch ECR dataset. Moreover, we show through in-depth qualitative and quantitative analysis that our proposed approach consistently detects more relevant events and suffers notably less from the typical issues models exhibit when predicting coreference chains.