NLP4ITF @因果新闻语料库2022:利用语言信息进行事件因果分类

The Case manager Pub Date : 2022-01-01 DOI:10.18653/v1/2022.case-1.3

Theresa Krumbiegel, Sophie Decher

{"title":"NLP4ITF @因果新闻语料库2022:利用语言信息进行事件因果分类","authors":"Theresa Krumbiegel, Sophie Decher","doi":"10.18653/v1/2022.case-1.3","DOIUrl":null,"url":null,"abstract":"We present our submission to Subtask 1 of theCASE-2022 Shared Task 3: Event CausalityIdentification with Causal News Corpus as partof the 5th Workshop on Challenges and Applicationsof Automated Extraction of SociopoliticalEvents from Text (CASE 2022) (Tanet al., 2022a). The task focuses on causal eventclassification on the sentence level and involvesdifferentiating between sentences that include acause-effect relation and sentences that do not.We approached this as a binary text classificationtask and experimented with multiple trainingsets augmented with additional linguisticinformation. Our best model was generated bytraining roberta-base on a combination ofdata from both Subtasks 1 and 2 with the additionof named entity annotations. During thedevelopment phase we achieved a macro F1 of0.8641 with this model on the development setprovided by the task organizers. When testingthe model on the final test data, we achieved amacro F1 of 0.8516.","PeriodicalId":80307,"journal":{"name":"The Case manager","volume":"8 1","pages":"16-20"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"NLP4ITF @ Causal News Corpus 2022: Leveraging Linguistic Information for Event Causality Classification\",\"authors\":\"Theresa Krumbiegel, Sophie Decher\",\"doi\":\"10.18653/v1/2022.case-1.3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present our submission to Subtask 1 of theCASE-2022 Shared Task 3: Event CausalityIdentification with Causal News Corpus as partof the 5th Workshop on Challenges and Applicationsof Automated Extraction of SociopoliticalEvents from Text (CASE 2022) (Tanet al., 2022a). The task focuses on causal eventclassification on the sentence level and involvesdifferentiating between sentences that include acause-effect relation and sentences that do not.We approached this as a binary text classificationtask and experimented with multiple trainingsets augmented with additional linguisticinformation. Our best model was generated bytraining roberta-base on a combination ofdata from both Subtasks 1 and 2 with the additionof named entity annotations. During thedevelopment phase we achieved a macro F1 of0.8641 with this model on the development setprovided by the task organizers. When testingthe model on the final test data, we achieved amacro F1 of 0.8516.\",\"PeriodicalId\":80307,\"journal\":{\"name\":\"The Case manager\",\"volume\":\"8 1\",\"pages\":\"16-20\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Case manager\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18653/v1/2022.case-1.3\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Case manager","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2022.case-1.3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

作为第五届“从文本中自动提取社会政治事件的挑战和应用”研讨会(CASE 2022) (Tanet al.， 2022a)的一部分，我们提交了caase -2022共享任务3的子任务1:使用因果新闻语料库识别事件因果关系。该任务侧重于句子层面的因果事件分类，并涉及区分包含因果关系的句子和不包含因果关系的句子。我们将其作为一个二元文本分类任务来处理，并使用多个训练集进行了实验，这些训练集增加了额外的语言信息。我们最好的模型是通过训练roberta-base生成的，该模型是基于子任务1和子任务2的数据组合，并添加了命名实体注释。在开发阶段，我们在任务组织者提供的开发集上使用该模型获得了0.8641的宏F1。在最终测试数据上对模型进行测试时，我们获得了0.8516的宏F1。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

NLP4ITF @ Causal News Corpus 2022: Leveraging Linguistic Information for Event Causality Classification

We present our submission to Subtask 1 of theCASE-2022 Shared Task 3: Event CausalityIdentification with Causal News Corpus as partof the 5th Workshop on Challenges and Applicationsof Automated Extraction of SociopoliticalEvents from Text (CASE 2022) (Tanet al., 2022a). The task focuses on causal eventclassification on the sentence level and involvesdifferentiating between sentences that include acause-effect relation and sentences that do not.We approached this as a binary text classificationtask and experimented with multiple trainingsets augmented with additional linguisticinformation. Our best model was generated bytraining roberta-base on a combination ofdata from both Subtasks 1 and 2 with the additionof named entity annotations. During thedevelopment phase we achieved a macro F1 of0.8641 with this model on the development setprovided by the task organizers. When testingthe model on the final test data, we achieved amacro F1 of 0.8516.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The Case manager

自引率

0.00%

发文量