Assessing the Limits of Straightforward Models for Nested Named Entity Recognition in Spanish Clinical Narratives

International Workshop on Health Text Mining and Information Analysis Pub Date : 1900-01-01 DOI:10.18653/v1/2022.louhi-1.2

Matías Rojas, C. Carrino, Aitor Gonzalez-Agirre, J. Dunstan, Marta Villegas

{"title":"Assessing the Limits of Straightforward Models for Nested Named Entity Recognition in Spanish Clinical Narratives","authors":"Matías Rojas, C. Carrino, Aitor Gonzalez-Agirre, J. Dunstan, Marta Villegas","doi":"10.18653/v1/2022.louhi-1.2","DOIUrl":null,"url":null,"abstract":"Nested Named Entity Recognition (NER) is an information extraction task that aims to identify entities that may be nested within other entity mentions. Despite the availability of several corpora with nested entities in the Spanish clinical domain, most previous work has overlooked them due to the lack of models and a clear annotation scheme for dealing with the task. To fill this gap, this paper provides an empirical study of straightforward methods for tackling the nested NER task on two Spanish clinical datasets, Clinical Trials, and the Chilean Waiting List. We assess the advantages and limitations of two sequence labeling approaches; one based on Multiple LSTM-CRF architectures and another on Joint labeling models. To better understand the differences between these models, we compute task-specific metrics that adequately measure the ability of models to detect nested entities and perform a fine-grained comparison across models. Our experimental results show that employing domain-specific language models trained from scratch significantly improves the performance obtained with strong domain-specific and general-domain baselines, achieving state-of-the-art results in both datasets. Specifically, we obtained F1 scores of 89.21 and 83.16 in Clinical Trials and the Chilean Waiting List, respectively. Interestingly enough, we observe that the task-specific metrics and analysis properly reflect the limitations of the models when recognizing nested entities. Finally, we perform a case study on an aggregated NER dataset created from several clinical corpora in Spanish. We highlight how entity length and the simultaneous recognition of inner and outer entities are the most critical variables for the nested NER task.","PeriodicalId":448872,"journal":{"name":"International Workshop on Health Text Mining and Information Analysis","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Workshop on Health Text Mining and Information Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2022.louhi-1.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Nested Named Entity Recognition (NER) is an information extraction task that aims to identify entities that may be nested within other entity mentions. Despite the availability of several corpora with nested entities in the Spanish clinical domain, most previous work has overlooked them due to the lack of models and a clear annotation scheme for dealing with the task. To fill this gap, this paper provides an empirical study of straightforward methods for tackling the nested NER task on two Spanish clinical datasets, Clinical Trials, and the Chilean Waiting List. We assess the advantages and limitations of two sequence labeling approaches; one based on Multiple LSTM-CRF architectures and another on Joint labeling models. To better understand the differences between these models, we compute task-specific metrics that adequately measure the ability of models to detect nested entities and perform a fine-grained comparison across models. Our experimental results show that employing domain-specific language models trained from scratch significantly improves the performance obtained with strong domain-specific and general-domain baselines, achieving state-of-the-art results in both datasets. Specifically, we obtained F1 scores of 89.21 and 83.16 in Clinical Trials and the Chilean Waiting List, respectively. Interestingly enough, we observe that the task-specific metrics and analysis properly reflect the limitations of the models when recognizing nested entities. Finally, we perform a case study on an aggregated NER dataset created from several clinical corpora in Spanish. We highlight how entity length and the simultaneous recognition of inner and outer entities are the most critical variables for the nested NER task.

查看原文本刊更多论文

评估西班牙临床叙述中嵌套命名实体识别的直接模型的局限性

嵌套命名实体识别(NER)是一种信息提取任务，旨在识别可能嵌套在其他实体提及中的实体。尽管在西班牙语临床领域有几个具有嵌套实体的语料库，但由于缺乏模型和用于处理任务的清晰注释方案，大多数先前的工作都忽略了它们。为了填补这一空白，本文提供了一项直接方法的实证研究，用于解决两个西班牙临床数据集，临床试验和智利等待名单上嵌套的NER任务。我们评估了两种序列标记方法的优点和局限性;一个基于多个LSTM-CRF架构，另一个基于联合标记模型。为了更好地理解这些模型之间的差异，我们计算了特定于任务的度量，这些度量充分度量了模型检测嵌套实体和跨模型执行细粒度比较的能力。我们的实验结果表明，使用从头开始训练的领域特定语言模型显著提高了在强领域特定基线和通用领域基线下获得的性能，在两个数据集上都获得了最先进的结果。具体而言，我们在临床试验和智利等待名单中分别获得了89.21和83.16的F1分数。有趣的是，我们观察到特定于任务的度量和分析在识别嵌套实体时正确地反映了模型的局限性。最后，我们对由几个西班牙临床语料库创建的聚合NER数据集进行了案例研究。我们强调实体长度和内部和外部实体的同时识别是嵌套NER任务的最关键变量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Workshop on Health Text Mining and Information Analysis

自引率

0.00%

发文量