Segmentation-aware relational graph convolutional network with multi-layer CRF for nested named entity recognition

IF 5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Complex & Intelligent Systems Pub Date : 2024-08-10 DOI:10.1007/s40747-024-01551-8

Daojun Han, Zemin Wang, Yunsong Li, Xiangbo ma, Juntao Zhang

{"title":"Segmentation-aware relational graph convolutional network with multi-layer CRF for nested named entity recognition","authors":"Daojun Han, Zemin Wang, Yunsong Li, Xiangbo ma, Juntao Zhang","doi":"10.1007/s40747-024-01551-8","DOIUrl":null,"url":null,"abstract":"Named Entity Recognition (NER) is fundamental in natural language processing, involving identifying entity spans and types within a sentence. Nested NER contains other entities, which pose a significant challenge, especially pronounced in the domain of medical-named entities due to intricate nesting patterns inherent in medical terminology. Existing studies can not capture interdependencies among different entity categories, resulting in inadequate performance in nested NER tasks. To address this problem, we propose a novel Layer-based architecture with Segmentation-aware Relational Graph Convolutional Network (LSRGCN) for Nested NER in the medical domain. LSRGCN comprises two key modules: a shared segmentation-aware encoder and a multi-layer conditional random field decoder. The former part provides token representation including boundary information from sentence segmentation. The latter part can learn the connections between different entity classes and improve recognition accuracy through secondary decoding. We conduct experiments on four datasets. Experimental results demonstrate the effectiveness of our model. Additionally, extensive studies are conducted to enhance our understanding of the model and its capabilities.","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"36 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-024-01551-8","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Named Entity Recognition (NER) is fundamental in natural language processing, involving identifying entity spans and types within a sentence. Nested NER contains other entities, which pose a significant challenge, especially pronounced in the domain of medical-named entities due to intricate nesting patterns inherent in medical terminology. Existing studies can not capture interdependencies among different entity categories, resulting in inadequate performance in nested NER tasks. To address this problem, we propose a novel Layer-based architecture with Segmentation-aware Relational Graph Convolutional Network (LSRGCN) for Nested NER in the medical domain. LSRGCN comprises two key modules: a shared segmentation-aware encoder and a multi-layer conditional random field decoder. The former part provides token representation including boundary information from sentence segmentation. The latter part can learn the connections between different entity classes and improve recognition accuracy through secondary decoding. We conduct experiments on four datasets. Experimental results demonstrate the effectiveness of our model. Additionally, extensive studies are conducted to enhance our understanding of the model and its capabilities.

Abstract Image

查看原文本刊更多论文

用于嵌套命名实体识别的分段感知关系图卷积网络与多层 CRF

命名实体识别（NER）是自然语言处理的基础，涉及识别句子中的实体跨度和类型。嵌套 NER 包含其他实体，这构成了巨大的挑战，尤其是在医学命名实体领域，由于医学术语固有的复杂嵌套模式，这种挑战尤为明显。现有研究无法捕捉不同实体类别之间的相互依赖关系，导致嵌套 NER 任务的性能不足。为解决这一问题，我们提出了一种基于层的新型架构，该架构具有分段感知关系图卷积网络（LSRGCN），适用于医学领域的嵌套式 NER。LSRGCN 包括两个关键模块：共享分割感知编码器和多层条件随机场解码器。前者提供标记表示，包括来自句子分割的边界信息。后一部分可以学习不同实体类别之间的联系，并通过二次解码提高识别准确率。我们在四个数据集上进行了实验。实验结果证明了我们模型的有效性。此外，我们还进行了大量研究，以加深对模型及其功能的理解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Complex & Intelligent Systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

9.60

自引率

10.30%

发文量

297

期刊介绍： Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.