{"title":"局部度量 NER:从多标签角度看命名实体识别的新范式","authors":"Zaifeng Hua, Yifei Chen","doi":"10.1016/j.knosys.2024.112686","DOIUrl":null,"url":null,"abstract":"<div><div>As the field of Nested Named Entity Recognition (NNER) advances, it is marked by a growing complexity due to the increasing number of multi-label entity instances. How to more effectively identify multi-label entities and explore the correlation between labels is the focus of our work. Unlike previous models that are modeled in single-label multi-classification problems, we propose a novel multi-label local metric NER model to rethink Nested Entity Recognition from a multi-label perspective. Simultaneously, to address the significant sample imbalance problem commonly encountered in multi-label scenarios, we introduce a parts-of-speech-based strategy that significantly improves the model’s performance on imbalanced datasets. Experiments on nested, multi-label, and flat datasets verify the generalization and superiority of our model, with results surpassing the existing state-of-the-art (SOTA) on several multi-label and flat benchmarks. After a series of experimental analyses, we highlight the persistent challenges in the multi-label NER. We are hopeful that the insights derived from our work will not only provide new perspectives on the nested NER landscape but also contribute to the ongoing momentum necessary for advancing research in the field of multi-label NER.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"305 ","pages":"Article 112686"},"PeriodicalIF":7.2000,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Local Metric NER: A new paradigm for named entity recognition from a multi-label perspective\",\"authors\":\"Zaifeng Hua, Yifei Chen\",\"doi\":\"10.1016/j.knosys.2024.112686\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As the field of Nested Named Entity Recognition (NNER) advances, it is marked by a growing complexity due to the increasing number of multi-label entity instances. How to more effectively identify multi-label entities and explore the correlation between labels is the focus of our work. Unlike previous models that are modeled in single-label multi-classification problems, we propose a novel multi-label local metric NER model to rethink Nested Entity Recognition from a multi-label perspective. Simultaneously, to address the significant sample imbalance problem commonly encountered in multi-label scenarios, we introduce a parts-of-speech-based strategy that significantly improves the model’s performance on imbalanced datasets. Experiments on nested, multi-label, and flat datasets verify the generalization and superiority of our model, with results surpassing the existing state-of-the-art (SOTA) on several multi-label and flat benchmarks. After a series of experimental analyses, we highlight the persistent challenges in the multi-label NER. We are hopeful that the insights derived from our work will not only provide new perspectives on the nested NER landscape but also contribute to the ongoing momentum necessary for advancing research in the field of multi-label NER.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"305 \",\"pages\":\"Article 112686\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2024-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705124013200\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124013200","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
随着嵌套命名实体识别(NNER)领域的发展,由于多标签实体实例的数量不断增加,其复杂性也随之增加。如何更有效地识别多标签实体并探索标签之间的相关性是我们工作的重点。与以往以单标签多分类问题为模型的模型不同,我们提出了一种新颖的多标签局部度量 NER 模型,从多标签的角度重新思考嵌套实体识别。同时,为了解决多标签场景中常见的严重样本不平衡问题,我们引入了基于语音部分的策略,显著提高了模型在不平衡数据集上的性能。在嵌套、多标签和平面数据集上的实验验证了我们模型的通用性和优越性,在多个多标签和平面基准上的结果超过了现有的最先进模型(SOTA)。在一系列实验分析之后,我们强调了多标签 NER 中持续存在的挑战。我们希望,从我们的工作中得出的见解不仅能为嵌套 NER 领域提供新的视角,还能为推动多标签 NER 领域的研究提供必要的持续动力。
Local Metric NER: A new paradigm for named entity recognition from a multi-label perspective
As the field of Nested Named Entity Recognition (NNER) advances, it is marked by a growing complexity due to the increasing number of multi-label entity instances. How to more effectively identify multi-label entities and explore the correlation between labels is the focus of our work. Unlike previous models that are modeled in single-label multi-classification problems, we propose a novel multi-label local metric NER model to rethink Nested Entity Recognition from a multi-label perspective. Simultaneously, to address the significant sample imbalance problem commonly encountered in multi-label scenarios, we introduce a parts-of-speech-based strategy that significantly improves the model’s performance on imbalanced datasets. Experiments on nested, multi-label, and flat datasets verify the generalization and superiority of our model, with results surpassing the existing state-of-the-art (SOTA) on several multi-label and flat benchmarks. After a series of experimental analyses, we highlight the persistent challenges in the multi-label NER. We are hopeful that the insights derived from our work will not only provide new perspectives on the nested NER landscape but also contribute to the ongoing momentum necessary for advancing research in the field of multi-label NER.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.