Haoze Yu , Qunhe Ji , Yan Li , Huanpu Yin , Haisheng Li , Xiaohui Li , Junping Du
{"title":"MDM-NER: A multiple dependency modeling driven named entity recognition approach for judicial documents","authors":"Haoze Yu , Qunhe Ji , Yan Li , Huanpu Yin , Haisheng Li , Xiaohui Li , Junping Du","doi":"10.1016/j.inffus.2025.103807","DOIUrl":null,"url":null,"abstract":"<div><div>The characteristic of nested entities spanning a wide range in judicial documents poses significant challenges for entity recognition tasks. This paper proposes multiple dependency modeling driven named entity recognition model (MDM-NER), which can capture the association relationships between characters and words through the encoder module integrating Multi-Head Attention (MHA) and Cross-Attention (CA), and realize multi-dimensional collaborative recognition and label sequence optimization of nested entities through the decoder module composed of a joint predictor and Conditional Random Field model (CRF). It has demonstrated better comprehensive performance compared to existing models in comparative experiments when applied to Chinese corpora, English corpora, and constructed Judicial Document Corpus (JudDC), proving its adaptability, robustness, and transferability. In addition, the effectiveness of the significant components integrated attention (MHA-CA) and CRF was verified through ablation experiments, and the influence of two hyperparameters, quantity of heads and dilation rate, on the performance of model was discussed. As the key preliminary step, the proposed MDM-NER and constructed JudDR can be applied to the construction of judicial document knowledge graph, and the local deployment & data expansion tasks of vertical LLMs for judicial authorities.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"127 ","pages":"Article 103807"},"PeriodicalIF":15.5000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525008693","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The characteristic of nested entities spanning a wide range in judicial documents poses significant challenges for entity recognition tasks. This paper proposes multiple dependency modeling driven named entity recognition model (MDM-NER), which can capture the association relationships between characters and words through the encoder module integrating Multi-Head Attention (MHA) and Cross-Attention (CA), and realize multi-dimensional collaborative recognition and label sequence optimization of nested entities through the decoder module composed of a joint predictor and Conditional Random Field model (CRF). It has demonstrated better comprehensive performance compared to existing models in comparative experiments when applied to Chinese corpora, English corpora, and constructed Judicial Document Corpus (JudDC), proving its adaptability, robustness, and transferability. In addition, the effectiveness of the significant components integrated attention (MHA-CA) and CRF was verified through ablation experiments, and the influence of two hyperparameters, quantity of heads and dilation rate, on the performance of model was discussed. As the key preliminary step, the proposed MDM-NER and constructed JudDR can be applied to the construction of judicial document knowledge graph, and the local deployment & data expansion tasks of vertical LLMs for judicial authorities.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.