Speaker identification combining role knowledge graph correction and contextual block attention

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2025-04-29 DOI:10.1016/j.eswa.2025.127758

Ye Tao , Fei Wang , Yanchang Cai , Wei Li

{"title":"Speaker identification combining role knowledge graph correction and contextual block attention","authors":"Ye Tao , Fei Wang , Yanchang Cai , Wei Li","doi":"10.1016/j.eswa.2025.127758","DOIUrl":null,"url":null,"abstract":"<div><div>Character dialogue play a crucial role in novels, serving as a key element for understanding both the plot and character relationships. With advancements in artificial intelligence and natural language processing, dialogue speaker recognition has seen significant progress. In this paper, we integrate a character role knowledge graph with a self-training mechanism to perform the speaker recognition on Chinese novel quotations using large-scale novel corpora. Quotes in novels can generally be classified as explicit or implicit. Implicit quotes require identifying speakers from extensive contextual information, which poses challenges for existing models in handling long and detail-rich contexts. In addition, existing end-to-end speaker recognition methods are ineffective because they do not fully consider the relationship between context and quotes. In this paper, we first propose a Narrative Unit-based Context Selection (NUCS) algorithm for determining the context to which quotes belong. Secondly, a speaker recognition method based on Role Knowledge Graph Correction (RKGC) and Contextual Block Attention (CBA) is proposed. The proposed role knowledge graph correction algorithm improves speaker attribution by deeply analyzing role entities, their relationships, and relevant trigger words within quotations. It then refines candidate speaker probabilities obtained from the previous module. Additionally, the algorithm captures character relationships within the context using role mappings from existing novels. The CBA method introduces a block attention mechanism to capture character relationships within the context. It effectively computes the probability of each character being the speaker and determines the start and end indices of the most probable speaker segments. This allows the model to focus more precisely on speaker-related content, leading to more accurate speaker predictions. Experimental results demonstrate that our approach achieves competitive performance: on our self-constructed CNSI dataset, it attains 91.9% EM and 93.3% F1 scores. Although slightly lower than the SOTA SPC method (EM 92.3%, F1 93.6%), our approach demonstrates significant advantages in contextual reasoning capabilities, particularly for complex multi-character dialogue scenarios.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"284 ","pages":"Article 127758"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425013806","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Character dialogue play a crucial role in novels, serving as a key element for understanding both the plot and character relationships. With advancements in artificial intelligence and natural language processing, dialogue speaker recognition has seen significant progress. In this paper, we integrate a character role knowledge graph with a self-training mechanism to perform the speaker recognition on Chinese novel quotations using large-scale novel corpora. Quotes in novels can generally be classified as explicit or implicit. Implicit quotes require identifying speakers from extensive contextual information, which poses challenges for existing models in handling long and detail-rich contexts. In addition, existing end-to-end speaker recognition methods are ineffective because they do not fully consider the relationship between context and quotes. In this paper, we first propose a Narrative Unit-based Context Selection (NUCS) algorithm for determining the context to which quotes belong. Secondly, a speaker recognition method based on Role Knowledge Graph Correction (RKGC) and Contextual Block Attention (CBA) is proposed. The proposed role knowledge graph correction algorithm improves speaker attribution by deeply analyzing role entities, their relationships, and relevant trigger words within quotations. It then refines candidate speaker probabilities obtained from the previous module. Additionally, the algorithm captures character relationships within the context using role mappings from existing novels. The CBA method introduces a block attention mechanism to capture character relationships within the context. It effectively computes the probability of each character being the speaker and determines the start and end indices of the most probable speaker segments. This allows the model to focus more precisely on speaker-related content, leading to more accurate speaker predictions. Experimental results demonstrate that our approach achieves competitive performance: on our self-constructed CNSI dataset, it attains 91.9% EM and 93.3% F1 scores. Although slightly lower than the SOTA SPC method (EM 92.3%, F1 93.6%), our approach demonstrates significant advantages in contextual reasoning capabilities, particularly for complex multi-character dialogue scenarios.

查看原文本刊更多论文

结合角色知识图校正和上下文块注意的说话人识别

人物对话在小说中扮演着至关重要的角色，是理解情节和人物关系的关键元素。随着人工智能和自然语言处理的进步，对话说话人识别已经取得了重大进展。在本文中，我们将角色知识图谱与自我训练机制相结合，利用大规模的小说语料库对汉语小说引文进行说话人识别。小说中的引语一般可分为显性引语和隐性引语。隐含引用需要从大量的上下文信息中识别说话者，这对现有模型在处理长且细节丰富的上下文时提出了挑战。此外，现有的端到端说话人识别方法由于没有充分考虑上下文和引语之间的关系而效果不佳。在本文中，我们首先提出了一种基于叙事单元的上下文选择（NUCS）算法，用于确定引用所属的上下文。其次，提出了一种基于角色知识图校正和上下文块注意的说话人识别方法。本文提出的角色知识图谱校正算法通过深入分析角色实体、角色实体之间的关系以及引文中的相关触发词，提高说话人归因能力。然后对从前一模块得到的候选说话人概率进行细化。此外，该算法使用现有小说中的角色映射来捕获上下文中的角色关系。CBA方法引入了块注意机制来捕获上下文中的字符关系。它有效地计算每个字符成为说话人的概率，并确定最可能的说话人片段的开始和结束索引。这使得模型可以更精确地关注与说话者相关的内容，从而更准确地预测说话者。实验结果表明，我们的方法取得了具有竞争力的性能：在我们自己构建的CNSI数据集上，它获得了91.9%的EM和93.3%的F1分数。虽然略低于SOTA SPC方法（EM 92.3%, F1 93.6%），但我们的方法在上下文推理能力方面具有显著优势，特别是对于复杂的多字符对话场景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.