GraphDrift-net: a dynamic graph-based framework for concept drift detection in short unstructured text streams

IF 2.9 3区 物理与天体物理 Q2 PHYSICS, MULTIDISCIPLINARY
Megha Ashok Patil, Sunil Kumar, Sandeep Kumar
{"title":"GraphDrift-net: a dynamic graph-based framework for concept drift detection in short unstructured text streams","authors":"Megha Ashok Patil,&nbsp;Sunil Kumar,&nbsp;Sandeep Kumar","doi":"10.1140/epjp/s13360-025-06812-0","DOIUrl":null,"url":null,"abstract":"<div><p>Detecting concept drift in text streams is challenging due to the rapid evolution of language, shifting user behavior, and temporal dependencies. Issues like data sparsity, high dimensionality, lack of labeled data, and multimodal drift further complicate real-time detection and adaptation. This paper proposes GraphDrift-net, a novel dynamic graph-based framework for detecting and adapting to concept drift in evolving text streams. The model comprise of the following components: evolving Time BERT (EvoTimeBERT), which captures temporal language evolution via historical token memory and multi-scale temporal convolutions, hierarchical temporal graph network with dynamic topics and adaptive memory (HTGN-DTAM), a heterogeneous graph neural network that dynamically constructs topic-aware graphs to track changing semantics and Chronograph Detection, a time-series-based drift detection method leveraging graph statistics such as node centrality and clustering coefficient changes. In addition, graph neural reinforcement learning framework (GNRL), a reinforcement learning-based adaptive learning module, enables model adaptability by word embedding update, memory decay rate tuning, and few-shot adaptation. Experimental evaluations over various real-world datasets, including Twitter-1, Twitter-2, Enron, and News20, demonstrate that GraphDrift-net outperforms other methods in accuracy, F1-score, and drift detection sensitivity. The model achieves accuracy as high as 99.7%, is able to identify more drift points, and is more stable with computational efficiency, making it extremely appropriate for real-time text stream applications.</p></div>","PeriodicalId":792,"journal":{"name":"The European Physical Journal Plus","volume":"140 9","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The European Physical Journal Plus","FirstCategoryId":"4","ListUrlMain":"https://link.springer.com/article/10.1140/epjp/s13360-025-06812-0","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Detecting concept drift in text streams is challenging due to the rapid evolution of language, shifting user behavior, and temporal dependencies. Issues like data sparsity, high dimensionality, lack of labeled data, and multimodal drift further complicate real-time detection and adaptation. This paper proposes GraphDrift-net, a novel dynamic graph-based framework for detecting and adapting to concept drift in evolving text streams. The model comprise of the following components: evolving Time BERT (EvoTimeBERT), which captures temporal language evolution via historical token memory and multi-scale temporal convolutions, hierarchical temporal graph network with dynamic topics and adaptive memory (HTGN-DTAM), a heterogeneous graph neural network that dynamically constructs topic-aware graphs to track changing semantics and Chronograph Detection, a time-series-based drift detection method leveraging graph statistics such as node centrality and clustering coefficient changes. In addition, graph neural reinforcement learning framework (GNRL), a reinforcement learning-based adaptive learning module, enables model adaptability by word embedding update, memory decay rate tuning, and few-shot adaptation. Experimental evaluations over various real-world datasets, including Twitter-1, Twitter-2, Enron, and News20, demonstrate that GraphDrift-net outperforms other methods in accuracy, F1-score, and drift detection sensitivity. The model achieves accuracy as high as 99.7%, is able to identify more drift points, and is more stable with computational efficiency, making it extremely appropriate for real-time text stream applications.

GraphDrift-net:一个动态的基于图形的框架,用于在短的非结构化文本流中进行概念漂移检测
由于语言的快速演变、用户行为的变化和时间依赖性,检测文本流中的概念漂移具有挑战性。数据稀疏性、高维性、缺乏标记数据和多模态漂移等问题进一步使实时检测和适应复杂化。本文提出了一种新的基于动态图的框架GraphDrift-net,用于检测和适应不断发展的文本流中的概念漂移。该模型由以下部分组成:进化时间BERT (EvoTimeBERT),它通过历史标记记忆和多尺度时间卷积捕捉时态语言的演变;具有动态主题和自适应记忆的分层时态图网络(HTGN-DTAM),一种异构图神经网络,它动态构建主题感知图以跟踪变化的语义和计时检测;一种基于时间序列的漂移检测方法,利用图形统计数据,如节点中心性和聚类系数的变化。此外,图神经强化学习框架(GNRL)是一种基于强化学习的自适应学习模块,通过词嵌入更新、记忆衰减率调整和少镜头自适应实现模型自适应性。对各种真实世界数据集(包括Twitter-1、Twitter-2、Enron和News20)的实验评估表明,GraphDrift-net在准确性、f1分数和漂移检测灵敏度方面优于其他方法。该模型准确率高达99.7%,能够识别更多漂移点,计算效率更稳定,非常适合实时文本流应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
The European Physical Journal Plus
The European Physical Journal Plus PHYSICS, MULTIDISCIPLINARY-
CiteScore
5.40
自引率
8.80%
发文量
1150
审稿时长
4-8 weeks
期刊介绍: The aims of this peer-reviewed online journal are to distribute and archive all relevant material required to document, assess, validate and reconstruct in detail the body of knowledge in the physical and related sciences. The scope of EPJ Plus encompasses a broad landscape of fields and disciplines in the physical and related sciences - such as covered by the topical EPJ journals and with the explicit addition of geophysics, astrophysics, general relativity and cosmology, mathematical and quantum physics, classical and fluid mechanics, accelerator and medical physics, as well as physics techniques applied to any other topics, including energy, environment and cultural heritage.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信