Sliding window based rare partial periodic pattern mining algorithms over temporal data streams.

IF 2.4 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS
Frontiers in Big Data Pub Date : 2025-06-04 eCollection Date: 2025-01-01 DOI:10.3389/fdata.2025.1600267
K Jyothi Upadhya, Ronan Lobo, Mini Shail Chhabra, Aman Paleja, B Dinesh Rao, Geetha M, Prachi Sisodia, Bolusani Akshita Reddy
{"title":"Sliding window based rare partial periodic pattern mining algorithms over temporal data streams.","authors":"K Jyothi Upadhya, Ronan Lobo, Mini Shail Chhabra, Aman Paleja, B Dinesh Rao, Geetha M, Prachi Sisodia, Bolusani Akshita Reddy","doi":"10.3389/fdata.2025.1600267","DOIUrl":null,"url":null,"abstract":"<p><p>Periodic pattern mining, a branch of data mining, is expanding to provide insight into the occurrence behavior of large volumes of data. Recently, a variety of industries, including fraud detection, telecommunications, retail marketing, research, and medical have found applications for rare association rule mining, which uncovers unusual or unexpected combinations. A limited amount of literature demonstrated how periodicity is essential in mining low-support rare patterns. In addition, attention must be placed on temporal datasets that analyze crucial information about the timing of pattern occurrences and stream datasets to manage high-speed streaming data. Several algorithms have been developed that effectively track the cyclic behavior of patterns and identify the patterns that display complete or partial periodic behavior in temporal datasets. Numerous frameworks have been created to examine the periodic behavior of streaming data. Nevertheless, such a method that focuses on the temporal information in the data stream and extracts rare partial periodic patterns has yet to be proposed. With a focus on identifying rare partial periodic patterns from temporal data streams, this paper proposes two novel sliding window-based single scan approaches called <i>R3PStreamSW-Growth</i> and <i>R3PStreamSW-BitVectorMiner</i>. The findings showed that when a dense dataset <i>Accidents</i> is considered, for different threshold variations <i>R3P-StreamSWBitVectorMiner</i> outperformed <i>R3PStreamSW-Growth</i> by about 93%. Similarly, when the sparse dataset <i>T10I4D100K</i> is taken into account, <i>R3P-StreamSWBitVectorMiner</i> exhibits a 90% boost in performance. This demonstrates that on a range of synthetic, real-world, sparse, and dense datasets for different thresholds, <i>R3P-StreamSWBitVectorMiner</i> is significantly faster than <i>R3PStreamSW-Growth</i>.</p>","PeriodicalId":52859,"journal":{"name":"Frontiers in Big Data","volume":"8 ","pages":"1600267"},"PeriodicalIF":2.4000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12175007/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Big Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fdata.2025.1600267","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Periodic pattern mining, a branch of data mining, is expanding to provide insight into the occurrence behavior of large volumes of data. Recently, a variety of industries, including fraud detection, telecommunications, retail marketing, research, and medical have found applications for rare association rule mining, which uncovers unusual or unexpected combinations. A limited amount of literature demonstrated how periodicity is essential in mining low-support rare patterns. In addition, attention must be placed on temporal datasets that analyze crucial information about the timing of pattern occurrences and stream datasets to manage high-speed streaming data. Several algorithms have been developed that effectively track the cyclic behavior of patterns and identify the patterns that display complete or partial periodic behavior in temporal datasets. Numerous frameworks have been created to examine the periodic behavior of streaming data. Nevertheless, such a method that focuses on the temporal information in the data stream and extracts rare partial periodic patterns has yet to be proposed. With a focus on identifying rare partial periodic patterns from temporal data streams, this paper proposes two novel sliding window-based single scan approaches called R3PStreamSW-Growth and R3PStreamSW-BitVectorMiner. The findings showed that when a dense dataset Accidents is considered, for different threshold variations R3P-StreamSWBitVectorMiner outperformed R3PStreamSW-Growth by about 93%. Similarly, when the sparse dataset T10I4D100K is taken into account, R3P-StreamSWBitVectorMiner exhibits a 90% boost in performance. This demonstrates that on a range of synthetic, real-world, sparse, and dense datasets for different thresholds, R3P-StreamSWBitVectorMiner is significantly faster than R3PStreamSW-Growth.

Abstract Image

Abstract Image

Abstract Image

基于滑动窗口的时间数据流稀有部分周期模式挖掘算法。
周期性模式挖掘是数据挖掘的一个分支,它正在扩展到提供对大量数据的发生行为的洞察。最近,包括欺诈检测、电信、零售营销、研究和医疗在内的各种行业都发现了罕见关联规则挖掘的应用程序,这些应用程序可以发现不寻常或意外的组合。有限数量的文献证明了周期性在挖掘低支持稀有模式中是多么重要。此外,必须注意分析模式发生时间的关键信息的时间数据集和流数据集,以管理高速流数据。已经开发了几种算法,可以有效地跟踪模式的循环行为,并识别在时间数据集中显示完整或部分周期行为的模式。已经创建了许多框架来检查流数据的周期性行为。然而,这种关注数据流中的时间信息并提取罕见的部分周期模式的方法尚未被提出。为了从时间数据流中识别罕见的部分周期模式,本文提出了两种新的基于滑动窗口的单扫描方法,称为R3PStreamSW-Growth和R3PStreamSW-BitVectorMiner。研究结果表明,当考虑密集数据集事故时,对于不同的阈值变化,R3P-StreamSWBitVectorMiner的性能优于R3PStreamSW-Growth约93%。同样,当考虑到稀疏数据集T10I4D100K时,r3d - streamswbitvectorminer的性能提升了90%。这表明,在不同阈值的合成、真实、稀疏和密集数据集上,R3P-StreamSWBitVectorMiner明显快于R3PStreamSW-Growth。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.20
自引率
3.20%
发文量
122
审稿时长
13 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信