OPF-Miner: Order-Preserving Pattern Mining With Forgetting Mechanism for Time Series

IF 8.9 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Knowledge and Data Engineering Pub Date : 2024-08-05 DOI:10.1109/TKDE.2024.3438274

Yan Li;Chenyu Ma;Rong Gao;Youxi Wu;Jinyan Li;Wenjian Wang;Xindong Wu

{"title":"OPF-Miner: Order-Preserving Pattern Mining With Forgetting Mechanism for Time Series","authors":"Yan Li;Chenyu Ma;Rong Gao;Youxi Wu;Jinyan Li;Wenjian Wang;Xindong Wu","doi":"10.1109/TKDE.2024.3438274","DOIUrl":null,"url":null,"abstract":"Order-preserving pattern (OPP) mining is a type of sequential pattern mining method in which a group of ranks of time series is used to represent an OPP. This approach can discover frequent trends in time series. Existing OPP mining algorithms consider data points at different time to be equally important; however, newer data usually have a more significant impact, while older data have a weaker impact. We therefore introduce the forgetting mechanism into OPP mining to reduce the importance of older data. This paper explores the mining of OPPs with forgetting mechanism (OPF) and proposes an algorithm called OPF-Miner that can discover frequent OPFs. OPF-Miner performs two tasks, candidate pattern generation and support calculation. In candidate pattern generation, OPF-Miner employs a maximal support priority strategy and a group pattern fusion strategy to avoid redundant pattern fusions. For support calculation, we propose an algorithm called support calculation with forgetting mechanism, which uses prefix and suffix pattern pruning strategies to avoid redundant support calculations. The experiments are conducted on nine datasets and 12 alternative algorithms. The results verify that OPF-Miner is superior to other competitive algorithms. More importantly, OPF-Miner yields good clustering performance for time series, since the forgetting mechanism is employed.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"36 12","pages":"8981-8995"},"PeriodicalIF":8.9000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10623306/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Order-preserving pattern (OPP) mining is a type of sequential pattern mining method in which a group of ranks of time series is used to represent an OPP. This approach can discover frequent trends in time series. Existing OPP mining algorithms consider data points at different time to be equally important; however, newer data usually have a more significant impact, while older data have a weaker impact. We therefore introduce the forgetting mechanism into OPP mining to reduce the importance of older data. This paper explores the mining of OPPs with forgetting mechanism (OPF) and proposes an algorithm called OPF-Miner that can discover frequent OPFs. OPF-Miner performs two tasks, candidate pattern generation and support calculation. In candidate pattern generation, OPF-Miner employs a maximal support priority strategy and a group pattern fusion strategy to avoid redundant pattern fusions. For support calculation, we propose an algorithm called support calculation with forgetting mechanism, which uses prefix and suffix pattern pruning strategies to avoid redundant support calculations. The experiments are conducted on nine datasets and 12 alternative algorithms. The results verify that OPF-Miner is superior to other competitive algorithms. More importantly, OPF-Miner yields good clustering performance for time series, since the forgetting mechanism is employed.

查看原文本刊更多论文

OPF-Miner：具有遗忘机制的时间序列保序模式挖掘

保序模式（OPP）挖掘是一种序列模式挖掘方法，其中使用一组时间序列的等级来表示 OPP。这种方法可以发现时间序列中的频繁趋势。现有的 OPP 挖掘算法认为不同时间的数据点同等重要；但是，较新的数据通常影响更大，而较老的数据影响较弱。因此，我们在 OPP 挖掘中引入了遗忘机制，以降低旧数据的重要性。本文探讨了带有遗忘机制（OPF）的OPP挖掘，并提出了一种名为OPF-Miner的算法，可以发现频繁的OPF。OPF-Miner 执行两项任务：候选模式生成和支持计算。在候选模式生成中，OPF-Miner 采用了最大支持优先策略和分组模式融合策略，以避免冗余模式融合。在支持计算方面，我们提出了一种名为 "带遗忘机制的支持计算 "的算法，它使用前缀和后缀模式剪枝策略来避免冗余支持计算。我们在 9 个数据集和 12 种备选算法上进行了实验。结果验证了 OPF-Miner 优于其他竞争算法。更重要的是，由于采用了遗忘机制，OPF-Miner 对时间序列具有良好的聚类性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Knowledge and Data Engineering 工程技术-工程：电子与电气

CiteScore

11.70

自引率

3.40%

发文量

515

审稿时长

6 months

期刊介绍： The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.