Yan Li, Chenyu Ma, Rong Gao, Youxi Wu, Jinyan Li, Wenjian Wang, Xindong Wu
{"title":"具有遗忘机制的保序模式挖掘","authors":"Yan Li, Chenyu Ma, Rong Gao, Youxi Wu, Jinyan Li, Wenjian Wang, Xindong Wu","doi":"arxiv-2408.15563","DOIUrl":null,"url":null,"abstract":"Order-preserving pattern (OPP) mining is a type of sequential pattern mining\nmethod in which a group of ranks of time series is used to represent an OPP.\nThis approach can discover frequent trends in time series. Existing OPP mining\nalgorithms consider data points at different time to be equally important;\nhowever, newer data usually have a more significant impact, while older data\nhave a weaker impact. We therefore introduce the forgetting mechanism into OPP\nmining to reduce the importance of older data. This paper explores the mining\nof OPPs with forgetting mechanism (OPF) and proposes an algorithm called\nOPF-Miner that can discover frequent OPFs. OPF-Miner performs two tasks,\ncandidate pattern generation and support calculation. In candidate pattern\ngeneration, OPF-Miner employs a maximal support priority strategy and a group\npattern fusion strategy to avoid redundant pattern fusions. For support\ncalculation, we propose an algorithm called support calculation with forgetting\nmechanism, which uses prefix and suffix pattern pruning strategies to avoid\nredundant support calculations. The experiments are conducted on nine datasets\nand 12 alternative algorithms. The results verify that OPF-Miner is superior to\nother competitive algorithms. More importantly, OPF-Miner yields good\nclustering performance for time series, since the forgetting mechanism is\nemployed. All algorithms can be downloaded from\nhttps://github.com/wuc567/Pattern-Mining/tree/master/OPF-Miner.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":"61 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Order-preserving pattern mining with forgetting mechanism\",\"authors\":\"Yan Li, Chenyu Ma, Rong Gao, Youxi Wu, Jinyan Li, Wenjian Wang, Xindong Wu\",\"doi\":\"arxiv-2408.15563\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Order-preserving pattern (OPP) mining is a type of sequential pattern mining\\nmethod in which a group of ranks of time series is used to represent an OPP.\\nThis approach can discover frequent trends in time series. Existing OPP mining\\nalgorithms consider data points at different time to be equally important;\\nhowever, newer data usually have a more significant impact, while older data\\nhave a weaker impact. We therefore introduce the forgetting mechanism into OPP\\nmining to reduce the importance of older data. This paper explores the mining\\nof OPPs with forgetting mechanism (OPF) and proposes an algorithm called\\nOPF-Miner that can discover frequent OPFs. OPF-Miner performs two tasks,\\ncandidate pattern generation and support calculation. In candidate pattern\\ngeneration, OPF-Miner employs a maximal support priority strategy and a group\\npattern fusion strategy to avoid redundant pattern fusions. For support\\ncalculation, we propose an algorithm called support calculation with forgetting\\nmechanism, which uses prefix and suffix pattern pruning strategies to avoid\\nredundant support calculations. The experiments are conducted on nine datasets\\nand 12 alternative algorithms. The results verify that OPF-Miner is superior to\\nother competitive algorithms. More importantly, OPF-Miner yields good\\nclustering performance for time series, since the forgetting mechanism is\\nemployed. All algorithms can be downloaded from\\nhttps://github.com/wuc567/Pattern-Mining/tree/master/OPF-Miner.\",\"PeriodicalId\":501123,\"journal\":{\"name\":\"arXiv - CS - Databases\",\"volume\":\"61 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Databases\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.15563\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.15563","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
保序模式(OPP)挖掘是一种序列模式挖掘方法,其中一组时间序列的等级被用来表示一个 OPP。现有的 OPP 挖掘算法认为不同时间的数据点同等重要;但是,较新的数据通常影响更大,而较老的数据影响较弱。因此,我们在 OPP 挖掘中引入了遗忘机制,以降低旧数据的重要性。本文探讨了带有遗忘机制(OPF)的 OPP 挖掘,并提出了一种名为 OPF-Miner 的算法,它可以发现频繁的 OPF。OPF-Miner 执行两项任务,即候选模式生成和支持计算。在候选模式生成中,OPF-Miner 采用了最大支持优先策略和组模式融合策略,以避免冗余模式融合。在支持计算方面,我们提出了一种名为 "带遗忘机制的支持计算 "的算法,它使用前缀和后缀模式剪枝策略来避免冗余的支持计算。我们在九个数据集和 12 种备选算法上进行了实验。结果验证了 OPF-Miner 优于其他竞争算法。更重要的是,由于采用了遗忘机制,OPF-Miner 对时间序列具有良好的聚类性能。所有算法可从https://github.com/wuc567/Pattern-Mining/tree/master/OPF-Miner 下载。
Order-preserving pattern mining with forgetting mechanism
Order-preserving pattern (OPP) mining is a type of sequential pattern mining
method in which a group of ranks of time series is used to represent an OPP.
This approach can discover frequent trends in time series. Existing OPP mining
algorithms consider data points at different time to be equally important;
however, newer data usually have a more significant impact, while older data
have a weaker impact. We therefore introduce the forgetting mechanism into OPP
mining to reduce the importance of older data. This paper explores the mining
of OPPs with forgetting mechanism (OPF) and proposes an algorithm called
OPF-Miner that can discover frequent OPFs. OPF-Miner performs two tasks,
candidate pattern generation and support calculation. In candidate pattern
generation, OPF-Miner employs a maximal support priority strategy and a group
pattern fusion strategy to avoid redundant pattern fusions. For support
calculation, we propose an algorithm called support calculation with forgetting
mechanism, which uses prefix and suffix pattern pruning strategies to avoid
redundant support calculations. The experiments are conducted on nine datasets
and 12 alternative algorithms. The results verify that OPF-Miner is superior to
other competitive algorithms. More importantly, OPF-Miner yields good
clustering performance for time series, since the forgetting mechanism is
employed. All algorithms can be downloaded from
https://github.com/wuc567/Pattern-Mining/tree/master/OPF-Miner.