{"title":"Mining Frequent Patterns in Data Stream over Sliding Windows","authors":"Feng Wu, Quanyuan Wu, Yan Zhong, Xin Jin","doi":"10.1109/CISE.2009.5363461","DOIUrl":null,"url":null,"abstract":"Frequent pattern mining in data stream is an important task. Under the time decay model, this paper presents a new algorithm SWFP for mining frequent patterns over sliding windows. The new definitions of the infrequent, critical and frequent patterns which reflect the actual statistical property of each pattern within the sliding windows, grasp the real substance of mining process and help to improve the mining quality essentially. The support decay mechanism is designed not only to differentiate the current and history transaction, but also to make the online pattern maintain operation easily and accurately. The reasonable strategy for the pattern pruning periodically is used to make big cuts in the maintenance cost and the error controlled in a small bound. Theoretical analysis guarantees no false negatives of SWFP. Experimental evaluation over a number of synthetic data sets demonstrates the efficiency and scalability of our method. Keywords-frequent pattern mining; sliding windows; time decay model; data stream","PeriodicalId":135441,"journal":{"name":"2009 International Conference on Computational Intelligence and Software Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Computational Intelligence and Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISE.2009.5363461","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Frequent pattern mining in data stream is an important task. Under the time decay model, this paper presents a new algorithm SWFP for mining frequent patterns over sliding windows. The new definitions of the infrequent, critical and frequent patterns which reflect the actual statistical property of each pattern within the sliding windows, grasp the real substance of mining process and help to improve the mining quality essentially. The support decay mechanism is designed not only to differentiate the current and history transaction, but also to make the online pattern maintain operation easily and accurately. The reasonable strategy for the pattern pruning periodically is used to make big cuts in the maintenance cost and the error controlled in a small bound. Theoretical analysis guarantees no false negatives of SWFP. Experimental evaluation over a number of synthetic data sets demonstrates the efficiency and scalability of our method. Keywords-frequent pattern mining; sliding windows; time decay model; data stream