{"title":"Upper bound on the length of generalized disjunction-free patterns","authors":"Marzena Kryszkiewicz","doi":"10.1109/SSDBM.2004.72","DOIUrl":null,"url":null,"abstract":"A number of lossless representations of frequent patterns were proposed in recent years. The representation that consists of all frequent closed itemsets and the representations based on generalized disjunction-free patterns or on non-derivable itemsets are proven the most concise ones. Experiments show further that the latter ones are by a few orders of magnitude more concise (and determinable) than the former one. As follows from experiments, the representations based on generalized disjunction-free patterns are also more concise than the available in the literature representations of frequent patterns, which determine supports of patterns in an approximate way. In this paper, we provide an upper bound on the length of generalized disjunction-free patterns. The bound determines the maximum number of scans of the database carried out by a priori-like algorithms discovering the representations based on generalized disjunction-free patterns.","PeriodicalId":383615,"journal":{"name":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","volume":"125 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSDBM.2004.72","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
A number of lossless representations of frequent patterns were proposed in recent years. The representation that consists of all frequent closed itemsets and the representations based on generalized disjunction-free patterns or on non-derivable itemsets are proven the most concise ones. Experiments show further that the latter ones are by a few orders of magnitude more concise (and determinable) than the former one. As follows from experiments, the representations based on generalized disjunction-free patterns are also more concise than the available in the literature representations of frequent patterns, which determine supports of patterns in an approximate way. In this paper, we provide an upper bound on the length of generalized disjunction-free patterns. The bound determines the maximum number of scans of the database carried out by a priori-like algorithms discovering the representations based on generalized disjunction-free patterns.