Implementing BDFS(b) with diff-sets for real-time frequent pattern mining in dense datasets - first findings

Rajanish Dass, A. Mahanti
{"title":"Implementing BDFS(b) with diff-sets for real-time frequent pattern mining in dense datasets - first findings","authors":"Rajanish Dass, A. Mahanti","doi":"10.1109/UDM.2005.10","DOIUrl":null,"url":null,"abstract":"Finding frequent patterns from databases has been the most researched topic in association-rule mining. Business-intelligence using data mining has felt an increased thrust for real-time frequent pattern mining algorithms finding huge demand from numerous real-time business applications like e-commerce, recommender-systems, group-decision-support-systems, supply-chain-management, to name a few. Last decade has seen development of mind-whelming algorithms, among which, vertical-mining algorithms have been found to be very effective. However, with dense-datasets, the performances of these algorithms significantly degrade. Moreover, these algorithms are not suited to respond to the real-time need. In this paper, we describe BDFS(b)-diff-sets, an algorithm to perform real-time frequent pattern mining using diff-sets and using an intelligent staged search technique, by-passing usual breadth-first and depth-first search-techniques. Empirical evaluations show that our algorithm can make a fair estimation of the probable frequent-patterns reacting to the user-defined time bound and reaches some of the longest frequent patterns much faster than the existing algorithms.","PeriodicalId":223683,"journal":{"name":"International Workshop on Ubiquitous Data Management","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Workshop on Ubiquitous Data Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UDM.2005.10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Finding frequent patterns from databases has been the most researched topic in association-rule mining. Business-intelligence using data mining has felt an increased thrust for real-time frequent pattern mining algorithms finding huge demand from numerous real-time business applications like e-commerce, recommender-systems, group-decision-support-systems, supply-chain-management, to name a few. Last decade has seen development of mind-whelming algorithms, among which, vertical-mining algorithms have been found to be very effective. However, with dense-datasets, the performances of these algorithms significantly degrade. Moreover, these algorithms are not suited to respond to the real-time need. In this paper, we describe BDFS(b)-diff-sets, an algorithm to perform real-time frequent pattern mining using diff-sets and using an intelligent staged search technique, by-passing usual breadth-first and depth-first search-techniques. Empirical evaluations show that our algorithm can make a fair estimation of the probable frequent-patterns reacting to the user-defined time bound and reaches some of the longest frequent patterns much faster than the existing algorithms.
利用差分集实现BDFS(b),用于密集数据集的实时频繁模式挖掘——初步发现
从数据库中发现频繁模式一直是关联规则挖掘中研究最多的课题。使用数据挖掘的商业智能对实时频繁模式挖掘算法的需求越来越大,这些算法在众多实时业务应用程序(如电子商务、推荐系统、群体决策支持系统、供应链管理等)中发现了巨大的需求。在过去的十年里,我们看到了令人眼花缭乱的算法的发展,其中垂直挖掘算法已经被发现是非常有效的。然而,对于密集的数据集,这些算法的性能会显著下降。此外,这些算法不适合响应实时需求。在本文中,我们描述了BDFS(b)- diffi -sets,这是一种使用diffi -set和智能分阶段搜索技术来执行实时频繁模式挖掘的算法,绕过了通常的宽度优先和深度优先搜索技术。经验评估表明,我们的算法可以对响应用户定义的时间范围的可能频率模式做出公平的估计,并且比现有算法更快地达到一些最长的频率模式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信