IHUMN: an improved high-utility itemsets mining algorithm with negative utility items

Huijiao Wang, Jinghai Wei, Xin Wang, Xing Li, Hua Jiang
{"title":"IHUMN: an improved high-utility itemsets mining algorithm with negative utility items","authors":"Huijiao Wang, Jinghai Wei, Xin Wang, Xing Li, Hua Jiang","doi":"10.1145/3579654.3579766","DOIUrl":null,"url":null,"abstract":"High-utility itemset mining is to mine high profit itemsets from transaction databases. But if there are some itemsets with negative utility values in the transaction database, the high-utility itemsets with the negative values may be pruned incorrectly and the subset of the low-utility itemsets may be the high-utility itemsets. In this paper, an improved high-utility itemsets mining algorithm with negative utility items (IHUMN) is proposed. A novel utility-list buffer structure with negative unit profits is proposed to efficiently store and retrieve utility-list, and reduce the memory consumption during the mining process. Moreover, Transitive Extension with Negative utility formula is constructed to compute the upper bound of utility avoiding the overestimation of low-utility itemsets as high-utility itemsets. The performance of IHUMN is evaluated, and compared against the FHN and GHUM method. The results of the experiments confirm that IHUMN has a favorable improvement in terms of time costs, the memory utilization and the number of visited nodes. The IHUMN algorithm consumes 40% less memory than GHUM. Moreover, the algorithm has good performance on dense datasets.","PeriodicalId":146783,"journal":{"name":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Algorithms, Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3579654.3579766","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

High-utility itemset mining is to mine high profit itemsets from transaction databases. But if there are some itemsets with negative utility values in the transaction database, the high-utility itemsets with the negative values may be pruned incorrectly and the subset of the low-utility itemsets may be the high-utility itemsets. In this paper, an improved high-utility itemsets mining algorithm with negative utility items (IHUMN) is proposed. A novel utility-list buffer structure with negative unit profits is proposed to efficiently store and retrieve utility-list, and reduce the memory consumption during the mining process. Moreover, Transitive Extension with Negative utility formula is constructed to compute the upper bound of utility avoiding the overestimation of low-utility itemsets as high-utility itemsets. The performance of IHUMN is evaluated, and compared against the FHN and GHUM method. The results of the experiments confirm that IHUMN has a favorable improvement in terms of time costs, the memory utilization and the number of visited nodes. The IHUMN algorithm consumes 40% less memory than GHUM. Moreover, the algorithm has good performance on dense datasets.
IHUMN:一种改进的具有负效用项的高效用项集挖掘算法
高效用项集挖掘是指从事务数据库中挖掘高收益项集。但是,如果事务数据库中存在一些负效用值的项目集,则具有负效用值的高效用项目集可能会被错误地修剪,并且低效用项目集的子集可能是高效用项目集。本文提出了一种改进的带负效用项的高效用项集挖掘算法。为了有效地存储和检索公用事业表,减少挖掘过程中的内存消耗,提出了一种新的负单位利润公用事业表缓冲结构。此外,构造了具有负效用的传递扩展公式来计算效用的上界,避免了低效用项目集被高估为高效用项目集。对IHUMN的性能进行了评价,并与FHN和GHUM方法进行了比较。实验结果表明,IHUMN在时间成本、内存利用率和访问节点数量方面都有较好的改进。ihuman算法比GHUM算法消耗的内存少40%。此外,该算法在密集数据集上具有良好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信