Using K-means Clustering Ensemble to Improve the Performance in Recommender Systems

Hafed Zarzour, Faiz Maazouzi, Mohammad Al-Zinati, Amjad Nusayr, M. Alsmirat, M. Al-Ayyoub, Y. Jararweh
{"title":"Using K-means Clustering Ensemble to Improve the Performance in Recommender Systems","authors":"Hafed Zarzour, Faiz Maazouzi, Mohammad Al-Zinati, Amjad Nusayr, M. Alsmirat, M. Al-Ayyoub, Y. Jararweh","doi":"10.1109/IDSTA55301.2022.9923070","DOIUrl":null,"url":null,"abstract":"Collaborative filtering methods are often utilized in the industry of recommender systems. They work by identifying users with similar tastes and recommending items for each active user. Besides, clustering techniques are extensively utilized to create systems based on collaborative filtering recommendation in the context of big data. Nevertheless, the cluster ensemble has emerged in last years as a powerful technique that can replace single clustering algorithms in enhancing the performance of recommendation and prediction. This paper presents a k-means clustering ensemble-based method to improve the performance in recommender systems. The proposed system incorporates the Cosine Similarity and the Pearson Correlation Coefficient as similarity metrics to form clusters. Moreover, it uses the HyperGraph Partitioning Algorithm (HGPA) to combine the results of the k-means clustering technique. The recommendation algorithm constructs the recommendations based on the clusters obtained earlier by the HGPA ensemble clustering. To this end, it finds the nearest cluster for each active user and selects its top N items. Finally, it recommends these top items to the user‘s favorite list. The experiments on two well-known datasets demonstrate that cluster ensembles by HGPA outperform the baseline methods.","PeriodicalId":268343,"journal":{"name":"2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)","volume":"174 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IDSTA55301.2022.9923070","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Collaborative filtering methods are often utilized in the industry of recommender systems. They work by identifying users with similar tastes and recommending items for each active user. Besides, clustering techniques are extensively utilized to create systems based on collaborative filtering recommendation in the context of big data. Nevertheless, the cluster ensemble has emerged in last years as a powerful technique that can replace single clustering algorithms in enhancing the performance of recommendation and prediction. This paper presents a k-means clustering ensemble-based method to improve the performance in recommender systems. The proposed system incorporates the Cosine Similarity and the Pearson Correlation Coefficient as similarity metrics to form clusters. Moreover, it uses the HyperGraph Partitioning Algorithm (HGPA) to combine the results of the k-means clustering technique. The recommendation algorithm constructs the recommendations based on the clusters obtained earlier by the HGPA ensemble clustering. To this end, it finds the nearest cluster for each active user and selects its top N items. Finally, it recommends these top items to the user‘s favorite list. The experiments on two well-known datasets demonstrate that cluster ensembles by HGPA outperform the baseline methods.
利用k -均值聚类集成提高推荐系统的性能
协同过滤是推荐系统中常用的一种方法。它们的工作原理是识别具有相似品味的用户,并为每个活跃用户推荐商品。此外,在大数据背景下,聚类技术被广泛用于创建基于协同过滤推荐的系统。然而,近年来,聚类集成作为一种强大的技术已经出现,它可以取代单一的聚类算法来提高推荐和预测的性能。本文提出了一种基于k均值聚类集成的推荐系统性能改进方法。该系统结合余弦相似度和Pearson相关系数作为相似度度量来形成聚类。此外,它使用超图划分算法(HyperGraph Partitioning Algorithm, HGPA)来结合k-means聚类技术的结果。推荐算法基于HGPA集成聚类得到的聚类构造推荐。为此,它为每个活动用户找到最近的集群,并选择其top N项。最后,它将这些最受欢迎的项目推荐到用户的收藏列表中。在两个已知数据集上的实验表明,HGPA聚类集成优于基线方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信