Using K-means Clustering Ensemble to Improve the Performance in Recommender Systems

2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA) Pub Date : 2022-09-05 DOI:10.1109/IDSTA55301.2022.9923070

Hafed Zarzour, Faiz Maazouzi, Mohammad Al-Zinati, Amjad Nusayr, M. Alsmirat, M. Al-Ayyoub, Y. Jararweh

{"title":"Using K-means Clustering Ensemble to Improve the Performance in Recommender Systems","authors":"Hafed Zarzour, Faiz Maazouzi, Mohammad Al-Zinati, Amjad Nusayr, M. Alsmirat, M. Al-Ayyoub, Y. Jararweh","doi":"10.1109/IDSTA55301.2022.9923070","DOIUrl":null,"url":null,"abstract":"Collaborative filtering methods are often utilized in the industry of recommender systems. They work by identifying users with similar tastes and recommending items for each active user. Besides, clustering techniques are extensively utilized to create systems based on collaborative filtering recommendation in the context of big data. Nevertheless, the cluster ensemble has emerged in last years as a powerful technique that can replace single clustering algorithms in enhancing the performance of recommendation and prediction. This paper presents a k-means clustering ensemble-based method to improve the performance in recommender systems. The proposed system incorporates the Cosine Similarity and the Pearson Correlation Coefficient as similarity metrics to form clusters. Moreover, it uses the HyperGraph Partitioning Algorithm (HGPA) to combine the results of the k-means clustering technique. The recommendation algorithm constructs the recommendations based on the clusters obtained earlier by the HGPA ensemble clustering. To this end, it finds the nearest cluster for each active user and selects its top N items. Finally, it recommends these top items to the user‘s favorite list. The experiments on two well-known datasets demonstrate that cluster ensembles by HGPA outperform the baseline methods.","PeriodicalId":268343,"journal":{"name":"2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)","volume":"174 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IDSTA55301.2022.9923070","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Collaborative filtering methods are often utilized in the industry of recommender systems. They work by identifying users with similar tastes and recommending items for each active user. Besides, clustering techniques are extensively utilized to create systems based on collaborative filtering recommendation in the context of big data. Nevertheless, the cluster ensemble has emerged in last years as a powerful technique that can replace single clustering algorithms in enhancing the performance of recommendation and prediction. This paper presents a k-means clustering ensemble-based method to improve the performance in recommender systems. The proposed system incorporates the Cosine Similarity and the Pearson Correlation Coefficient as similarity metrics to form clusters. Moreover, it uses the HyperGraph Partitioning Algorithm (HGPA) to combine the results of the k-means clustering technique. The recommendation algorithm constructs the recommendations based on the clusters obtained earlier by the HGPA ensemble clustering. To this end, it finds the nearest cluster for each active user and selects its top N items. Finally, it recommends these top items to the user‘s favorite list. The experiments on two well-known datasets demonstrate that cluster ensembles by HGPA outperform the baseline methods.

查看原文本刊更多论文

利用k -均值聚类集成提高推荐系统的性能

协同过滤是推荐系统中常用的一种方法。它们的工作原理是识别具有相似品味的用户，并为每个活跃用户推荐商品。此外，在大数据背景下，聚类技术被广泛用于创建基于协同过滤推荐的系统。然而，近年来，聚类集成作为一种强大的技术已经出现，它可以取代单一的聚类算法来提高推荐和预测的性能。本文提出了一种基于k均值聚类集成的推荐系统性能改进方法。该系统结合余弦相似度和Pearson相关系数作为相似度度量来形成聚类。此外，它使用超图划分算法(HyperGraph Partitioning Algorithm, HGPA)来结合k-means聚类技术的结果。推荐算法基于HGPA集成聚类得到的聚类构造推荐。为此，它为每个活动用户找到最近的集群，并选择其top N项。最后，它将这些最受欢迎的项目推荐到用户的收藏列表中。在两个已知数据集上的实验表明，HGPA聚类集成优于基线方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)

自引率

0.00%

发文量