An adaptive approach to collaborative filtering using attribute autocorrelation

2015 International Conference and Workshop on Computing and Communication (IEMCON) Pub Date : 2015-10-01 DOI:10.1109/IEMCON.2015.7344484

J. Das, Shreya Dugar, Harshil Gupta, S. Majumder, Prosenjit Gupta

{"title":"An adaptive approach to collaborative filtering using attribute autocorrelation","authors":"J. Das, Shreya Dugar, Harshil Gupta, S. Majumder, Prosenjit Gupta","doi":"10.1109/IEMCON.2015.7344484","DOIUrl":null,"url":null,"abstract":"Recommender Systems (RS) provide a rich collection of tools for enabling users to filter through large amount of information available on the Web. Collaborative Filtering (CF) is one of the most widely used and successful techniques behind the development of RS. CF based RS recommend items by computing similarities between users and/or items. The items recommended to a user are those preferred by similar users. However, with the tremendous growth in users and items on the Web, CF algorithms suffer from serious scalability problems because similarities between every pair of users and/or items need to be computed during the training phase. In this paper, we propose a scalable CF method by using data clustering techniques. The proposed work partitions the users of the CF system using an adaptive K-means clustering algorithm and then use those partitions (clusters) to select the similar users (neighborhood) of a target user. In this work, we also try to determine the optimal value of K (number of clusters). Once a target cluster is determined, the neighborhood of the target user is selected by looking into the similarity score between the target user and all other users in that cluster. The basic idea is to partition the users of the RS and apply the CF based recommendation algorithm separately to the partitions. The cluster-based approach reduces the runtime of the system as we avoid similarity computations over the entire rating data. Experiments performed on MovieLens-1M dataset indicate that our method is efficient in reducing the runtime as well as maintaining an acceptable recommendation quality.","PeriodicalId":111626,"journal":{"name":"2015 International Conference and Workshop on Computing and Communication (IEMCON)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference and Workshop on Computing and Communication (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCON.2015.7344484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Recommender Systems (RS) provide a rich collection of tools for enabling users to filter through large amount of information available on the Web. Collaborative Filtering (CF) is one of the most widely used and successful techniques behind the development of RS. CF based RS recommend items by computing similarities between users and/or items. The items recommended to a user are those preferred by similar users. However, with the tremendous growth in users and items on the Web, CF algorithms suffer from serious scalability problems because similarities between every pair of users and/or items need to be computed during the training phase. In this paper, we propose a scalable CF method by using data clustering techniques. The proposed work partitions the users of the CF system using an adaptive K-means clustering algorithm and then use those partitions (clusters) to select the similar users (neighborhood) of a target user. In this work, we also try to determine the optimal value of K (number of clusters). Once a target cluster is determined, the neighborhood of the target user is selected by looking into the similarity score between the target user and all other users in that cluster. The basic idea is to partition the users of the RS and apply the CF based recommendation algorithm separately to the partitions. The cluster-based approach reduces the runtime of the system as we avoid similarity computations over the entire rating data. Experiments performed on MovieLens-1M dataset indicate that our method is efficient in reducing the runtime as well as maintaining an acceptable recommendation quality.

查看原文本刊更多论文

一种基于属性自相关的自适应协同过滤方法

推荐系统(RS)提供了丰富的工具集合，使用户能够过滤Web上可用的大量信息。协同过滤(CF)是RS发展背后应用最广泛和最成功的技术之一，基于CF的RS通过计算用户和/或物品之间的相似度来推荐物品。向用户推荐的项目是类似用户喜欢的项目。然而，随着Web上用户和项目的巨大增长，CF算法面临严重的可伸缩性问题，因为在训练阶段需要计算每对用户和/或项目之间的相似性。在本文中，我们提出了一个可扩展的CF方法，使用数据聚类技术。本文采用自适应K-means聚类算法对CF系统的用户进行划分，然后利用这些划分(聚类)来选择目标用户的相似用户(邻域)。在这项工作中，我们还尝试确定K(簇数)的最优值。一旦确定了目标集群，通过查看目标用户与该集群中所有其他用户之间的相似性得分来选择目标用户的邻域。其基本思想是对RS的用户进行分区，并将基于CF的推荐算法分别应用于分区。基于集群的方法减少了系统的运行时间，因为我们避免了对整个评级数据的相似性计算。在MovieLens-1M数据集上进行的实验表明，我们的方法在减少运行时间和保持可接受的推荐质量方面是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 International Conference and Workshop on Computing and Communication (IEMCON)

自引率

0.00%

发文量