An adaptive approach to collaborative filtering using attribute autocorrelation

J. Das, Shreya Dugar, Harshil Gupta, S. Majumder, Prosenjit Gupta
{"title":"An adaptive approach to collaborative filtering using attribute autocorrelation","authors":"J. Das, Shreya Dugar, Harshil Gupta, S. Majumder, Prosenjit Gupta","doi":"10.1109/IEMCON.2015.7344484","DOIUrl":null,"url":null,"abstract":"Recommender Systems (RS) provide a rich collection of tools for enabling users to filter through large amount of information available on the Web. Collaborative Filtering (CF) is one of the most widely used and successful techniques behind the development of RS. CF based RS recommend items by computing similarities between users and/or items. The items recommended to a user are those preferred by similar users. However, with the tremendous growth in users and items on the Web, CF algorithms suffer from serious scalability problems because similarities between every pair of users and/or items need to be computed during the training phase. In this paper, we propose a scalable CF method by using data clustering techniques. The proposed work partitions the users of the CF system using an adaptive K-means clustering algorithm and then use those partitions (clusters) to select the similar users (neighborhood) of a target user. In this work, we also try to determine the optimal value of K (number of clusters). Once a target cluster is determined, the neighborhood of the target user is selected by looking into the similarity score between the target user and all other users in that cluster. The basic idea is to partition the users of the RS and apply the CF based recommendation algorithm separately to the partitions. The cluster-based approach reduces the runtime of the system as we avoid similarity computations over the entire rating data. Experiments performed on MovieLens-1M dataset indicate that our method is efficient in reducing the runtime as well as maintaining an acceptable recommendation quality.","PeriodicalId":111626,"journal":{"name":"2015 International Conference and Workshop on Computing and Communication (IEMCON)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference and Workshop on Computing and Communication (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCON.2015.7344484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Recommender Systems (RS) provide a rich collection of tools for enabling users to filter through large amount of information available on the Web. Collaborative Filtering (CF) is one of the most widely used and successful techniques behind the development of RS. CF based RS recommend items by computing similarities between users and/or items. The items recommended to a user are those preferred by similar users. However, with the tremendous growth in users and items on the Web, CF algorithms suffer from serious scalability problems because similarities between every pair of users and/or items need to be computed during the training phase. In this paper, we propose a scalable CF method by using data clustering techniques. The proposed work partitions the users of the CF system using an adaptive K-means clustering algorithm and then use those partitions (clusters) to select the similar users (neighborhood) of a target user. In this work, we also try to determine the optimal value of K (number of clusters). Once a target cluster is determined, the neighborhood of the target user is selected by looking into the similarity score between the target user and all other users in that cluster. The basic idea is to partition the users of the RS and apply the CF based recommendation algorithm separately to the partitions. The cluster-based approach reduces the runtime of the system as we avoid similarity computations over the entire rating data. Experiments performed on MovieLens-1M dataset indicate that our method is efficient in reducing the runtime as well as maintaining an acceptable recommendation quality.
一种基于属性自相关的自适应协同过滤方法
推荐系统(RS)提供了丰富的工具集合,使用户能够过滤Web上可用的大量信息。协同过滤(CF)是RS发展背后应用最广泛和最成功的技术之一,基于CF的RS通过计算用户和/或物品之间的相似度来推荐物品。向用户推荐的项目是类似用户喜欢的项目。然而,随着Web上用户和项目的巨大增长,CF算法面临严重的可伸缩性问题,因为在训练阶段需要计算每对用户和/或项目之间的相似性。在本文中,我们提出了一个可扩展的CF方法,使用数据聚类技术。本文采用自适应K-means聚类算法对CF系统的用户进行划分,然后利用这些划分(聚类)来选择目标用户的相似用户(邻域)。在这项工作中,我们还尝试确定K(簇数)的最优值。一旦确定了目标集群,通过查看目标用户与该集群中所有其他用户之间的相似性得分来选择目标用户的邻域。其基本思想是对RS的用户进行分区,并将基于CF的推荐算法分别应用于分区。基于集群的方法减少了系统的运行时间,因为我们避免了对整个评级数据的相似性计算。在MovieLens-1M数据集上进行的实验表明,我们的方法在减少运行时间和保持可接受的推荐质量方面是有效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信