Using incremental clustering technique in collaborative filtering data update

Xiwei Wang, Jun Zhang
{"title":"Using incremental clustering technique in collaborative filtering data update","authors":"Xiwei Wang, Jun Zhang","doi":"10.1109/IRI.2014.7051920","DOIUrl":null,"url":null,"abstract":"Collaborative filtering (CF) techniques are widely used by online shops in their recommender systems. It is well known that the nonnegative matrix factorization (NMF) based CF algorithms are popular and can provide reasonable product recommendations. However, the dimensions of the factor matrices in NMF need to be predetermined and updated when necessary. Moreover, data arrives in every second so the recommender systems must be capable of updating the fast growing data in a timely manner. In this paper, we propose an approach that incorporates incremental clustering technique into NMF based data update algorithm which can determine the dimensions of the factor matrices and update them automatically. The approach clusters users' and items' auxiliary information and uses them as constraints in NMF for data update. The cluster quantities are used as the dimensions of the factor matrices. With more data coming in, the incremental clustering algorithm determines whether to increase the number of clusters or merge the existing clusters. Experiments on three different datasets (MovieLens, Sushi and LibimSeTi) are conducted to examine the proposed approach. The results show that our approach can update the data quickly and provide encouraging prediction accuracy.","PeriodicalId":360013,"journal":{"name":"Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2014.7051920","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Collaborative filtering (CF) techniques are widely used by online shops in their recommender systems. It is well known that the nonnegative matrix factorization (NMF) based CF algorithms are popular and can provide reasonable product recommendations. However, the dimensions of the factor matrices in NMF need to be predetermined and updated when necessary. Moreover, data arrives in every second so the recommender systems must be capable of updating the fast growing data in a timely manner. In this paper, we propose an approach that incorporates incremental clustering technique into NMF based data update algorithm which can determine the dimensions of the factor matrices and update them automatically. The approach clusters users' and items' auxiliary information and uses them as constraints in NMF for data update. The cluster quantities are used as the dimensions of the factor matrices. With more data coming in, the incremental clustering algorithm determines whether to increase the number of clusters or merge the existing clusters. Experiments on three different datasets (MovieLens, Sushi and LibimSeTi) are conducted to examine the proposed approach. The results show that our approach can update the data quickly and provide encouraging prediction accuracy.
采用增量聚类技术协同过滤数据更新
协同过滤(CF)技术被广泛应用于网上商店的推荐系统中。众所周知,基于非负矩阵分解(NMF)的CF算法很受欢迎,可以提供合理的产品推荐。然而,NMF中因子矩阵的维数需要预先确定并在必要时更新。此外,数据每秒钟都会到达,因此推荐系统必须能够及时更新快速增长的数据。本文提出了一种将增量聚类技术与基于NMF的数据更新算法相结合的方法,该方法可以自动确定因子矩阵的维数并进行更新。该方法将用户和项目的辅助信息聚类,并将其作为NMF中的约束条件进行数据更新。聚类数量用作因子矩阵的维数。随着越来越多的数据进入,增量聚类算法决定是增加簇的数量还是合并现有的簇。在三个不同的数据集(MovieLens, Sushi和LibimSeTi)上进行了实验来检验所提出的方法。结果表明,该方法可以快速更新数据,并提供令人鼓舞的预测精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信