It Takes Two to Tango: An Exploration of Domain Pairs for Cross-Domain Collaborative Filtering

Proceedings of the 9th ACM Conference on Recommender Systems Pub Date : 2015-09-16 DOI:10.1145/2792838.2800188

Shaghayegh Sherry Sahebi, Peter Brusilovsky

{"title":"It Takes Two to Tango: An Exploration of Domain Pairs for Cross-Domain Collaborative Filtering","authors":"Shaghayegh Sherry Sahebi, Peter Brusilovsky","doi":"10.1145/2792838.2800188","DOIUrl":null,"url":null,"abstract":"As the heterogeneity of data sources are increasing on the web, and due to the sparsity of data in each of these data sources, cross-domain recommendation is becoming an emerging research topic in the recent years. Cross-domain collaborative filtering aims to transfer the user rating pattern from source (auxiliary) domains to a target domain for the purpose of alleviating the sparsity problem and providing better target recommendations. However, the studies so far have either focused on a limited number of domains that are assumed to be related to each other (such as books and movies), or a division of the same dataset (such as movies) into different domains based on an item characteristic (such as genre). In this paper, we study a broad set of domains and their characteristics to understand the factors that affect the success or failure of cross-domain collaborative filtering, the amount of improvement in cross-domain approaches, and the selection of best source domains for a specific target domain. We propose to use Canonical Correlation Analysis (CCA) as a significant major factor in finding the most promising source domains for a target domain, and suggest a cross-domain collaborative filtering based on CCA (CD-CCA) that proves to be successful in using the shared information between domains in the target recommendations.","PeriodicalId":325637,"journal":{"name":"Proceedings of the 9th ACM Conference on Recommender Systems","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"34","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th ACM Conference on Recommender Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2792838.2800188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 34

Abstract

As the heterogeneity of data sources are increasing on the web, and due to the sparsity of data in each of these data sources, cross-domain recommendation is becoming an emerging research topic in the recent years. Cross-domain collaborative filtering aims to transfer the user rating pattern from source (auxiliary) domains to a target domain for the purpose of alleviating the sparsity problem and providing better target recommendations. However, the studies so far have either focused on a limited number of domains that are assumed to be related to each other (such as books and movies), or a division of the same dataset (such as movies) into different domains based on an item characteristic (such as genre). In this paper, we study a broad set of domains and their characteristics to understand the factors that affect the success or failure of cross-domain collaborative filtering, the amount of improvement in cross-domain approaches, and the selection of best source domains for a specific target domain. We propose to use Canonical Correlation Analysis (CCA) as a significant major factor in finding the most promising source domains for a target domain, and suggest a cross-domain collaborative filtering based on CCA (CD-CCA) that proves to be successful in using the shared information between domains in the target recommendations.

查看原文本刊更多论文

两个人跳探戈:跨领域协同过滤的领域对探索

随着网络上数据源的异构性日益增加，并且由于每个数据源中数据的稀疏性，跨域推荐成为近年来新兴的研究课题。跨域协同过滤旨在将用户评价模式从源(辅助)域转移到目标域，以缓解稀疏性问题，提供更好的目标推荐。然而，到目前为止的研究要么集中在有限数量的假定彼此相关的领域(如书籍和电影)，要么根据项目特征(如类型)将相同的数据集(如电影)划分为不同的领域。在本文中，我们研究了一组广泛的域及其特征，以了解影响跨域协同过滤成功或失败的因素，跨域方法的改进量，以及为特定目标域选择最佳源域。我们建议使用典型相关分析(CCA)作为寻找目标域最有希望的源域的重要因素，并提出了基于CCA的跨域协同过滤(CD-CCA)，该方法在目标推荐中成功地利用了域之间的共享信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 9th ACM Conference on Recommender Systems

自引率

0.00%

发文量