Federated Prototype-based Contrastive Learning for Privacy-Preserving Cross-domain Recommendation

arXiv - CS - Information Retrieval Pub Date : 2024-09-05 DOI:arxiv-2409.03294

Li Wang, Quangui Zhang, Lei Sang, Qiang Wu, Min Xu

{"title":"Federated Prototype-based Contrastive Learning for Privacy-Preserving Cross-domain Recommendation","authors":"Li Wang, Quangui Zhang, Lei Sang, Qiang Wu, Min Xu","doi":"arxiv-2409.03294","DOIUrl":null,"url":null,"abstract":"Cross-domain recommendation (CDR) aims to improve recommendation accuracy in\nsparse domains by transferring knowledge from data-rich domains. However,\nexisting CDR methods often assume the availability of user-item interaction\ndata across domains, overlooking user privacy concerns. Furthermore, these\nmethods suffer from performance degradation in scenarios with sparse\noverlapping users, as they typically depend on a large number of fully shared\nusers for effective knowledge transfer. To address these challenges, we propose\na Federated Prototype-based Contrastive Learning (CL) method for\nPrivacy-Preserving CDR, named FedPCL-CDR. This approach utilizes\nnon-overlapping user information and prototypes to improve multi-domain\nperformance while protecting user privacy. FedPCL-CDR comprises two modules:\nlocal domain (client) learning and global server aggregation. In the local\ndomain, FedPCL-CDR clusters all user data to learn representative prototypes,\neffectively utilizing non-overlapping user information and addressing the\nsparse overlapping user issue. It then facilitates knowledge transfer by\nemploying both local and global prototypes returned from the server in a CL\nmanner. Simultaneously, the global server aggregates representative prototypes\nfrom local domains to learn both local and global prototypes. The combination\nof prototypes and federated learning (FL) ensures that sensitive user data\nremains decentralized, with only prototypes being shared across domains,\nthereby protecting user privacy. Extensive experiments on four CDR tasks using\ntwo real-world datasets demonstrate that FedPCL-CDR outperforms the\nstate-of-the-art baselines.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"24 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03294","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Cross-domain recommendation (CDR) aims to improve recommendation accuracy in sparse domains by transferring knowledge from data-rich domains. However, existing CDR methods often assume the availability of user-item interaction data across domains, overlooking user privacy concerns. Furthermore, these methods suffer from performance degradation in scenarios with sparse overlapping users, as they typically depend on a large number of fully shared users for effective knowledge transfer. To address these challenges, we propose a Federated Prototype-based Contrastive Learning (CL) method for Privacy-Preserving CDR, named FedPCL-CDR. This approach utilizes non-overlapping user information and prototypes to improve multi-domain performance while protecting user privacy. FedPCL-CDR comprises two modules: local domain (client) learning and global server aggregation. In the local domain, FedPCL-CDR clusters all user data to learn representative prototypes, effectively utilizing non-overlapping user information and addressing the sparse overlapping user issue. It then facilitates knowledge transfer by employing both local and global prototypes returned from the server in a CL manner. Simultaneously, the global server aggregates representative prototypes from local domains to learn both local and global prototypes. The combination of prototypes and federated learning (FL) ensures that sensitive user data remains decentralized, with only prototypes being shared across domains, thereby protecting user privacy. Extensive experiments on four CDR tasks using two real-world datasets demonstrate that FedPCL-CDR outperforms the state-of-the-art baselines.

查看原文本刊更多论文

基于联合原型的对比学习，实现保护隐私的跨域推荐

跨领域推荐（CDR）旨在通过转移数据丰富领域的知识来提高不同领域的推荐准确性。然而，现有的跨域推荐方法通常假定跨域的用户-项目交互数据是可用的，从而忽略了用户隐私问题。此外，这些方法在用户稀疏重叠的场景中性能下降，因为它们通常依赖于大量完全共享的用户来实现有效的知识转移。为了应对这些挑战，我们提出了一种基于联邦原型的对比学习（CL）方法，用于保护隐私的 CDR，命名为 FedPCL-CDR。这种方法利用不重叠的用户信息和原型来提高多域性能，同时保护用户隐私。FedPCL-CDR 包括两个模块：本地域（客户端）学习和全球服务器聚合。在本地域，FedPCL-CDR 对所有用户数据进行聚类，以学习具有代表性的原型，从而有效利用非重叠用户信息，解决稀疏重叠用户问题。然后，它通过使用从服务器返回的本地和全局原型来促进知识转移。与此同时，全局服务器汇聚来自本地域的代表性原型，以学习本地和全局原型。原型和联合学习（FL）的结合确保了用户敏感数据的分散性，只有原型可以跨域共享，从而保护了用户隐私。使用两个真实数据集对四项 CDR 任务进行的广泛实验表明，FedPCL-CDR 的性能优于最先进的基线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Information Retrieval

自引率

0.00%

发文量