Private Sample Alignment for Vertical Federated Learning: An Efficient and Reliable Realization

IF 6.3 1区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

IEEE Transactions on Information Forensics and Security Pub Date : 2025-03-28 DOI:10.1109/TIFS.2025.3555794

Yuxin Xi;Yu Guo;Shiyuan Xu;Chengjun Cai;Xiaohua Jia

{"title":"Private Sample Alignment for Vertical Federated Learning: An Efficient and Reliable Realization","authors":"Yuxin Xi;Yu Guo;Shiyuan Xu;Chengjun Cai;Xiaohua Jia","doi":"10.1109/TIFS.2025.3555794","DOIUrl":null,"url":null,"abstract":"Sample alignment is recognized as a vital component of vertical federated learning, which facilitates the integration of differential samples and high-quality model training. In this trend, providing Private Sample Alignment (PSA) among multi-clients becomes naturally necessary for preventing unauthorized sample access and client privacy exposure. However, exiting PSA protocols mainly focus on two-party scenarios and cannot be directly adapted to the multi-client delegated computing scenarios required for vertical federated learning. Besides, these studies fail to address the need for protocol robustness in practical federated Learning network environments. Therefore, we aim to design an efficient and reliable PSA protocol in multi-client vertical federated learning. In this work, we present the first practical PSA protocol for vertical federated learning, allowing multi-clients to efficiently identify common samples without revealing additional information. Toward this direction, our PSA protocol first explores the Learning With Errors (LWE) problem to create a lightweight delegated Private Set Intersection (PSI) scheme, enabling efficient sample intersection among multiple clients. To achieve the reliability of the PSA protocol, we devise a multi-client vector aggregation algorithm that securely delegates the server to calculate the sample intersection. Building on this foundation, we develop an efficient Threshold-based Private Sample Alignment (T-PSA) protocol that allows multiple clients to determine the intersection of their input samples only if the intersection size surpasses a specific threshold. We implement a prototype and conduct a thorough security analysis. Comprehensive evaluation results confirm the efficiency and practicality of our design.","PeriodicalId":13492,"journal":{"name":"IEEE Transactions on Information Forensics and Security","volume":"20 ","pages":"3834-3848"},"PeriodicalIF":6.3000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Information Forensics and Security","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10945511/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Sample alignment is recognized as a vital component of vertical federated learning, which facilitates the integration of differential samples and high-quality model training. In this trend, providing Private Sample Alignment (PSA) among multi-clients becomes naturally necessary for preventing unauthorized sample access and client privacy exposure. However, exiting PSA protocols mainly focus on two-party scenarios and cannot be directly adapted to the multi-client delegated computing scenarios required for vertical federated learning. Besides, these studies fail to address the need for protocol robustness in practical federated Learning network environments. Therefore, we aim to design an efficient and reliable PSA protocol in multi-client vertical federated learning. In this work, we present the first practical PSA protocol for vertical federated learning, allowing multi-clients to efficiently identify common samples without revealing additional information. Toward this direction, our PSA protocol first explores the Learning With Errors (LWE) problem to create a lightweight delegated Private Set Intersection (PSI) scheme, enabling efficient sample intersection among multiple clients. To achieve the reliability of the PSA protocol, we devise a multi-client vector aggregation algorithm that securely delegates the server to calculate the sample intersection. Building on this foundation, we develop an efficient Threshold-based Private Sample Alignment (T-PSA) protocol that allows multiple clients to determine the intersection of their input samples only if the intersection size surpasses a specific threshold. We implement a prototype and conduct a thorough security analysis. Comprehensive evaluation results confirm the efficiency and practicality of our design.

查看原文本刊更多论文

垂直联邦学习的私有样本对齐：一种高效可靠的实现

样本对齐被认为是垂直联邦学习的重要组成部分，它有助于差分样本和高质量模型训练的整合。在这种趋势下，为防止未经授权的样本访问和客户端隐私暴露，在多客户端之间提供私有样本对齐（PSA）自然是必要的。然而，现有的PSA协议主要关注双方场景，不能直接适应垂直联邦学习所需的多客户机委托计算场景。此外，这些研究未能解决实际联邦学习网络环境中对协议鲁棒性的需求。因此，我们的目标是在多客户端垂直联邦学习中设计一个高效可靠的PSA协议。在这项工作中，我们提出了第一个用于垂直联邦学习的实用PSA协议，允许多客户端在不泄露额外信息的情况下有效地识别公共样本。在这个方向上，我们的PSA协议首先探索了带错误学习（LWE）问题，创建了一个轻量级的委托私有集交叉（PSI）方案，实现了多个客户端之间有效的样本交叉。为了实现PSA协议的可靠性，我们设计了一个多客户端矢量聚合算法，该算法安全地委托服务器来计算样本交集。在此基础上，我们开发了一种高效的基于阈值的私有样本对齐（T-PSA）协议，该协议允许多个客户端仅在交集大小超过特定阈值时才确定其输入样本的交集。我们实现了一个原型并进行了彻底的安全分析。综合评价结果证实了本设计的有效性和实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Information Forensics and Security 工程技术-工程：电子与电气

CiteScore

14.40

自引率

7.40%

发文量

234

审稿时长

6.5 months

期刊介绍： The IEEE Transactions on Information Forensics and Security covers the sciences, technologies, and applications relating to information forensics, information security, biometrics, surveillance and systems applications that incorporate these features