{"title":"Efficient two-party Private Set Union and circuit-based version for large-scale data set based on novel oblivious filter","authors":"Qian Xu, Huajie Shen, Yuhan Yang, Bo Yu, Wei He","doi":"10.1016/j.sysarc.2025.103581","DOIUrl":null,"url":null,"abstract":"<div><div>Private Set Union (PSU) protocol enables secure computation of set unions across multiple parties while preserving input privacy — a critical capability for large-scale data aggregation. To date, two-party PSU schemes predominantly adopt the “split-then-execute” paradigm, which as demonstrated by Jia et al. (2024), inherently suffers from <em>during-execution leakage</em>. While their proposed symmetric-key approach addresses this vulnerability, the dependency on private equality test (PET) protocol additionally introduces non-negligible computational and communication overheads. Meanwhile, research on circuit-PSU protocol capable of securely computing a function (merge, sort or federated learning) over the union set remains blank.</div><div>In this paper, we focus on two main objectives: (1) We present an optimized two-party PSU protocol that eliminates during-execution leakage while achieving enhanced efficiency; (2) We propose the <em><strong>first</strong></em> circuit-PSU. Unlike conventional PSU where participant obtains plaintext result, circuit-PSU provides secret-shared outputs that inherently prevent both during-execution leakage and intersection cardinality exposure. Furthermore, circuit-PSU removes the requirement for predefined result recipient and thus can offer superior flexibility. Meanwhile, our technical innovations include an optimized Share Translation (ShareTrans) protocol and a novel Oblivious Filter (OF) primitive that synergistically enhance both PSU and circuit-PSU efficiency. The experimental results indicate that under constrained computational resources (8-core CPU, 16GB RAM), the proposed circuit-PSU achieves a comparatively better improvement in communication complexity compared to existing state-of-the-art schemes, and our two-party PSU also exhibits comparable performance, for example, achieving approximately <span><math><mrow><mn>4</mn><mo>×</mo></mrow></math></span> efficiency improvement over Jia et al. (2024).</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"168 ","pages":"Article 103581"},"PeriodicalIF":4.1000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems Architecture","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S138376212500253X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Private Set Union (PSU) protocol enables secure computation of set unions across multiple parties while preserving input privacy — a critical capability for large-scale data aggregation. To date, two-party PSU schemes predominantly adopt the “split-then-execute” paradigm, which as demonstrated by Jia et al. (2024), inherently suffers from during-execution leakage. While their proposed symmetric-key approach addresses this vulnerability, the dependency on private equality test (PET) protocol additionally introduces non-negligible computational and communication overheads. Meanwhile, research on circuit-PSU protocol capable of securely computing a function (merge, sort or federated learning) over the union set remains blank.
In this paper, we focus on two main objectives: (1) We present an optimized two-party PSU protocol that eliminates during-execution leakage while achieving enhanced efficiency; (2) We propose the first circuit-PSU. Unlike conventional PSU where participant obtains plaintext result, circuit-PSU provides secret-shared outputs that inherently prevent both during-execution leakage and intersection cardinality exposure. Furthermore, circuit-PSU removes the requirement for predefined result recipient and thus can offer superior flexibility. Meanwhile, our technical innovations include an optimized Share Translation (ShareTrans) protocol and a novel Oblivious Filter (OF) primitive that synergistically enhance both PSU and circuit-PSU efficiency. The experimental results indicate that under constrained computational resources (8-core CPU, 16GB RAM), the proposed circuit-PSU achieves a comparatively better improvement in communication complexity compared to existing state-of-the-art schemes, and our two-party PSU also exhibits comparable performance, for example, achieving approximately efficiency improvement over Jia et al. (2024).
期刊介绍:
The Journal of Systems Architecture: Embedded Software Design (JSA) is a journal covering all design and architectural aspects related to embedded systems and software. It ranges from the microarchitecture level via the system software level up to the application-specific architecture level. Aspects such as real-time systems, operating systems, FPGA programming, programming languages, communications (limited to analysis and the software stack), mobile systems, parallel and distributed architectures as well as additional subjects in the computer and system architecture area will fall within the scope of this journal. Technology will not be a main focus, but its use and relevance to particular designs will be. Case studies are welcome but must contribute more than just a design for a particular piece of software.
Design automation of such systems including methodologies, techniques and tools for their design as well as novel designs of software components fall within the scope of this journal. Novel applications that use embedded systems are also central in this journal. While hardware is not a part of this journal hardware/software co-design methods that consider interplay between software and hardware components with and emphasis on software are also relevant here.