Zkfhed: A Verifiable and Scalable Blockchain-Enhanced Federated Learning System

IF 10.4 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Knowledge and Data Engineering Pub Date : 2025-03-17 DOI:10.1109/TKDE.2025.3550546

Bingxue Zhang;Guangguang Lu;Yuncheng Wu;Kunpeng Ren;Feida Zhu

{"title":"Zkfhed: A Verifiable and Scalable Blockchain-Enhanced Federated Learning System","authors":"Bingxue Zhang;Guangguang Lu;Yuncheng Wu;Kunpeng Ren;Feida Zhu","doi":"10.1109/TKDE.2025.3550546","DOIUrl":null,"url":null,"abstract":"Federated learning (FL) is an emerging paradigm that enables multiple clients to collaboratively train a machine learning (ML) model without the need to exchange their raw data. However, it relies on a centralized authority to coordinate participants’ activities. This not only interrupts the entire training task in case of a single point of failure, but also lacks an effective regulatory mechanism to prevent malicious behavior. Although blockchain, with its decentralized architecture and data immutability, has significantly advanced the development of FL, it still struggles to withstand poisoning attacks and faces limitations in computational scalability. We propose Zkfhed, a verifiable and scalable FL system that overcomes the limitations of blockchain-based FL in poison attacks and computational scalability. First, we propose a two-stage audit scheme based on zero-knowledge proofs (ZKPs), which verifies that the training data are extracted from trusted organizations and that computations on the data exactly follow the specified training protocols. Second, we propose a homomorphic encryption delegation learning (HEDL), based on fully homomorphic encryption (FHE). It is capable of outsourcing complex computing to external computing resources without sacrificing the client's data privacy. Final, extensive experiments on real-world datasets demonstrate that Zkfhed can effectively identify malicious clients and is highly efficient and scalable in terms of online time and communication efficiency.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 6","pages":"3841-3854"},"PeriodicalIF":10.4000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10930720/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Federated learning (FL) is an emerging paradigm that enables multiple clients to collaboratively train a machine learning (ML) model without the need to exchange their raw data. However, it relies on a centralized authority to coordinate participants’ activities. This not only interrupts the entire training task in case of a single point of failure, but also lacks an effective regulatory mechanism to prevent malicious behavior. Although blockchain, with its decentralized architecture and data immutability, has significantly advanced the development of FL, it still struggles to withstand poisoning attacks and faces limitations in computational scalability. We propose Zkfhed, a verifiable and scalable FL system that overcomes the limitations of blockchain-based FL in poison attacks and computational scalability. First, we propose a two-stage audit scheme based on zero-knowledge proofs (ZKPs), which verifies that the training data are extracted from trusted organizations and that computations on the data exactly follow the specified training protocols. Second, we propose a homomorphic encryption delegation learning (HEDL), based on fully homomorphic encryption (FHE). It is capable of outsourcing complex computing to external computing resources without sacrificing the client's data privacy. Final, extensive experiments on real-world datasets demonstrate that Zkfhed can effectively identify malicious clients and is highly efficient and scalable in terms of online time and communication efficiency.

查看原文本刊更多论文

Zkfhed：一个可验证和可扩展的区块链增强联邦学习系统

联邦学习（FL）是一种新兴范例，它使多个客户端能够协作训练机器学习（ML）模型，而无需交换原始数据。然而，它依赖于一个集中的权威机构来协调参与者的活动。这不仅在单点故障的情况下会中断整个训练任务，而且缺乏有效的监管机制来防止恶意行为。尽管区块链凭借其去中心化架构和数据不变性，极大地推动了FL的发展，但它仍然难以抵御中毒攻击，并且在计算可扩展性方面面临限制。我们提出了Zkfhed，这是一个可验证和可扩展的FL系统，克服了基于区块链的FL在中毒攻击和计算可扩展性方面的限制。首先，我们提出了一种基于零知识证明（zkp）的两阶段审计方案，该方案验证了训练数据是从可信组织中提取的，并且对数据的计算完全遵循指定的训练协议。其次，我们提出了一种基于完全同态加密（FHE）的同态加密委托学习（HEDL）。它能够在不牺牲客户数据隐私的情况下将复杂的计算外包给外部计算资源。最后，在真实数据集上的大量实验表明，Zkfhed可以有效地识别恶意客户端，并且在在线时间和通信效率方面具有高效率和可扩展性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Knowledge and Data Engineering 工程技术-工程：电子与电气

CiteScore

11.70

自引率

3.40%

发文量

515

审稿时长

6 months

期刊介绍： The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.