{"title":"Privacy-preserving Blockchain-based Global Data Sharing for Federated Learning with Non-IID Data","authors":"Zhuotao Lian, Qingkui Zeng, Chunhua Su","doi":"10.1109/ICDCSW56584.2022.00044","DOIUrl":null,"url":null,"abstract":"Federated learning is a popular privacy-enhanced distributed machine learning method that solves the problem of local data privacy by gathering the training results (such as model weights, gradients, etc.) instead of the raw data to generate a global model. But a practical problem it faces is the non-independent and identical distribution of data, which means the local data of each participant is highly inconsistent, both in terms of quantity and distribution. Moreover, there is a lack of research related to the efficiency and privacy issues in the pre-training process. Therefore, in this paper, we propose a novel solution that utilizes blockchain technology to realize small-scale global data sharing to assist the training progress. Simulation experiments verify that our method not only guarantees data security but also greatly improves performance in terms of training speed and accuracy.","PeriodicalId":357138,"journal":{"name":"2022 IEEE 42nd International Conference on Distributed Computing Systems Workshops (ICDCSW)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 42nd International Conference on Distributed Computing Systems Workshops (ICDCSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDCSW56584.2022.00044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Federated learning is a popular privacy-enhanced distributed machine learning method that solves the problem of local data privacy by gathering the training results (such as model weights, gradients, etc.) instead of the raw data to generate a global model. But a practical problem it faces is the non-independent and identical distribution of data, which means the local data of each participant is highly inconsistent, both in terms of quantity and distribution. Moreover, there is a lack of research related to the efficiency and privacy issues in the pre-training process. Therefore, in this paper, we propose a novel solution that utilizes blockchain technology to realize small-scale global data sharing to assist the training progress. Simulation experiments verify that our method not only guarantees data security but also greatly improves performance in terms of training speed and accuracy.