{"title":"Data Encoding for Byzantine-Resilient Distributed Gradient Descent","authors":"Deepesh Data, Linqi Song, S. Diggavi","doi":"10.1109/ALLERTON.2018.8636017","DOIUrl":null,"url":null,"abstract":"We consider distributed gradient computation, where both data and computation are distributed among m worker machines, t of which can be Byzantine adversaries, and a designated (master) node computes the model/parameter vector, iteratively using gradient descent (GD). The Byzantine adversaries can (collaboratively) deviate arbitrarily from their gradient computation. To solve this, we propose a method based on data encoding and (real) error correction to combat the adversarial behavior. We can tolerate up to$t\\leq \\displaystyle \\lfloor\\frac{m-1}{2}\\rfloor$ corrupt worker nodes, which is information-theoretically optimal. Our method does not assume any probability distribution on the data. We develop a sparse encoding scheme which enables computationally efficient data encoding. We demonstrate a trade-off between the number of adversaries tolerated and the resource requirement (storage and computational complexity). As an example, our scheme incurs a constant overhead (storage and computational complexity) over that required by the distributed GD algorithm, without adversaries, for$t\\leq \\displaystyle \\frac{m}{3}$.","PeriodicalId":299280,"journal":{"name":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ALLERTON.2018.8636017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 20
Abstract
We consider distributed gradient computation, where both data and computation are distributed among m worker machines, t of which can be Byzantine adversaries, and a designated (master) node computes the model/parameter vector iteratively using gradient descent (GD). The Byzantine adversaries can (collaboratively) deviate arbitrarily from their assigned gradient computations. To combat this adversarial behavior, we propose a method based on data encoding and (real) error correction. We can tolerate up to $t \leq \lfloor \frac{m-1}{2} \rfloor$ corrupt worker nodes, which is information-theoretically optimal. Our method does not assume any probability distribution on the data. We develop a sparse encoding scheme which enables computationally efficient data encoding. We demonstrate a trade-off between the number of adversaries tolerated and the resource requirement (storage and computational complexity). As an example, for $t \leq \frac{m}{3}$, our scheme incurs only a constant overhead (in storage and computational complexity) over that required by the distributed GD algorithm without adversaries.
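The paper's sparse real-valued encoding scheme is not reproduced in the abstract, so the following is only a minimal illustrative sketch of the general redundancy-plus-robust-decoding idea it builds on. It substitutes naive full replication (a repetition "code") for the paper's sparse encoding matrix and a coordinate-wise median for its error-correction decoder, which tolerates $t \leq \lfloor \frac{m-1}{2} \rfloor$ Byzantine workers only at maximal storage overhead. All function names and parameters below are assumptions made for this sketch, not the authors' implementation.

```python
import numpy as np

def encode_replicate(X, y, m):
    """Repetition 'encoding': every worker stores a full copy of (X, y).
    (The paper uses a sparse encoding matrix instead; this stand-in has
    maximal storage overhead but makes the decoding step trivial.)"""
    return [(X, y) for _ in range(m)]

def worker_gradient(shard, w, byzantine, rng):
    """Least-squares gradient on the worker's shard.
    A Byzantine worker may return an arbitrary vector (here: random noise)."""
    X, y = shard
    if byzantine:
        return rng.normal(size=w.shape) * 100.0
    return 2.0 * X.T @ (X @ w - y) / len(y)

def robust_aggregate(grads):
    """Coordinate-wise median as a simple 'decoder': with full replication it
    recovers the true gradient whenever at most floor((m-1)/2) workers lie."""
    return np.median(np.stack(grads), axis=0)

def byzantine_resilient_gd(X, y, m, t, steps=200, lr=0.1, seed=0):
    rng = np.random.default_rng(seed)
    shards = encode_replicate(X, y, m)
    byz = set(rng.choice(m, size=t, replace=False))   # adversarial worker ids
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        grads = [worker_gradient(shards[i], w, i in byz, rng) for i in range(m)]
        w = w - lr * robust_aggregate(grads)
    return w

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.normal(size=(500, 5))
    w_true = np.arange(1.0, 6.0)
    y = X @ w_true + 0.01 * rng.normal(size=500)
    # m = 9 workers, t = 4 = floor((m-1)/2) Byzantine workers tolerated.
    w_hat = byzantine_resilient_gd(X, y, m=9, t=4)
    print("recovered:", np.round(w_hat, 2))
```

The sketch illustrates the trade-off mentioned in the abstract in its most extreme form: full replication reaches the optimal tolerance $t \leq \lfloor \frac{m-1}{2} \rfloor$ but multiplies storage and per-iteration computation by m, whereas the paper's sparse encoding trades tolerance for overhead (e.g., constant overhead for $t \leq \frac{m}{3}$).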