{"title":"Fundamental Limits of Approximate Gradient Coding","authors":"Sinong Wang, Jiashang Liu, N. Shroff","doi":"10.1145/3393691.3394188","DOIUrl":null,"url":null,"abstract":"In the distributed graident coding problem, it has been established that, to exactly recover the gradient under s slow machines, the mmimum computation load (number of stored data partitions) of each worker is at least linear ($s+1$), which incurs a large overhead when s is large[13]. In this paper, we focus on approximate gradient coding that aims to recover the gradient with bounded error ε. Theoretically, our main contributions are three-fold: (i) we analyze the structure of optimal gradient codes, and derive the information-theoretical lower bound of minimum computation load: O(log(n)/log(n/s)) for ε = 0 and d≥ O(log(1/ε)/log(n/s)) for ε>0, where d is the computation load, and ε is the error in the gradient computation; (ii) we design two approximate gradient coding schemes that exactly match such lower bounds based on random edge removal process; (iii) we implement our schemes and demonstrate the advantage of the approaches over the current fastest gradient coding strategies. The proposed schemes provide order-wise improvement over the state of the art in terms of computation load, and are also optimal in terms of both computation load and latency.","PeriodicalId":188517,"journal":{"name":"Abstracts of the 2020 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Abstracts of the 2020 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3393691.3394188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 2
Abstract
In the distributed gradient coding problem, it has been established that, to exactly recover the gradient under s slow machines (stragglers), the minimum computation load (number of stored data partitions) of each worker must grow at least linearly as $s+1$, which incurs a large overhead when s is large [13]. In this paper, we focus on approximate gradient coding, which aims to recover the gradient with a bounded error ε. Theoretically, our main contributions are three-fold: (i) we analyze the structure of optimal gradient codes and derive information-theoretic lower bounds on the minimum computation load d: d ≥ O(log(n)/log(n/s)) for ε = 0 and d ≥ O(log(1/ε)/log(n/s)) for ε > 0, where d is the computation load and ε is the error in the gradient computation; (ii) we design two approximate gradient coding schemes, based on a random edge-removal process, that exactly match these lower bounds; (iii) we implement our schemes and demonstrate their advantage over the currently fastest gradient coding strategies. The proposed schemes provide an order-wise improvement over the state of the art in terms of computation load, and are also optimal in terms of both computation load and latency.
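To make the scaling d ≥ O(log(1/ε)/log(n/s)) concrete, the sketch below simulates a simple replication-based approximate scheme. It is offered as an illustration under our own assumptions (a random bipartite assignment of partitions to workers, with hypothetical helpers `random_assignment` and `approximate_gradient`), not as the paper's exact edge-removal construction. Each data partition is replicated on d workers chosen at random, so a partition's gradient is lost only when all d replicas land on stragglers, which happens with probability roughly (s/n)^d; driving that miss rate below ε suggests d on the order of log(1/ε)/log(n/s), matching the form of the bound in the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_assignment(n, d, rng):
    """Assign each of the n data partitions to d distinct workers chosen
    uniformly at random (a plausible random-graph construction; each worker
    then stores d partitions on average)."""
    # assignment[i] = array of worker indices holding partition i
    return [rng.choice(n, size=d, replace=False) for _ in range(n)]

def approximate_gradient(partial_grads, assignment, alive):
    """Combine per-partition gradients from surviving workers.

    partial_grads[i] : gradient of partition i (simulated directly here)
    assignment[i]    : workers that store partition i
    alive            : boolean mask over the n workers
    A partition is recovered if at least one of its d replicas sits on a
    non-straggling worker; missing partitions drop out, which is the
    source of the bounded error eps."""
    total = np.zeros_like(partial_grads[0])
    recovered = 0
    for i, workers in enumerate(assignment):
        if alive[workers].any():
            total += partial_grads[i]
            recovered += 1
    return total, recovered

# Toy simulation: n partitions/workers, s stragglers, computation load d.
n, s, d, dim = 64, 8, 3, 10
grads = [rng.normal(size=dim) for _ in range(n)]      # per-partition gradients
assignment = random_assignment(n, d, rng)
alive = np.ones(n, dtype=bool)
alive[rng.choice(n, size=s, replace=False)] = False   # s random stragglers

approx, recovered = approximate_gradient(grads, assignment, alive)
exact = np.sum(grads, axis=0)
err = np.linalg.norm(approx - exact) / np.linalg.norm(exact)
print(f"recovered {recovered}/{n} partitions, relative error {err:.3f}")
```

Running the toy simulation with larger d shrinks the number of lost partitions (and hence the error) geometrically in d, which is the qualitative behavior the log(1/ε)/log(n/s) trade-off describes.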