A Code-Based Distributed Gradient Descent Method

2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton) Pub Date : 2018-10-01 DOI:10.1109/ALLERTON.2018.8635869

Elie Atallah, N. Rahnavard

{"title":"A Code-Based Distributed Gradient Descent Method","authors":"Elie Atallah, N. Rahnavard","doi":"10.1109/ALLERTON.2018.8635869","DOIUrl":null,"url":null,"abstract":"Distributed gradient descent is an optimization algorithm that is used to solve a minimization problem distributed over a network through minimizing local functions that sum up to form the overall objective function. These local functions fi contribute to local gradients adding up incrementally to form the overall gradient. Recently, the gradient coding paradigm was introduced for networks with a centralized fusion center to resolve the problem of straggler nodes. Through introducing some kind of redundancy on each node, such coding schemes are utilized to form new coded local functions gi from the original local functions fi. In this work, we consider a distributed network with a defined network topology and no fusion center. At each node, linear combinations of the local coded gradients $\\nabla\\overline{g}_{i}$ can be constructed to form the overall gradient. Our iterative method, referred to as Code-Based Distributed Gradient Descent (CDGD), updates each node's local estimate by applying an adequate weighing scheme. This scheme adapts the coded local gradient descent step along with local estimates from neighboring nodes. We provide the convergence analysis for CDGD and we analytically show that we enhance the convergence rate by a scaling factor over conventional incremental methods without any predefined tuning. Furthermore, we demonstrate through numerical results significant performance and enhancements for convergence rates.","PeriodicalId":299280,"journal":{"name":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ALLERTON.2018.8635869","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Distributed gradient descent is an optimization algorithm that is used to solve a minimization problem distributed over a network through minimizing local functions that sum up to form the overall objective function. These local functions fi contribute to local gradients adding up incrementally to form the overall gradient. Recently, the gradient coding paradigm was introduced for networks with a centralized fusion center to resolve the problem of straggler nodes. Through introducing some kind of redundancy on each node, such coding schemes are utilized to form new coded local functions gi from the original local functions fi. In this work, we consider a distributed network with a defined network topology and no fusion center. At each node, linear combinations of the local coded gradients $\nabla\overline{g}_{i}$ can be constructed to form the overall gradient. Our iterative method, referred to as Code-Based Distributed Gradient Descent (CDGD), updates each node's local estimate by applying an adequate weighing scheme. This scheme adapts the coded local gradient descent step along with local estimates from neighboring nodes. We provide the convergence analysis for CDGD and we analytically show that we enhance the convergence rate by a scaling factor over conventional incremental methods without any predefined tuning. Furthermore, we demonstrate through numerical results significant performance and enhancements for convergence rates.

查看原文本刊更多论文

基于代码的分布式梯度下降方法

分布式梯度下降算法是一种通过最小化局部函数来解决分布在网络上的最小化问题的优化算法，这些局部函数之和形成总体目标函数。这些局部函数对局部梯度有贡献，逐渐加起来形成整体梯度。近年来，为了解决分散节点的问题，在具有集中融合中心的网络中引入了梯度编码范式。这些编码方案通过在每个节点上引入某种冗余，将原有的局部函数fi编码为新的局部函数gi。在这项工作中，我们考虑了一个具有定义的网络拓扑和无融合中心的分布式网络。在每个节点上，可以构造局部编码梯度$\nabla\overline{g}_{i}$的线性组合，形成整体梯度。我们的迭代方法，称为基于代码的分布式梯度下降(CDGD)，通过应用适当的加权方案来更新每个节点的局部估计。该方案采用了编码的局部梯度下降步骤以及邻近节点的局部估计。我们提供了CDGD的收敛性分析，并分析表明，我们在没有任何预定义调优的情况下，比传统的增量方法提高了一个比例因子的收敛率。此外，我们通过数值结果证明了显著的性能和收敛速度的增强。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

自引率

0.00%

发文量