具有通信延迟的分布式零阶在线优化

IF 2.3 4区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

IET Control Theory and Applications Pub Date : 2025-01-02 DOI:10.1049/cth2.12759

Keito Inoue, Naoki Hayashi, Shigemasa Takai

{"title":"具有通信延迟的分布式零阶在线优化","authors":"Keito Inoue, Naoki Hayashi, Shigemasa Takai","doi":"10.1049/cth2.12759","DOIUrl":null,"url":null,"abstract":"<p>This paper investigates distributed online optimization in a networked multiagent system, where each agent has its own private objective and constraint functions that vary over time. In many real-world scenarios, computing the gradient of the cost function can be challenging, especially when agents have limited computational capabilities. Moreover, communication delays are common in practical networked systems due to various factors. This paper considers a unified framework for distributed online optimization that can handle bandit feedback and communication delays feedback simultaneously. A distributed primal-dual algorithm is proposed that utilizes bandit feedback, in which the agents estimate the gradients of their objective and constraint functions by sampling the function values. An enlarged network model that incorporates the delayed information exchanged among the agents is introduced. Through theoretical analysis, it is shown that the proposed algorithm achieves sublinear upper bounds on both the dynamic regret and the constraint violation despite communication delays.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":"19 1","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12759","citationCount":"0","resultStr":"{\"title\":\"Distributed zeroth-order online optimization with communication delays\",\"authors\":\"Keito Inoue, Naoki Hayashi, Shigemasa Takai\",\"doi\":\"10.1049/cth2.12759\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper investigates distributed online optimization in a networked multiagent system, where each agent has its own private objective and constraint functions that vary over time. In many real-world scenarios, computing the gradient of the cost function can be challenging, especially when agents have limited computational capabilities. Moreover, communication delays are common in practical networked systems due to various factors. This paper considers a unified framework for distributed online optimization that can handle bandit feedback and communication delays feedback simultaneously. A distributed primal-dual algorithm is proposed that utilizes bandit feedback, in which the agents estimate the gradients of their objective and constraint functions by sampling the function values. An enlarged network model that incorporates the delayed information exchanged among the agents is introduced. Through theoretical analysis, it is shown that the proposed algorithm achieves sublinear upper bounds on both the dynamic regret and the constraint violation despite communication delays.</p>\",\"PeriodicalId\":50382,\"journal\":{\"name\":\"IET Control Theory and Applications\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12759\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Control Theory and Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/cth2.12759\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Control Theory and Applications","FirstCategoryId":"94","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/cth2.12759","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

本文研究了网络多智能体系统中的分布式在线优化问题，其中每个智能体都有自己的私有目标和随时间变化的约束函数。在许多现实场景中，计算成本函数的梯度可能具有挑战性，特别是当代理的计算能力有限时。此外，在实际的网络系统中，由于各种因素，通信延迟是常见的。本文提出了一种统一的分布式在线优化框架，可以同时处理强盗反馈和通信延迟反馈。提出了一种利用强盗反馈的分布式原始对偶算法，该算法通过对目标函数和约束函数的值进行采样来估计其梯度。引入了一种包含agent间延迟信息交换的扩展网络模型。理论分析表明，尽管存在通信延迟，该算法在动态遗憾和约束违反上均实现了亚线性上界。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Distributed zeroth-order online optimization with communication delays

查看原文本刊更多论文

Distributed zeroth-order online optimization with communication delays

This paper investigates distributed online optimization in a networked multiagent system, where each agent has its own private objective and constraint functions that vary over time. In many real-world scenarios, computing the gradient of the cost function can be challenging, especially when agents have limited computational capabilities. Moreover, communication delays are common in practical networked systems due to various factors. This paper considers a unified framework for distributed online optimization that can handle bandit feedback and communication delays feedback simultaneously. A distributed primal-dual algorithm is proposed that utilizes bandit feedback, in which the agents estimate the gradients of their objective and constraint functions by sampling the function values. An enlarged network model that incorporates the delayed information exchanged among the agents is introduced. Through theoretical analysis, it is shown that the proposed algorithm achieves sublinear upper bounds on both the dynamic regret and the constraint violation despite communication delays.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IET Control Theory and Applications 工程技术-工程：电子与电气

CiteScore

5.70

自引率

7.70%

发文量

167

审稿时长

5.1 months

期刊介绍： IET Control Theory & Applications is devoted to control systems in the broadest sense, covering new theoretical results and the applications of new and established control methods. Among the topics of interest are system modelling, identification and simulation, the analysis and design of control systems (including computer-aided design), and practical implementation. The scope encompasses technological, economic, physiological (biomedical) and other systems, including man-machine interfaces. Most of the papers published deal with original work from industrial and government laboratories and universities, but subject reviews and tutorial expositions of current methods are welcomed. Correspondence discussing published papers is also welcomed. Applications papers need not necessarily involve new theory. Papers which describe new realisations of established methods, or control techniques applied in a novel situation, or practical studies which compare various designs, would be of interest. Of particular value are theoretical papers which discuss the applicability of new work or applications which engender new theoretical applications.