2021 Seventh Indian Control Conference (ICC)最新文献_第8页

Robustness of Markov perfect equilibrium to model approximations in general-sum dynamic games 马尔可夫完美均衡对一般和动态博弈模型逼近的鲁棒性

2021 Seventh Indian Control Conference (ICC) Pub Date : 2021-12-20 DOI: 10.1109/ICC54714.2021.9703156

Jayakumar Subramanian, Amit Sinha, Aditya Mahajan

{"title":"Robustness of Markov perfect equilibrium to model approximations in general-sum dynamic games","authors":"Jayakumar Subramanian, Amit Sinha, Aditya Mahajan","doi":"10.1109/ICC54714.2021.9703156","DOIUrl":"https://doi.org/10.1109/ICC54714.2021.9703156","url":null,"abstract":"Dynamic games (also called stochastic games or Markov games) are an important class of games for modeling multi-agent interactions. In many situations, the dynamics and reward functions of the game are learnt from past data and are therefore approximate. In this paper, we study the robustness of Markov perfect equilibrium to approximations in reward and transition functions. Using approximation results from Markov decision processes, we show that the Markov perfect equilibrium of an approximate (or perturbed) game is always an approximate Markov perfect equilibrium of the original game. We provide explicit bounds on the approximation error in terms of three quantities: (i) the error in approximating the reward functions, (ii) the error in approximating the transition function, and (iii) a property of the value function of the MPE of the approximate game. The second and third quantities depend on the choice of metric on probability spaces. We also present coarser upper bounds which do not depend on the value function but only depend on the properties of the reward and transition functions of the approximate game. We illustrate the results via a numerical example.","PeriodicalId":382373,"journal":{"name":"2021 Seventh Indian Control Conference (ICC)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127861974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

A Robust Nonlinear Adaptive Control for Control of Nuclear Reactor 核反应堆鲁棒非线性自适应控制

2021 Seventh Indian Control Conference (ICC) Pub Date : 2021-12-20 DOI: 10.1109/ICC54714.2021.9703154

P. S. Reddy, S. Shimjith, A. Tiwari, S. Kar

引用次数: 3

A generalized algorithm and framework for online 3-dimensional bin packing in an automated sorting center 自动分拣中心三维在线装箱的广义算法与框架

2021 Seventh Indian Control Conference (ICC) Pub Date : 2021-11-01 DOI: 10.1109/ICC54714.2021.9703142

Ankush Ojha, Marichi Agarwal, Aniruddha Singhal, Chayan Sarkar, Supratim Ghosh, Rajesh Sinha

引用次数: 2

A Dynamic Programming Formulation for the Nonlinear Filter 非线性滤波器的动态规划公式

2021 Seventh Indian Control Conference (ICC) Pub Date : 2021-10-29 DOI: 10.1109/ICC54714.2021.9703115

Jin W. Kim, P. Mehta

引用次数: 0

A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning 多智能体强化学习的迭代对数律

2021 Seventh Indian Control Conference (ICC) Pub Date : 2021-10-27 DOI: 10.1109/ICC54714.2021.9702912

Gugan Thoppe, Bhumesh Kumar

引用次数: 3

Cooperative Target Capture Using Predefined-Time Consensus 使用预定义时间共识的协同目标捕获

2021 Seventh Indian Control Conference (ICC) Pub Date : 2021-09-03 DOI: 10.1109/ICC54714.2021.9703170

Abhinav Sinha, S. R. Kumar

引用次数: 4

Zeroth-order randomized block methods for constrained minimization of expectation-valued Lipschitz continuous functions 期望值Lipschitz连续函数约束最小化的零阶随机块方法

2021 Seventh Indian Control Conference (ICC) Pub Date : 2021-07-15 DOI: 10.1109/ICC54714.2021.9703135

U. Shanbhag, Farzad Yousefian

{"title":"Zeroth-order randomized block methods for constrained minimization of expectation-valued Lipschitz continuous functions","authors":"U. Shanbhag, Farzad Yousefian","doi":"10.1109/ICC54714.2021.9703135","DOIUrl":"https://doi.org/10.1109/ICC54714.2021.9703135","url":null,"abstract":"We consider the minimization of an $L_{0}$-Lipschitz continuous and expectation-valued function, denoted by $f$ and defined as $f(mathrm{x}) {buildrel triangleover=}mathbb{E}[tilde{f}(mathrm{x}, omega)]$, over a Cartesian product of closed and convex sets with a view towards obtaining both asymptotics as well as rate and complexity guarantees for computing an approximate stationary point (in a Clarke sense). We adopt a smoothing-based approach reliant on minimizing $f_{eta}$ where $f(mathrm{x}) {buildrel triangleover=}mathbb{E}_{u}[f(mathrm{x}+eta u)],u$ is a random variable defined on a unit sphere, and $eta > 0$. In fact, it is observed that a stationary point of the $eta$-smoothed problem is a $2eta$-stationary point for the original problem in the Clarke sense. In such a setting, we derive a suitable residual function that provides a metric for stationarity for the smoothed problem. By leveraging a zeroth-order framework reliant on utilizing sampled function evaluations implemented in a block-structured regime, we make two sets of contributions for the sequence generated by the proposed scheme. (i) The residual function of the smoothed problem tends to zero almost surely along the generated sequence; (ii) To compute an $mathrm{x}$ that ensures that the expected norm of the residual of the $eta$-smoothed problem is within $epsilon$ requires no greater than $mathcal{O}(frac{1}{etaepsilon^{2}})$ projection steps and $mathcal{O}left(frac{1}{eta^{2}epsilon^{4}}right)$ function evaluations. These statements appear to be novel with few related results available to contend with general nonsmooth, nonconvex, and stochastic regimes via zeroth-order approaches.","PeriodicalId":382373,"journal":{"name":"2021 Seventh Indian Control Conference (ICC)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121645879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Learning event-driven switched linear systems 学习事件驱动的切换线性系统

2021 Seventh Indian Control Conference (ICC) Pub Date : 2020-09-27 DOI: 10.1109/ICC54714.2021.9703130

A. Kundu, P. Prabhakar

引用次数: 0