{"title":"风险敏感控制、单控制器博弈和线性规划","authors":"V. Borkar","doi":"10.3934/jdg.2023024","DOIUrl":null,"url":null,"abstract":"This article recalls the recent work on a linear programming formulation of infinite horizon risk-sensitive control via its equivalence with a single controller game, using a classic work of Vrieze. This is then applied to a constrained risk-sensitive control problem with a risk-sensitive cost and risk-sensitive constraint. This facilitates a Lagrange multiplier based resolution thereof. In the process, this leads to an unconstrained linear program and its dual, parametrized by a parameter that is a surrogate for Lagrange multiplier. This also opens up the possibility of a primal - dual type numerical scheme wherein the linear program is a subroutine within the subgradient ascent based update rule for the Lagrange multiplier. This equivalent unconstrained risk-sensitive control formulation does not seem obvious without the linear programming equivalents as intermediaries. We also discuss briefly other related algorithmic possibilities for future research.","PeriodicalId":42722,"journal":{"name":"Journal of Dynamics and Games","volume":"36 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Risk-sensitive control, single controller games and linear programming\",\"authors\":\"V. Borkar\",\"doi\":\"10.3934/jdg.2023024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article recalls the recent work on a linear programming formulation of infinite horizon risk-sensitive control via its equivalence with a single controller game, using a classic work of Vrieze. This is then applied to a constrained risk-sensitive control problem with a risk-sensitive cost and risk-sensitive constraint. This facilitates a Lagrange multiplier based resolution thereof. In the process, this leads to an unconstrained linear program and its dual, parametrized by a parameter that is a surrogate for Lagrange multiplier. This also opens up the possibility of a primal - dual type numerical scheme wherein the linear program is a subroutine within the subgradient ascent based update rule for the Lagrange multiplier. This equivalent unconstrained risk-sensitive control formulation does not seem obvious without the linear programming equivalents as intermediaries. We also discuss briefly other related algorithmic possibilities for future research.\",\"PeriodicalId\":42722,\"journal\":{\"name\":\"Journal of Dynamics and Games\",\"volume\":\"36 1\",\"pages\":\"\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2023-10-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Dynamics and Games\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3934/jdg.2023024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Dynamics and Games","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3934/jdg.2023024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Risk-sensitive control, single controller games and linear programming
This article recalls the recent work on a linear programming formulation of infinite horizon risk-sensitive control via its equivalence with a single controller game, using a classic work of Vrieze. This is then applied to a constrained risk-sensitive control problem with a risk-sensitive cost and risk-sensitive constraint. This facilitates a Lagrange multiplier based resolution thereof. In the process, this leads to an unconstrained linear program and its dual, parametrized by a parameter that is a surrogate for Lagrange multiplier. This also opens up the possibility of a primal - dual type numerical scheme wherein the linear program is a subroutine within the subgradient ascent based update rule for the Lagrange multiplier. This equivalent unconstrained risk-sensitive control formulation does not seem obvious without the linear programming equivalents as intermediaries. We also discuss briefly other related algorithmic possibilities for future research.
期刊介绍:
The Journal of Dynamics and Games (JDG) is a pure and applied mathematical journal that publishes high quality peer-review and expository papers in all research areas of expertise of its editors. The main focus of JDG is in the interface of Dynamical Systems and Game Theory.