{"title":"对风险敏感的大型和小型游戏","authors":"Yan Chen, Taoying Li, Zhixian Xin","doi":"10.1051/cocv/2022082","DOIUrl":null,"url":null,"abstract":"We investigate a class of mean field games containing a large number of major and minor players. Each player minimizes a quadratic-tracking type risk-sensitive cost functional, where the reference signal is a function of the state average term of the major and minor players. To reduce the complexity for solving the problem, we design a sequence of decentralized strategies by the Nash certainty equivalence principle. Firstly, for the optimal control problems with quadratic type risksensitive cost functionals, we propose a new verification theorem. Secondly, we apply the two-layer state aggregation method to construct the fixed-point equations for the estimations of the state average terms and give the conditions for the existence and uniqueness of the fixed points. Then, we design a sequence of decentralized strategies by the estimations of the state average terms based on local information. It is shown that the estimations of the state average terms are consistent with the true values for the closed-loop systems, and the sequence of strategies designed is a decentralized asymptotic Nash equilibrium. Finally, the effectiveness of the theoretical analysis is demonstrated by a numerical example.","PeriodicalId":50500,"journal":{"name":"Esaim-Control Optimisation and Calculus of Variations","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Risk-sensitive mean field games with major and minor players\",\"authors\":\"Yan Chen, Taoying Li, Zhixian Xin\",\"doi\":\"10.1051/cocv/2022082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We investigate a class of mean field games containing a large number of major and minor players. Each player minimizes a quadratic-tracking type risk-sensitive cost functional, where the reference signal is a function of the state average term of the major and minor players. To reduce the complexity for solving the problem, we design a sequence of decentralized strategies by the Nash certainty equivalence principle. Firstly, for the optimal control problems with quadratic type risksensitive cost functionals, we propose a new verification theorem. Secondly, we apply the two-layer state aggregation method to construct the fixed-point equations for the estimations of the state average terms and give the conditions for the existence and uniqueness of the fixed points. Then, we design a sequence of decentralized strategies by the estimations of the state average terms based on local information. It is shown that the estimations of the state average terms are consistent with the true values for the closed-loop systems, and the sequence of strategies designed is a decentralized asymptotic Nash equilibrium. Finally, the effectiveness of the theoretical analysis is demonstrated by a numerical example.\",\"PeriodicalId\":50500,\"journal\":{\"name\":\"Esaim-Control Optimisation and Calculus of Variations\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2022-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Esaim-Control Optimisation and Calculus of Variations\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1051/cocv/2022082\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Esaim-Control Optimisation and Calculus of Variations","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1051/cocv/2022082","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Risk-sensitive mean field games with major and minor players
We investigate a class of mean field games containing a large number of major and minor players. Each player minimizes a quadratic-tracking type risk-sensitive cost functional, where the reference signal is a function of the state average term of the major and minor players. To reduce the complexity for solving the problem, we design a sequence of decentralized strategies by the Nash certainty equivalence principle. Firstly, for the optimal control problems with quadratic type risksensitive cost functionals, we propose a new verification theorem. Secondly, we apply the two-layer state aggregation method to construct the fixed-point equations for the estimations of the state average terms and give the conditions for the existence and uniqueness of the fixed points. Then, we design a sequence of decentralized strategies by the estimations of the state average terms based on local information. It is shown that the estimations of the state average terms are consistent with the true values for the closed-loop systems, and the sequence of strategies designed is a decentralized asymptotic Nash equilibrium. Finally, the effectiveness of the theoretical analysis is demonstrated by a numerical example.
期刊介绍:
ESAIM: COCV strives to publish rapidly and efficiently papers and surveys in the areas of Control, Optimisation and Calculus of Variations.
Articles may be theoretical, computational, or both, and they will cover contemporary subjects with impact in forefront technology, biosciences, materials science, computer vision, continuum physics, decision sciences and other allied disciplines.
Targeted topics include:
in control: modeling, controllability, optimal control, stabilization, control design, hybrid control, robustness analysis, numerical and computational methods for control, stochastic or deterministic, continuous or discrete control systems, finite-dimensional or infinite-dimensional control systems, geometric control, quantum control, game theory;
in optimisation: mathematical programming, large scale systems, stochastic optimisation, combinatorial optimisation, shape optimisation, convex or nonsmooth optimisation, inverse problems, interior point methods, duality methods, numerical methods, convergence and complexity, global optimisation, optimisation and dynamical systems, optimal transport, machine learning, image or signal analysis;
in calculus of variations: variational methods for differential equations and Hamiltonian systems, variational inequalities; semicontinuity and convergence, existence and regularity of minimizers and critical points of functionals, relaxation; geometric problems and the use and development of geometric measure theory tools; problems involving randomness; viscosity solutions; numerical methods; homogenization, multiscale and singular perturbation problems.