{"title":"逆均值场博弈的策略迭代法","authors":"Kui Ren, Nathan Soedjak, Shanyin Tong","doi":"arxiv-2409.06184","DOIUrl":null,"url":null,"abstract":"We propose a policy iteration method to solve an inverse problem for a\nmean-field game model, specifically to reconstruct the obstacle function in the\ngame from the partial observation data of value functions, which represent the\noptimal costs for agents. The proposed approach decouples this complex inverse\nproblem, which is an optimization problem constrained by a coupled nonlinear\nforward and backward PDE system in the MFG, into several iterations of solving\nlinear PDEs and linear inverse problems. This method can also be viewed as a\nfixed-point iteration that simultaneously solves the MFG system and inversion.\nWe further prove its linear rate of convergence. In addition, numerical\nexamples in 1D and 2D, along with performance comparisons to a direct\nleast-squares method, demonstrate the superior efficiency and accuracy of the\nproposed method for solving inverse MFGs.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Policy Iteration Method for Inverse Mean Field Games\",\"authors\":\"Kui Ren, Nathan Soedjak, Shanyin Tong\",\"doi\":\"arxiv-2409.06184\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a policy iteration method to solve an inverse problem for a\\nmean-field game model, specifically to reconstruct the obstacle function in the\\ngame from the partial observation data of value functions, which represent the\\noptimal costs for agents. The proposed approach decouples this complex inverse\\nproblem, which is an optimization problem constrained by a coupled nonlinear\\nforward and backward PDE system in the MFG, into several iterations of solving\\nlinear PDEs and linear inverse problems. This method can also be viewed as a\\nfixed-point iteration that simultaneously solves the MFG system and inversion.\\nWe further prove its linear rate of convergence. In addition, numerical\\nexamples in 1D and 2D, along with performance comparisons to a direct\\nleast-squares method, demonstrate the superior efficiency and accuracy of the\\nproposed method for solving inverse MFGs.\",\"PeriodicalId\":501286,\"journal\":{\"name\":\"arXiv - MATH - Optimization and Control\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - MATH - Optimization and Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.06184\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - MATH - Optimization and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06184","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Policy Iteration Method for Inverse Mean Field Games
We propose a policy iteration method to solve an inverse problem for a
mean-field game model, specifically to reconstruct the obstacle function in the
game from the partial observation data of value functions, which represent the
optimal costs for agents. The proposed approach decouples this complex inverse
problem, which is an optimization problem constrained by a coupled nonlinear
forward and backward PDE system in the MFG, into several iterations of solving
linear PDEs and linear inverse problems. This method can also be viewed as a
fixed-point iteration that simultaneously solves the MFG system and inversion.
We further prove its linear rate of convergence. In addition, numerical
examples in 1D and 2D, along with performance comparisons to a direct
least-squares method, demonstrate the superior efficiency and accuracy of the
proposed method for solving inverse MFGs.