{"title":"McKean-Vlasov型和分支粒子系统近似的离散时间部分可观察随机最优控制问题","authors":"Hexiang Wan;Guangchen Wang;Jie Xiong","doi":"10.1109/TAC.2024.3525256","DOIUrl":null,"url":null,"abstract":"This article investigates a broad category of McKean–Vlasov type discrete-time partially observable stochastic optimal control problems. The first goal is to prove the dynamic programming principle (DPP) by means of the measurable selection argument, which provides a methodology for finding both the value function as well as the optimal control. Here, we employ the Nisio semigroup technology, which is an intrinsic characterization of the DPP. Then, we derive the recursive formula for the filter process, which enables us to clearly track the time evolution of the posterior distribution. Next, we approximate the posterior distribution utilizing branching particle systems (branching particle filters) and illustrate its convergence. Making use of branching particle system approximations and Bellman equations, we devise a numerical algorithm for addressing the optimal control problem. A numerical experiment serves as the final part of this article.","PeriodicalId":13201,"journal":{"name":"IEEE Transactions on Automatic Control","volume":"70 7","pages":"4376-4391"},"PeriodicalIF":7.0000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discrete-Time Partially Observable Stochastic Optimal Control Problems of McKean–Vlasov Type and Branching Particle System Approximations\",\"authors\":\"Hexiang Wan;Guangchen Wang;Jie Xiong\",\"doi\":\"10.1109/TAC.2024.3525256\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article investigates a broad category of McKean–Vlasov type discrete-time partially observable stochastic optimal control problems. The first goal is to prove the dynamic programming principle (DPP) by means of the measurable selection argument, which provides a methodology for finding both the value function as well as the optimal control. Here, we employ the Nisio semigroup technology, which is an intrinsic characterization of the DPP. Then, we derive the recursive formula for the filter process, which enables us to clearly track the time evolution of the posterior distribution. Next, we approximate the posterior distribution utilizing branching particle systems (branching particle filters) and illustrate its convergence. Making use of branching particle system approximations and Bellman equations, we devise a numerical algorithm for addressing the optimal control problem. A numerical experiment serves as the final part of this article.\",\"PeriodicalId\":13201,\"journal\":{\"name\":\"IEEE Transactions on Automatic Control\",\"volume\":\"70 7\",\"pages\":\"4376-4391\"},\"PeriodicalIF\":7.0000,\"publicationDate\":\"2025-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Automatic Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10820100/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automatic Control","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10820100/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Discrete-Time Partially Observable Stochastic Optimal Control Problems of McKean–Vlasov Type and Branching Particle System Approximations
This article investigates a broad category of McKean–Vlasov type discrete-time partially observable stochastic optimal control problems. The first goal is to prove the dynamic programming principle (DPP) by means of the measurable selection argument, which provides a methodology for finding both the value function as well as the optimal control. Here, we employ the Nisio semigroup technology, which is an intrinsic characterization of the DPP. Then, we derive the recursive formula for the filter process, which enables us to clearly track the time evolution of the posterior distribution. Next, we approximate the posterior distribution utilizing branching particle systems (branching particle filters) and illustrate its convergence. Making use of branching particle system approximations and Bellman equations, we devise a numerical algorithm for addressing the optimal control problem. A numerical experiment serves as the final part of this article.
期刊介绍:
In the IEEE Transactions on Automatic Control, the IEEE Control Systems Society publishes high-quality papers on the theory, design, and applications of control engineering. Two types of contributions are regularly considered:
1) Papers: Presentation of significant research, development, or application of control concepts.
2) Technical Notes and Correspondence: Brief technical notes, comments on published areas or established control topics, corrections to papers and notes published in the Transactions.
In addition, special papers (tutorials, surveys, and perspectives on the theory and applications of control systems topics) are solicited.