{"title":"Diffusive limit approximation of pure jump optimal ergodic control problems","authors":"Marc Abeille , Bruno Bouchard , Lorenzo Croissant","doi":"10.1016/j.spa.2024.104536","DOIUrl":null,"url":null,"abstract":"<div><div>Motivated by the design of fast reinforcement learning algorithms, see (Croissant et al., 2024), we study the diffusive limit of a class of pure jump ergodic stochastic control problems. We show that, whenever the intensity of jumps <span><math><msup><mrow><mi>ɛ</mi></mrow><mrow><mo>−</mo><mn>1</mn></mrow></msup></math></span> is large enough, the approximation error is governed by the Hölder regularity of the Hessian matrix of the solution to the limit ergodic partial differential equation and is, indeed, of order <span><math><msup><mrow><mi>ɛ</mi></mrow><mrow><mfrac><mrow><mi>γ</mi></mrow><mrow><mn>2</mn></mrow></mfrac></mrow></msup></math></span> for all <span><math><mrow><mi>γ</mi><mo>∈</mo><mrow><mo>(</mo><mn>0</mn><mo>,</mo><mn>1</mn><mo>)</mo></mrow></mrow></math></span>. This extends to this context the results of Abeille et al. (2023) obtained for finite horizon problems. Using the limit as an approximation, instead of directly solving the pre-limit problem, allows for a very significant reduction in the numerical resolution cost of the control problem. Additionally, we explain how error correction terms of this approximation can be constructed under appropriate smoothness assumptions. Finally, we quantify the error induced by the use of the Markov control policy constructed from the numerical finite difference scheme associated to the limit diffusive problem, which seems to be new in the literature and of independent interest.</div></div>","PeriodicalId":51160,"journal":{"name":"Stochastic Processes and their Applications","volume":"181 ","pages":"Article 104536"},"PeriodicalIF":1.1000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Processes and their Applications","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0304414924002448","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
Motivated by the design of fast reinforcement learning algorithms, see (Croissant et al., 2024), we study the diffusive limit of a class of pure jump ergodic stochastic control problems. We show that, whenever the intensity of jumps is large enough, the approximation error is governed by the Hölder regularity of the Hessian matrix of the solution to the limit ergodic partial differential equation and is, indeed, of order for all . This extends to this context the results of Abeille et al. (2023) obtained for finite horizon problems. Using the limit as an approximation, instead of directly solving the pre-limit problem, allows for a very significant reduction in the numerical resolution cost of the control problem. Additionally, we explain how error correction terms of this approximation can be constructed under appropriate smoothness assumptions. Finally, we quantify the error induced by the use of the Markov control policy constructed from the numerical finite difference scheme associated to the limit diffusive problem, which seems to be new in the literature and of independent interest.
期刊介绍:
Stochastic Processes and their Applications publishes papers on the theory and applications of stochastic processes. It is concerned with concepts and techniques, and is oriented towards a broad spectrum of mathematical, scientific and engineering interests.
Characterization, structural properties, inference and control of stochastic processes are covered. The journal is exacting and scholarly in its standards. Every effort is made to promote innovation, vitality, and communication between disciplines. All papers are refereed.