Diffusive limit approximation of pure jump optimal ergodic control problems

IF 1.1 2区数学 Q3 STATISTICS & PROBABILITY

Stochastic Processes and their Applications Pub Date : 2024-11-22 DOI:10.1016/j.spa.2024.104536

Marc Abeille , Bruno Bouchard , Lorenzo Croissant

{"title":"Diffusive limit approximation of pure jump optimal ergodic control problems","authors":"Marc Abeille , Bruno Bouchard , Lorenzo Croissant","doi":"10.1016/j.spa.2024.104536","DOIUrl":null,"url":null,"abstract":"<div><div>Motivated by the design of fast reinforcement learning algorithms, see (Croissant et al., 2024), we study the diffusive limit of a class of pure jump ergodic stochastic control problems. We show that, whenever the intensity of jumps <span><math><msup><mrow><mi>ɛ</mi></mrow><mrow><mo>−</mo><mn>1</mn></mrow></msup></math></span> is large enough, the approximation error is governed by the Hölder regularity of the Hessian matrix of the solution to the limit ergodic partial differential equation and is, indeed, of order <span><math><msup><mrow><mi>ɛ</mi></mrow><mrow><mfrac><mrow><mi>γ</mi></mrow><mrow><mn>2</mn></mrow></mfrac></mrow></msup></math></span> for all <span><math><mrow><mi>γ</mi><mo>∈</mo><mrow><mo>(</mo><mn>0</mn><mo>,</mo><mn>1</mn><mo>)</mo></mrow></mrow></math></span>. This extends to this context the results of Abeille et al. (2023) obtained for finite horizon problems. Using the limit as an approximation, instead of directly solving the pre-limit problem, allows for a very significant reduction in the numerical resolution cost of the control problem. Additionally, we explain how error correction terms of this approximation can be constructed under appropriate smoothness assumptions. Finally, we quantify the error induced by the use of the Markov control policy constructed from the numerical finite difference scheme associated to the limit diffusive problem, which seems to be new in the literature and of independent interest.</div></div>","PeriodicalId":51160,"journal":{"name":"Stochastic Processes and their Applications","volume":"181 ","pages":"Article 104536"},"PeriodicalIF":1.1000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Processes and their Applications","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0304414924002448","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 0

Abstract

Motivated by the design of fast reinforcement learning algorithms, see (Croissant et al., 2024), we study the diffusive limit of a class of pure jump ergodic stochastic control problems. We show that, whenever the intensity of jumps

ɛ^{- 1}

is large enough, the approximation error is governed by the Hölder regularity of the Hessian matrix of the solution to the limit ergodic partial differential equation and is, indeed, of order

ɛ^{\frac{γ}{2}}

for all

γ \in (0, 1)

. This extends to this context the results of Abeille et al. (2023) obtained for finite horizon problems. Using the limit as an approximation, instead of directly solving the pre-limit problem, allows for a very significant reduction in the numerical resolution cost of the control problem. Additionally, we explain how error correction terms of this approximation can be constructed under appropriate smoothness assumptions. Finally, we quantify the error induced by the use of the Markov control policy constructed from the numerical finite difference scheme associated to the limit diffusive problem, which seems to be new in the literature and of independent interest.

查看原文本刊更多论文

求助全文

约1分钟内获得全文求助全文

来源期刊

Stochastic Processes and their Applications 数学-统计学与概率论

CiteScore

2.90

自引率

7.10%

发文量

180

审稿时长

23.6 weeks

期刊介绍： Stochastic Processes and their Applications publishes papers on the theory and applications of stochastic processes. It is concerned with concepts and techniques, and is oriented towards a broad spectrum of mathematical, scientific and engineering interests. Characterization, structural properties, inference and control of stochastic processes are covered. The journal is exacting and scholarly in its standards. Every effort is made to promote innovation, vitality, and communication between disciplines. All papers are refereed.