{"title":"On the Time Consistent Solution to Optimal Stopping Problems with Expectation Constraint","authors":"S. Christensen, M. Klein, B. Schultz","doi":"10.1007/s00245-024-10202-w","DOIUrl":"10.1007/s00245-024-10202-w","url":null,"abstract":"<div><p>We study the (weak) equilibrium problem arising from the problem of optimally stopping a one-dimensional diffusion subject to an expectation constraint on the time until stopping. The weak equilibrium problem is realized with a set of randomized but purely state dependent stopping times as admissible strategies. We derive a verification theorem and necessary conditions for equilibria, which together basically characterize all equilibria. Furthermore, additional structural properties of equilibria are obtained to feed a possible guess-and-verify approach, which is then illustrated by an example.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00245-024-10202-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142858602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Continuous Time q-Learning for Mean-Field Control Problems","authors":"Xiaoli Wei, Xiang Yu","doi":"10.1007/s00245-024-10205-7","DOIUrl":"10.1007/s00245-024-10205-7","url":null,"abstract":"<div><p>This paper studies the q-learning, recently coined as the continuous time counterpart of Q-learning by Jia and Zhou (J Mach Learn Res 24:1–61, 2023), for continuous time mean-field control problems in the setting of entropy-regularized reinforcement learning. In contrast to the single agent’s control problem in Jia and Zhou (J Mach Learn Res 24:1–61, 2023), we reveal that two different q-functions naturally arise in mean-field control problems: (i) the integrated q-function (denoted by <i>q</i>) as the first-order approximation of the integrated Q-function introduced in Gu et al. (Oper Res 71(4):1040–1054, 2023), which can be learnt by a weak martingale condition using all test policies; and (ii) the essential q-function (denoted by <span>(q_e)</span>) that is employed in the policy improvement iterations. We show that two q-functions are related via an integral representation. Based on the weak martingale condition and our proposed searching method of test policies, some model-free learning algorithms are devised. In two examples, one in LQ control framework and one beyond LQ control framework, we can obtain the exact parameterization of the optimal value function and q-functions and illustrate our algorithms with simulation experiments.\u0000</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142826395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching","authors":"Robert Denkert, Huyên Pham, Xavier Warin","doi":"10.1007/s00245-024-10207-5","DOIUrl":"10.1007/s00245-024-10207-5","url":null,"abstract":"<div><p>We propose a comprehensive framework for policy gradient methods tailored to continuous time reinforcement learning. This is based on the connection between stochastic control problems and randomised problems, enabling applications across various classes of Markovian continuous time control problems, beyond diffusion models, including e.g. regular, impulse and optimal stopping/switching problems. By utilizing change of measure in the control randomisation technique, we derive a new policy gradient representation for these randomised problems, featuring parametrised intensity policies. We further develop actor-critic algorithms specifically designed to address general Markovian stochastic control issues. Our framework is demonstrated through its application to optimal switching problems, with two numerical case studies in the energy sector focusing on real options.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142826391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Coarse Correlated Equilibria in Linear Quadratic Mean Field Games and Application to an Emission Abatement Game","authors":"Luciano Campi, Federico Cannerozzi, Fanny Cartellier","doi":"10.1007/s00245-024-10198-3","DOIUrl":"10.1007/s00245-024-10198-3","url":null,"abstract":"<div><p>Coarse correlated equilibria (CCE) are a good alternative to Nash equilibria (NE), as they arise more naturally as outcomes of learning algorithms and as they may exhibit higher payoffs than NE. CCEs include a device which allows players’ strategies to be correlated without any cooperation, only through information sent by a mediator. We develop a methodology to concretely compute mean field CCEs in a linear-quadratic mean field game (MFG) framework. We compare their performance to mean field control solutions and mean field NE (usually named MFG solutions). Our approach is implemented in the mean field version of an emission abatement game between greenhouse gas emitters. In particular, we exhibit a simple and tractable class of mean field CCEs which allows to outperform very significantly the mean field NE payoff and abatement levels, bridging the gap between the mean field NE and the social optimum obtained by mean field control.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00245-024-10198-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142821384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Well-Posed Uniform Solvability of Convex Optimization Problems on a Uniform Differentiable Closed Convex Set","authors":"Shaoqiang Shang","doi":"10.1007/s00245-024-10206-6","DOIUrl":"10.1007/s00245-024-10206-6","url":null,"abstract":"<div><p>In this paper, we first give the definition of uniformly differentiable set and give the definitions of sets <span>(P(A,eta , r))</span> and <span>(P_{A,delta }(f))</span>. Secondly, we prove that if the set <i>A</i> is bounded closed convex set, then <i>A</i> is uniformly differentiable if and only if for any <span>(varepsilon , eta , r>0)</span>, there exists <span>(delta =delta (varepsilon ,eta ,r )>0)</span> such that <span>(Vert x-yVert <varepsilon )</span> whenever <span>(fin P(A,eta , r))</span>, <span>(yin P_{A,delta }(f))</span> and <span>(xin P_{A}(f))</span>. Moreover, we also prove that if <i>A</i> is a bounded closed convex set in a finite-dimensional space <i>X</i>, then <i>A</i> is differentiable if and only if <i>A</i> is uniformly differentiable. Finally, we give some examples of uniformly differentiable set. Therefore, we extend some conclusions (SIAM J. Optim. Vol. 30, No. 1, pp. 490–512).</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142811195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Global Boundedness of Solutions to a Food Chain Model with Nonlinear Taxis Sensitivity","authors":"Enhui Pan, Changchun Liu","doi":"10.1007/s00245-024-10208-4","DOIUrl":"10.1007/s00245-024-10208-4","url":null,"abstract":"<div><p>In this paper, we investigate the initial boundary value problem of a three-species spatial food chain model with nonlinear taxis sensitivity in a bounded domain <span>(Omega subset {mathbb {R}}^2)</span> with a smooth boundary and homogeneous Neumann boundary conditions. By establishing a new energy functional, we demonstrate the global boundedness of classical solutions under appropriate initial data.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142798561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimal Consumption–Investment with Constraints in a Regime Switching Market with Random Coefficients","authors":"Ying Hu, Xiaomin Shi, Zuo Quan Xu","doi":"10.1007/s00245-024-10203-9","DOIUrl":"10.1007/s00245-024-10203-9","url":null,"abstract":"<div><p>This paper studies finite-time optimal consumption–investment problems with power, logarithmic and exponential utilities, in a regime switching market with random coefficients and possibly subject to non-convex constraints. Compared to the existing models, one distinguish feature of our model is that the trading constraints put on the consumption and investment strategies are coupled together in the cases of power and logarithmic utilities, leading to new technical challenges. We provide explicit optimal consumption–investment strategies and optimal values for these consumption–investment problems, which are expressed in terms of the solutions to some multi-dimensional diagonally quadratic backward stochastic differential equation (BSDE) and linear BSDE with unbound coefficients. These BSDEs are new in the literature and solving them is one of the main theoretical contributions of this paper. We accomplish the latter by applying the truncation, approximation technique to get some a priori uniformly lower and upper bounds for their solutions.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142798420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimality Conditions for Parabolic Stochastic Optimal Control Problems with Boundary Controls","authors":"Piero Visconti","doi":"10.1007/s00245-024-10204-8","DOIUrl":"10.1007/s00245-024-10204-8","url":null,"abstract":"<div><p>In this paper, we study optimality conditions for a class of control problems driven by a cylindrical Wiener process, resulting in a stochastic maximum principle in differential form. The control acts on both the drift and volatility, potentially as unbounded operators, allowing for SPDEs with boundary control and/or noise. Through the factorization method, we establish a regularity property for the state equation, which, by duality, extends to the backward costate equation, understood in the transposition sense. Finally, we show that the cost functional is Gâteaux differentiable, with its derivative represented by the costate. The optimality condition is derived using results from set-valued analysis.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142798509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Stopper vs. Singular Controller Games With Degenerate Diffusions","authors":"Andrea Bovo, Tiziano De Angelis, Jan Palczewski","doi":"10.1007/s00245-024-10199-2","DOIUrl":"10.1007/s00245-024-10199-2","url":null,"abstract":"<div><p>We study zero-sum stochastic games between a singular controller and a stopper when the (state-dependent) diffusion matrix of the underlying controlled diffusion process is degenerate. In particular, we show the existence of a value for the game and determine an optimal strategy for the stopper. The degeneracy of the dynamics prevents the use of analytical methods based on solution in Sobolev spaces of suitable variational problems. Therefore we adopt a probabilistic approach based on a perturbation of the underlying diffusion modulated by a parameter <span>(gamma >0)</span>. For each <span>(gamma >0)</span> the approximating game is non-degenerate and admits a value <span>(u^gamma )</span> and an optimal strategy <span>(tau ^gamma _*)</span> for the stopper. Letting <span>(gamma rightarrow 0)</span> we prove convergence of <span>(u^gamma )</span> to a function <i>v</i>, which identifies the value of the original game. We also construct explicitly optimal stopping times <span>(theta ^gamma _*)</span> for <span>(u^gamma )</span>, related but not equal to <span>(tau ^gamma _*)</span>, which converge almost surely to an optimal stopping time <span>(theta _*)</span> for the game with degenerate dynamics.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00245-024-10199-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142778199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Almost Sectorial Operators in Fractional Superdiffusion Equations","authors":"Eduardo Cuesta, Rodrigo Ponce","doi":"10.1007/s00245-024-10201-x","DOIUrl":"10.1007/s00245-024-10201-x","url":null,"abstract":"<div><p>In this paper the resolvent family <span>({S_{alpha ,beta }(t)}_{tge 0}subset mathcal {L}(X,Y))</span> generated by an almost sectorial operator <i>A</i>, where <span>(alpha ,beta >0,)</span> <i>X</i>, <i>Y</i> are complex Banach spaces and its Laplace transform satisfies <span>(hat{S}_{alpha ,beta }(z)=z^{alpha -beta }(z^alpha -A)^{-1})</span> is studied. This family of operators allows to write the solution to an abstract initial value problem of time fractional type of order <span>(1<alpha <2)</span> as a variation of constants formula. Estimates of the norm <span>(Vert S_{alpha ,beta }(t)Vert ,)</span> as well as the continuity and compactness of <span>(S_{alpha ,beta }(t))</span>, for <span>(t>0)</span>, are shown. Moreover, the Hölder regularity of its solutions is also studied.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"91 1","pages":""},"PeriodicalIF":1.6,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142761958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}