José Luis Pérez, Neofytos Rodosthenous, Kazutoshi Yamazaki
{"title":"Nonzero-Sum Optimal Stopping Game with Continuous vs. Periodic Exercise Opportunities","authors":"José Luis Pérez, Neofytos Rodosthenous, Kazutoshi Yamazaki","doi":"10.1287/moor.2023.0123","DOIUrl":"https://doi.org/10.1287/moor.2023.0123","url":null,"abstract":"We introduce a new nonzero-sum game of optimal stopping with asymmetric exercise opportunities. Given a stochastic process modeling the value of an asset, one player observes and can act on the process continuously, whereas the other player can act on it only periodically at independent Poisson arrival times. The first one to stop receives a reward, different for each player, whereas the other one gets nothing. We study how each player balances the maximization of gains against the maximization of the likelihood of stopping before the opponent. In such a setup driven by a Lévy process with positive jumps, we not only prove the existence but also explicitly construct a Nash equilibrium with values of the game written in terms of the scale function. Numerical illustrations with put-option payoffs are also provided to study the behavior of the players’ strategies as well as the quantification of the value of available exercise opportunities.Funding: K. Yamazaki was partly supported by The Japan Society for the Promotion of Science (JSPS) Grant-in-Aid for Scientific Research (KAKENHI) [Grants 19H01791, 20K03758, and 24K06844], Open Partnership Joint Research Projects [Grant JPJSBP120209921], and the University of Queensland [start-up grant].","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141863649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Exploration Noise for Learning Linear-Quadratic Mean Field Games","authors":"François Delarue, Athanasios Vasileiadis","doi":"10.1287/moor.2021.0157","DOIUrl":"https://doi.org/10.1287/moor.2021.0157","url":null,"abstract":"The goal of this paper is to demonstrate that common noise may serve as an exploration noise for learning the solution of a mean field game. This concept is here exemplified through a toy linear-quadratic model, for which a suitable form of common noise has already been proven to restore existence and uniqueness. We here go one step further and prove that the same form of common noise may force the convergence of the learning algorithm called fictitious play, and this without any further potential or monotone structure. Several numerical examples are provided to support our theoretical analysis.Funding: F. Delarue acknowledges the financial support of the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme [AdG ELISA project, Grant 101054746]. A. Vasileiadis acknowledge the financial support of French ANR project ANR-19-P3IA-0002-3IA Côte d'Azur-Nice-Interdisciplinary Institute for Artificial Intelligence.","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Moment-Sum-of-Squares Hierarchy for Robust Polynomial Matrix Inequality Optimization with Sum-of-Squares Convexity","authors":"Feng Guo, Jie Wang","doi":"10.1287/moor.2023.0361","DOIUrl":"https://doi.org/10.1287/moor.2023.0361","url":null,"abstract":"We study a class of polynomial optimization problems with a robust polynomial matrix inequality (PMI) constraint where the uncertainty set itself is also defined by a PMI. These can be viewed as matrix generalizations of semi-infinite polynomial programs because they involve actually infinitely many PMI constraints in general. Under certain sum-of-squares (SOS)-convexity assumptions, we construct a hierarchy of increasingly tight moment-SOS relaxations for solving such problems. Most of the nice features of the moment-SOS hierarchy for the usual polynomial optimization are extended to this more complicated setting. In particular, asymptotic convergence of the hierarchy is guaranteed, and finite convergence can be certified if some flat extension condition holds true. To extract global minimizers, we provide a linear algebra procedure for recovering a finitely atomic matrix-valued measure from truncated matrix-valued moments. As an application, we are able to solve the problem of minimizing the smallest eigenvalue of a polynomial matrix subject to a PMI constraint. If SOS convexity is replaced by convexity, we can still approximate the optimal value as closely as desired by solving a sequence of semidefinite programs and certify global optimality in case that certain flat extension conditions hold true. Finally, an extension to the nonconvexity setting is provided under a rank 1 condition. To obtain the above-mentioned results, techniques from real algebraic geometry, matrix-valued measure theory, and convex optimization are employed. Funding: This work was supported by the National Key Research and Development Program of China [Grant 2023YFA1009401] and the National Natural Science Foundation of China [Grants 11571350 and 12201618].","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141812366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Risk-Averse Markov Decision Processes Through a Distributional Lens","authors":"Ziteng Cheng, Sebastian Jaimungal","doi":"10.1287/moor.2023.0211","DOIUrl":"https://doi.org/10.1287/moor.2023.0211","url":null,"abstract":"By adopting a distributional viewpoint on law-invariant convex risk measures, we construct dynamic risk measures (DRMs) at the distributional level. We then apply these DRMs to investigate Markov decision processes, incorporating latent costs, random actions, and weakly continuous transition kernels. Furthermore, the proposed DRMs allow risk aversion to change dynamically. Under mild assumptions, we derive a dynamic programming principle and show the existence of an optimal policy in both finite and infinite time horizons. Moreover, we provide a sufficient condition for the optimality of deterministic actions. For illustration, we conclude the paper with examples from optimal liquidation with limit order books and autonomous driving.Funding: This work was supported by Natural Sciences and Engineering Research Council of Canada [Grants RGPAS-2018-522715 and RGPIN-2018-05705].","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141738361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On the Convex Formulations of Robust Markov Decision Processes","authors":"Julien Grand-Clément, Marek Petrik","doi":"10.1287/moor.2022.0284","DOIUrl":"https://doi.org/10.1287/moor.2022.0284","url":null,"abstract":"Robust Markov decision processes (MDPs) are used for applications of dynamic optimization in uncertain environments and have been studied extensively. Many of the main properties and algorithms of MDPs, such as value iteration and policy iteration, extend directly to RMDPs. Surprisingly, there is no known analog of the MDP convex optimization formulation for solving RMDPs. This work describes the first convex optimization formulation of RMDPs under the classical sa-rectangularity and s-rectangularity assumptions. By using entropic regularization and exponential change of variables, we derive a convex formulation with a number of variables and constraints polynomial in the number of states and actions, but with large coefficients in the constraints. We further simplify the formulation for RMDPs with polyhedral, ellipsoidal, or entropy-based uncertainty sets, showing that, in these cases, RMDPs can be reformulated as conic programs based on exponential cones, quadratic cones, and nonnegative orthants. Our work opens a new research direction for RMDPs and can serve as a first step toward obtaining a tractable convex formulation of RMDPs.Funding: The work in the paper was supported, in part, by NSF [Grants 2144601 and 1815275]; and Agence Nationale de la Recherche [Grant 11-LABX-0047].","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141721356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimality Conditions in Control Problems with Random State Constraints in Probabilistic or Almost Sure Form","authors":"Caroline Geiersbach, René Henrion","doi":"10.1287/moor.2023.0177","DOIUrl":"https://doi.org/10.1287/moor.2023.0177","url":null,"abstract":"In this paper, we discuss optimality conditions for optimization problems involving random state constraints, which are modeled in probabilistic or almost sure form. Although the latter can be understood as the limiting case of the former, the derivation of optimality conditions requires substantially different approaches. We apply them to a linear elliptic partial differential equation with random inputs. In the probabilistic case, we rely on the spherical-radial decomposition of Gaussian random vectors in order to formulate fully explicit optimality conditions involving a spherical integral. In the almost sure case, we derive optimality conditions and compare them with a model based on robust constraints with respect to the (compact) support of the given distribution.Funding: The authors thank the Deutsche Forschungsgemeinschaft [Projects B02 and B04 in the “Sonderforschungsbereich/Transregio 154 Mathematical Modelling, Simulation and Optimization Using the Example of Gas Networks”] for support. C. Geiersbach acknowledges support from the Deutsche Forschungsgemeinschaft [Germany’s Excellence Strategy–the Berlin Mathematics Research Center MATH+ Grant EXC-2046/1, Project 390685689]. R. Henrion acknowledges support from the Fondation Mathématique Jacques Hadamard [Program Gaspard Monge in Optimization and Operations Research, including support to this program by Electricité de France].","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141721355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Order Independence in Sequential, Issue-by-Issue Voting","authors":"Alex Gershkov, Benny Moldovanu, Xianwen Shi","doi":"10.1287/moor.2022.0342","DOIUrl":"https://doi.org/10.1287/moor.2022.0342","url":null,"abstract":"We study when the voting outcome is independent of the order of issues put up for vote in a spatial multidimensional voting model. Agents equipped with norm-based preferences that use a norm to measure the distance from their ideal policy vote sequentially and issue by issue via simple majority. If the underlying norm is generated by an inner product—such as the Euclidean norm—then the voting outcome is order independent if and only if the issues are orthogonal. If the underlying norm is a general one, then the outcome is order independent if the basis defining the issues to be voted upon satisfies the following property; for any vector in the basis, any linear combination of the other vectors is Birkhoff–James orthogonal to it. We prove a partial converse in the case of two dimensions; if the underlying basis fails this property, then the voting order matters. Finally, despite existence results for the two-dimensional case and for the general l<jats:sub>p</jats:sub> case, we show that nonexistence of bases with this property is generic.Funding: The research of A. Gershkov is supported by the Israel Science Foundation [Grant 1118/22]. The research of B. Moldovanu is supported by the German Science Foundation through the Hausdorff Center for Mathematics and The Collaborative Research Center Transregio 224. The research of X. Shi is supported by the Social Sciences and Humanities Research Council of Canada.","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141614395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Large Independent Sets in Recursive Markov Random Graphs","authors":"Akshay Gupte, Yiran Zhu","doi":"10.1287/moor.2022.0215","DOIUrl":"https://doi.org/10.1287/moor.2022.0215","url":null,"abstract":"Computing the maximum size of an independent set in a graph is a famously hard combinatorial problem that has been well studied for various classes of graphs. When it comes to random graphs, the classic Erdős–Rényi–Gilbert random graph [Formula: see text] has been analyzed and shown to have the largest independent sets of size [Formula: see text] with high probability (w.h.p.) This classic model does not capture any dependency structure between edges that can appear in real-world networks. We define random graphs [Formula: see text] whose existence of edges is determined by a Markov process that is also governed by a decay parameter [Formula: see text]. We prove that w.h.p. [Formula: see text] has independent sets of size [Formula: see text] for arbitrary [Formula: see text]. This is derived using bounds on the terms of a harmonic series, a Turán bound on a stability number, and a concentration analysis for a certain sequence of dependent Bernoulli variables that may also be of independent interest. Because [Formula: see text] collapses to [Formula: see text] when there is no decay, it follows that having even the slightest bit of dependency (any [Formula: see text]) in the random graph construction leads to the presence of large independent sets, and thus, our random model has a phase transition at its boundary value of r = 1. This implies that there are large matchings in the line graph of [Formula: see text], which is a Markov random field. For the maximal independent set output by a greedy algorithm, we deduce that it has a performance ratio of at most [Formula: see text] w.h.p. when the lowest degree vertex is picked at each iteration and also show that, under any other permutation of vertices, the algorithm outputs a set of size [Formula: see text], where [Formula: see text] and, hence, has a performance ratio of [Formula: see text].Funding: The initial phase of this research was supported by the National Science Foundation [Grant DMS-1913294].","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141609401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Rockafellian Relaxation and Stochastic Optimization Under Perturbations","authors":"Johannes O. Royset, Louis L. Chen, Eric Eckstrand","doi":"10.1287/moor.2022.0122","DOIUrl":"https://doi.org/10.1287/moor.2022.0122","url":null,"abstract":"In practice, optimization models are often prone to unavoidable inaccuracies because of dubious assumptions and corrupted data. Traditionally, this placed special emphasis on risk-based and robust formulations, and their focus on “conservative” decisions. We develop, in contrast, an “optimistic” framework based on Rockafellian relaxations in which optimization is conducted not only over the original decision space but also jointly with a choice of model perturbation. The framework enables us to address challenging problems with ambiguous probability distributions from the areas of two-stage stochastic optimization without relatively complete recourse, probability functions lacking continuity properties, expectation constraints, and outlier analysis. We are also able to circumvent the fundamental difficulty in stochastic optimization that convergence of distributions fails to guarantee convergence of expectations. The framework centers on the novel concepts of exact and limit-exact Rockafellians, with interpretations of “negative” regularization emerging in certain settings. We illustrate the role of Phi-divergence, examine rates of convergence under changing distributions, and explore extensions to first-order optimality conditions. The main development is free of assumptions about convexity, smoothness, and even continuity of objective functions. Numerical results in the setting of computer vision and text analytics with label noise illustrate the framework.Funding: This work was supported by the Air Force Office of Scientific Research (Mathematical Optimization Program) under the grant: “Optimal Decision Making under Tight Performance Requirements in Adversarial and Uncertain Environments: Insight from Rockafellian Functions.”","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141614397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jean-Paul Décamps, Fabien Gensbittel, Thomas Mariotti
{"title":"Investment Timing and Technological Breakthroughs","authors":"Jean-Paul Décamps, Fabien Gensbittel, Thomas Mariotti","doi":"10.1287/moor.2022.0022","DOIUrl":"https://doi.org/10.1287/moor.2022.0022","url":null,"abstract":"We study the optimal investment policy of a firm facing both technological and cash-flow uncertainty. At any point in time, the firm can irreversibly invest in a stand-alone technology or wait for a technological breakthrough. Breakthroughs occur when market conditions become favorable enough, exceeding a threshold value that is ex ante unknown to the firm. The Markov state variables for the optimal investment policy are the current market conditions and their historic maximum, and the firm optimally invests in the stand-alone technology only when market conditions deteriorate enough after reaching a maximum. The path-dependent return required for investing in the stand-alone technology is always higher than if no technological breakthroughs could occur and can take arbitrarily large values following certain histories. Decreases in development costs or increases in the value of the new technology make the firm more prone to bearing downside risk and delaying investment in the stand-alone technology.Funding: This research has benefited from financial support of the ANR [Programmes d’Investissements d’Avenir CHESS ANR-17-EURE-0010 and ANITI ANR-19-PI3A-0004] and the research foundation TSE-Partnership [Chaire Marchés des Risques et Création de Valeur, Fondation du Risque/SCOR].","PeriodicalId":49852,"journal":{"name":"Mathematics of Operations Research","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141568637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}