{"title":"Deep Reinforcement Learning for Market Making in Corporate Bonds: Beating the Curse of Dimensionality","authors":"Olivier Gu'eant, Iuliia Manziuk","doi":"10.1080/1350486X.2020.1714455","DOIUrl":null,"url":null,"abstract":"ABSTRACT In corporate bond markets, which are mainly OTC markets, market makers play a central role by providing bid and ask prices for bonds to asset managers. Determining the optimal bid and ask quotes that a market maker should set for a given universe of bonds is a complex task. The existing models, mostly inspired by the Avellaneda-Stoikov model, describe the complex optimization problem faced by market makers: proposing bid and ask prices for making money out of the difference between them while mitigating the market risk associated with holding inventory. While most of the models only tackle one-asset market making, they can often be generalized to a multi-asset framework. However, the problem of solving the equations characterizing the optimal bid and ask quotes numerically is seldom tackled in the literature, especially in high dimension. In this paper, we propose a numerical method for approximating the optimal bid and ask quotes over a large universe of bonds in a model à la Avellaneda–Stoikov. As classical finite difference methods cannot be used in high dimension, we present a discrete-time method inspired by reinforcement learning techniques, namely, a model-based deep actor-critic algorithm.","PeriodicalId":35818,"journal":{"name":"Applied Mathematical Finance","volume":"2 1","pages":"387 - 452"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"48","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Mathematical Finance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/1350486X.2020.1714455","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 48
Abstract
ABSTRACT In corporate bond markets, which are mainly OTC markets, market makers play a central role by providing bid and ask prices for bonds to asset managers. Determining the optimal bid and ask quotes that a market maker should set for a given universe of bonds is a complex task. The existing models, mostly inspired by the Avellaneda-Stoikov model, describe the complex optimization problem faced by market makers: proposing bid and ask prices for making money out of the difference between them while mitigating the market risk associated with holding inventory. While most of the models only tackle one-asset market making, they can often be generalized to a multi-asset framework. However, the problem of solving the equations characterizing the optimal bid and ask quotes numerically is seldom tackled in the literature, especially in high dimension. In this paper, we propose a numerical method for approximating the optimal bid and ask quotes over a large universe of bonds in a model à la Avellaneda–Stoikov. As classical finite difference methods cannot be used in high dimension, we present a discrete-time method inspired by reinforcement learning techniques, namely, a model-based deep actor-critic algorithm.
在以场外交易市场为主的公司债券市场中,做市商通过向资产管理公司提供债券的买入价和卖出价而发挥着核心作用。确定做市商为特定债券设定的最佳买卖报价是一项复杂的任务。现有的模型大多受到Avellaneda-Stoikov模型的启发,描述了做市商面临的复杂优化问题:提出买入价和卖出价,以便从两者之间的差价中赚钱,同时降低与持有库存相关的市场风险。虽然大多数模型只处理单一资产做市,但它们通常可以推广到多资产框架。然而,文献中对最优买入价和最优卖出价方程的数值求解问题,特别是在高维的情况下,研究较少。在本文中,我们提出了一种数值方法来逼近大范围债券的最优买入价和最优卖出价,该方法适用于a - la Avellaneda-Stoikov模型。由于经典的有限差分方法不能用于高维,我们提出了一种受强化学习技术启发的离散时间方法,即基于模型的深度actor-critic算法。
期刊介绍:
The journal encourages the confident use of applied mathematics and mathematical modelling in finance. The journal publishes papers on the following: •modelling of financial and economic primitives (interest rates, asset prices etc); •modelling market behaviour; •modelling market imperfections; •pricing of financial derivative securities; •hedging strategies; •numerical methods; •financial engineering.