Reinforcement Learning for Corporate Bond Trading: A Sell Side Perspective

Samuel Atkins, Ali Fathi, Sammy Assefa
{"title":"Reinforcement Learning for Corporate Bond Trading: A Sell Side Perspective","authors":"Samuel Atkins, Ali Fathi, Sammy Assefa","doi":"arxiv-2406.12983","DOIUrl":null,"url":null,"abstract":"A corporate bond trader in a typical sell side institution such as a bank\nprovides liquidity to the market participants by buying/selling securities and\nmaintaining an inventory. Upon receiving a request for a buy/sell price quote\n(RFQ), the trader provides a quote by adding a spread over a \\textit{prevalent\nmarket price}. For illiquid bonds, the market price is harder to observe, and\ntraders often resort to available benchmark bond prices (such as MarketAxess,\nBloomberg, etc.). In \\cite{Bergault2023ModelingLI}, the concept of \\textit{Fair\nTransfer Price} for an illiquid corporate bond was introduced which is derived\nfrom an infinite horizon stochastic optimal control problem (for maximizing the\ntrader's expected P\\&L, regularized by the quadratic variation). In this paper,\nwe consider the same optimization objective, however, we approach the\nestimation of an optimal bid-ask spread quoting strategy in a data driven\nmanner and show that it can be learned using Reinforcement Learning.\nFurthermore, we perform extensive outcome analysis to examine the\nreasonableness of the trained agent's behavior.","PeriodicalId":501294,"journal":{"name":"arXiv - QuantFin - Computational Finance","volume":"26 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Computational Finance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.12983","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A corporate bond trader in a typical sell side institution such as a bank provides liquidity to the market participants by buying/selling securities and maintaining an inventory. Upon receiving a request for a buy/sell price quote (RFQ), the trader provides a quote by adding a spread over a \textit{prevalent market price}. For illiquid bonds, the market price is harder to observe, and traders often resort to available benchmark bond prices (such as MarketAxess, Bloomberg, etc.). In \cite{Bergault2023ModelingLI}, the concept of \textit{Fair Transfer Price} for an illiquid corporate bond was introduced which is derived from an infinite horizon stochastic optimal control problem (for maximizing the trader's expected P\&L, regularized by the quadratic variation). In this paper, we consider the same optimization objective, however, we approach the estimation of an optimal bid-ask spread quoting strategy in a data driven manner and show that it can be learned using Reinforcement Learning. Furthermore, we perform extensive outcome analysis to examine the reasonableness of the trained agent's behavior.
公司债券交易的强化学习:卖方视角
典型的卖方机构(如银行)中的公司债券交易员通过买入/卖出证券和维持库存为市场参与者提供流动性。在收到买入/卖出报价请求(RFQ)后,交易员会在报价上加上一个价差(textit{prevalentmarket price})。对于非流动性债券,市场价格更难观察,交易商通常会求助于现有的基准债券价格(如 MarketAxess、彭博等)。在《Bergault2023ModelingLI》一书中,引入了非流动性公司债券的 "公平转让价格"(FairTransfer Price)概念,该概念来自于一个无限期随机最优控制问题(最大化交易者的预期收益,并通过二次变化正则化)。在本文中,我们考虑了相同的优化目标,但是,我们以数据驱动的方式来估计最优买卖价差报价策略,并证明可以使用强化学习来学习该策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信