Onflow: an online portfolio allocation algorithm

Gabriel Turinici, Pierre Brugiere
{"title":"Onflow: an online portfolio allocation algorithm","authors":"Gabriel Turinici, Pierre Brugiere","doi":"arxiv-2312.05169","DOIUrl":null,"url":null,"abstract":"We introduce Onflow, a reinforcement learning technique that enables online\noptimization of portfolio allocation policies based on gradient flows. We\ndevise dynamic allocations of an investment portfolio to maximize its expected\nlog return while taking into account transaction fees. The portfolio allocation\nis parameterized through a softmax function, and at each time step, the\ngradient flow method leads to an ordinary differential equation whose solutions\ncorrespond to the updated allocations. This algorithm belongs to the large\nclass of stochastic optimization procedures; we measure its efficiency by\ncomparing our results to the mathematical theoretical values in a log-normal\nframework and to standard benchmarks from the 'old NYSE' dataset. For\nlog-normal assets, the strategy learned by Onflow, with transaction costs at\nzero, mimics Markowitz's optimal portfolio and thus the best possible asset\nallocation strategy. Numerical experiments from the 'old NYSE' dataset show\nthat Onflow leads to dynamic asset allocation strategies whose performances\nare: a) comparable to benchmark strategies such as Cover's Universal Portfolio\nor Helmbold et al. \"multiplicative updates\" approach when transaction costs are\nzero, and b) better than previous procedures when transaction costs are high.\nOnflow can even remain efficient in regimes where other dynamical allocation\ntechniques do not work anymore. Therefore, as far as tested, Onflow appears to\nbe a promising dynamic portfolio management strategy based on observed prices\nonly and without any assumption on the laws of distributions of the underlying\nassets' returns. In particular it could avoid model risk when building a\ntrading strategy.","PeriodicalId":501045,"journal":{"name":"arXiv - QuantFin - Portfolio Management","volume":"94 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Portfolio Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2312.05169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We introduce Onflow, a reinforcement learning technique that enables online optimization of portfolio allocation policies based on gradient flows. We devise dynamic allocations of an investment portfolio to maximize its expected log return while taking into account transaction fees. The portfolio allocation is parameterized through a softmax function, and at each time step, the gradient flow method leads to an ordinary differential equation whose solutions correspond to the updated allocations. This algorithm belongs to the large class of stochastic optimization procedures; we measure its efficiency by comparing our results to the mathematical theoretical values in a log-normal framework and to standard benchmarks from the 'old NYSE' dataset. For log-normal assets, the strategy learned by Onflow, with transaction costs at zero, mimics Markowitz's optimal portfolio and thus the best possible asset allocation strategy. Numerical experiments from the 'old NYSE' dataset show that Onflow leads to dynamic asset allocation strategies whose performances are: a) comparable to benchmark strategies such as Cover's Universal Portfolio or Helmbold et al. "multiplicative updates" approach when transaction costs are zero, and b) better than previous procedures when transaction costs are high. Onflow can even remain efficient in regimes where other dynamical allocation techniques do not work anymore. Therefore, as far as tested, Onflow appears to be a promising dynamic portfolio management strategy based on observed prices only and without any assumption on the laws of distributions of the underlying assets' returns. In particular it could avoid model risk when building a trading strategy.
Onflow:在线投资组合分配算法
我们介绍了基于梯度流的在线优化投资组合分配策略的强化学习技术 Onflow。我们对投资组合进行动态分配,在考虑交易费用的同时使其预期日志收益最大化。投资组合分配通过软最大值函数进行参数化,在每个时间步长,梯度流方法都会产生一个常微分方程,其解与更新的分配相对应。该算法属于随机优化程序的一个大类;我们通过将结果与对数正态框架下的数学理论值以及 "旧纽约证券交易所 "数据集的标准基准进行比较,来衡量其效率。对于对数正态资产,Onflow 学习到的策略在交易成本为零的情况下模仿了马科维茨的最优投资组合,因此是可能的最佳资产配置策略。来自 "旧纽约证券交易所 "数据集的数值实验表明,Onflow 所得出的动态资产配置策略的性能:a)在交易成本为零时,可与 Cover 的通用投资组合或 Helmbold 等人的 "乘法更新 "方法等基准策略相媲美;b)在交易成本较高时,优于之前的程序。因此,经过测试,Onflow 似乎是一种很有前途的动态投资组合管理策略,它只基于观察到的价格,而不假定相关资产收益的分布规律。特别是,它可以在制定交易策略时避免模型风险。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信