Onflow: an online portfolio allocation algorithm

arXiv - QuantFin - Portfolio Management Pub Date : 2023-12-08 DOI:arxiv-2312.05169

Gabriel Turinici, Pierre Brugiere

{"title":"Onflow: an online portfolio allocation algorithm","authors":"Gabriel Turinici, Pierre Brugiere","doi":"arxiv-2312.05169","DOIUrl":null,"url":null,"abstract":"We introduce Onflow, a reinforcement learning technique that enables online\noptimization of portfolio allocation policies based on gradient flows. We\ndevise dynamic allocations of an investment portfolio to maximize its expected\nlog return while taking into account transaction fees. The portfolio allocation\nis parameterized through a softmax function, and at each time step, the\ngradient flow method leads to an ordinary differential equation whose solutions\ncorrespond to the updated allocations. This algorithm belongs to the large\nclass of stochastic optimization procedures; we measure its efficiency by\ncomparing our results to the mathematical theoretical values in a log-normal\nframework and to standard benchmarks from the 'old NYSE' dataset. For\nlog-normal assets, the strategy learned by Onflow, with transaction costs at\nzero, mimics Markowitz's optimal portfolio and thus the best possible asset\nallocation strategy. Numerical experiments from the 'old NYSE' dataset show\nthat Onflow leads to dynamic asset allocation strategies whose performances\nare: a) comparable to benchmark strategies such as Cover's Universal Portfolio\nor Helmbold et al. \"multiplicative updates\" approach when transaction costs are\nzero, and b) better than previous procedures when transaction costs are high.\nOnflow can even remain efficient in regimes where other dynamical allocation\ntechniques do not work anymore. Therefore, as far as tested, Onflow appears to\nbe a promising dynamic portfolio management strategy based on observed prices\nonly and without any assumption on the laws of distributions of the underlying\nassets' returns. In particular it could avoid model risk when building a\ntrading strategy.","PeriodicalId":501045,"journal":{"name":"arXiv - QuantFin - Portfolio Management","volume":"94 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Portfolio Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2312.05169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

We introduce Onflow, a reinforcement learning technique that enables online optimization of portfolio allocation policies based on gradient flows. We devise dynamic allocations of an investment portfolio to maximize its expected log return while taking into account transaction fees. The portfolio allocation is parameterized through a softmax function, and at each time step, the gradient flow method leads to an ordinary differential equation whose solutions correspond to the updated allocations. This algorithm belongs to the large class of stochastic optimization procedures; we measure its efficiency by comparing our results to the mathematical theoretical values in a log-normal framework and to standard benchmarks from the 'old NYSE' dataset. For log-normal assets, the strategy learned by Onflow, with transaction costs at zero, mimics Markowitz's optimal portfolio and thus the best possible asset allocation strategy. Numerical experiments from the 'old NYSE' dataset show that Onflow leads to dynamic asset allocation strategies whose performances are: a) comparable to benchmark strategies such as Cover's Universal Portfolio or Helmbold et al. "multiplicative updates" approach when transaction costs are zero, and b) better than previous procedures when transaction costs are high. Onflow can even remain efficient in regimes where other dynamical allocation techniques do not work anymore. Therefore, as far as tested, Onflow appears to be a promising dynamic portfolio management strategy based on observed prices only and without any assumption on the laws of distributions of the underlying assets' returns. In particular it could avoid model risk when building a trading strategy.

查看原文本刊更多论文

Onflow：在线投资组合分配算法

我们介绍了基于梯度流的在线优化投资组合分配策略的强化学习技术 Onflow。我们对投资组合进行动态分配，在考虑交易费用的同时使其预期日志收益最大化。投资组合分配通过软最大值函数进行参数化，在每个时间步长，梯度流方法都会产生一个常微分方程，其解与更新的分配相对应。该算法属于随机优化程序的一个大类；我们通过将结果与对数正态框架下的数学理论值以及 "旧纽约证券交易所 "数据集的标准基准进行比较，来衡量其效率。对于对数正态资产，Onflow 学习到的策略在交易成本为零的情况下模仿了马科维茨的最优投资组合，因此是可能的最佳资产配置策略。来自 "旧纽约证券交易所 "数据集的数值实验表明，Onflow 所得出的动态资产配置策略的性能：a）在交易成本为零时，可与 Cover 的通用投资组合或 Helmbold 等人的 "乘法更新 "方法等基准策略相媲美；b）在交易成本较高时，优于之前的程序。因此，经过测试，Onflow 似乎是一种很有前途的动态投资组合管理策略，它只基于观察到的价格，而不假定相关资产收益的分布规律。特别是，它可以在制定交易策略时避免模型风险。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - QuantFin - Portfolio Management

自引率

0.00%

发文量