平衡投资组合管理的利润、风险和可持续性

2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics (CIFEr) Pub Date : 2022-05-01 DOI:10.1109/CIFEr52523.2022.9776048

Charl Maree, C. Omlin

{"title":"平衡投资组合管理的利润、风险和可持续性","authors":"Charl Maree, C. Omlin","doi":"10.1109/CIFEr52523.2022.9776048","DOIUrl":null,"url":null,"abstract":"Stock portfolio optimization is the process of continuous reallocation of funds to a selection of stocks. This is a particularly well-suited problem for reinforcement learning, as daily rewards are compounding and objective functions may include more than just profit, e.g., risk and sustainability. We developed a novel utility function with the Sharpe ratio representing risk and the environmental, social, and governance score (ESG) representing sustainability. We show that a state- of-the-art policy gradient method – multi-agent deep deterministic policy gradients (MADDPG) – fails to find the optimum policy due to flat policy gradients and we therefore replaced gradient descent with a genetic algorithm for parameter optimization. We show that our system outperforms MADDPG while improving on deep Q-learning approaches by allowing for continuous action spaces. Crucially, by incorporating risk and sustainability criteria in the utility function, we improve on the state-of-the-art in reinforcement learning for portfolio optimization; risk and sustainability are essential in any modern trading strategy, and we propose a system that does not merely report these metrics, but that actively optimizes the portfolio to improve on them.","PeriodicalId":234473,"journal":{"name":"2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics (CIFEr)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Balancing Profit, Risk, and Sustainability for Portfolio Management\",\"authors\":\"Charl Maree, C. Omlin\",\"doi\":\"10.1109/CIFEr52523.2022.9776048\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Stock portfolio optimization is the process of continuous reallocation of funds to a selection of stocks. This is a particularly well-suited problem for reinforcement learning, as daily rewards are compounding and objective functions may include more than just profit, e.g., risk and sustainability. We developed a novel utility function with the Sharpe ratio representing risk and the environmental, social, and governance score (ESG) representing sustainability. We show that a state- of-the-art policy gradient method – multi-agent deep deterministic policy gradients (MADDPG) – fails to find the optimum policy due to flat policy gradients and we therefore replaced gradient descent with a genetic algorithm for parameter optimization. We show that our system outperforms MADDPG while improving on deep Q-learning approaches by allowing for continuous action spaces. Crucially, by incorporating risk and sustainability criteria in the utility function, we improve on the state-of-the-art in reinforcement learning for portfolio optimization; risk and sustainability are essential in any modern trading strategy, and we propose a system that does not merely report these metrics, but that actively optimizes the portfolio to improve on them.\",\"PeriodicalId\":234473,\"journal\":{\"name\":\"2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics (CIFEr)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics (CIFEr)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIFEr52523.2022.9776048\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics (CIFEr)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIFEr52523.2022.9776048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

股票投资组合优化是将资金不断重新配置到股票选择中的过程。这是一个特别适合强化学习的问题，因为每日奖励是复合的，目标函数可能不仅仅包括利润，例如风险和可持续性。我们开发了一个新的效用函数，夏普比率代表风险，环境、社会和治理得分(ESG)代表可持续性。我们证明了最先进的策略梯度方法-多智能体深度确定性策略梯度(madpg) -由于策略梯度平坦而无法找到最优策略，因此我们用遗传算法代替梯度下降进行参数优化。我们证明了我们的系统优于MADDPG，同时通过允许连续的动作空间改进了深度q学习方法。至关重要的是，通过将风险和可持续性标准纳入效用函数，我们改进了用于投资组合优化的强化学习的最新技术;风险和可持续性在任何现代交易策略中都是必不可少的，我们提出了一个系统，它不仅报告这些指标，而且还积极优化投资组合以改进它们。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Balancing Profit, Risk, and Sustainability for Portfolio Management

Stock portfolio optimization is the process of continuous reallocation of funds to a selection of stocks. This is a particularly well-suited problem for reinforcement learning, as daily rewards are compounding and objective functions may include more than just profit, e.g., risk and sustainability. We developed a novel utility function with the Sharpe ratio representing risk and the environmental, social, and governance score (ESG) representing sustainability. We show that a state- of-the-art policy gradient method – multi-agent deep deterministic policy gradients (MADDPG) – fails to find the optimum policy due to flat policy gradients and we therefore replaced gradient descent with a genetic algorithm for parameter optimization. We show that our system outperforms MADDPG while improving on deep Q-learning approaches by allowing for continuous action spaces. Crucially, by incorporating risk and sustainability criteria in the utility function, we improve on the state-of-the-art in reinforcement learning for portfolio optimization; risk and sustainability are essential in any modern trading strategy, and we propose a system that does not merely report these metrics, but that actively optimizes the portfolio to improve on them.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics (CIFEr)

自引率

0.00%

发文量