Deep Reinforcement Learning for Portfolio Management

EPiC series in computing. Pub Date: 2022-01-01. DOI: 10.29007/w2m3

Yue Ma, Ziping Liu, Chuck McAllister


Citations: 0

Abstract



This paper discusses how to build deep reinforcement learning (DRL) agents that decide how to allocate money among the assets in a portfolio so as to maximize return. The agents combine the policy gradient method from reinforcement learning with one of three deep learning architectures: a convolutional neural network (CNN), a recurrent neural network (RNN), or a CNN concatenated with an RNN. Three types of portfolios are tested with the proposed models: a portfolio of stocks positively affected by Covid-19, a portfolio of stocks negatively affected by Covid-19, and a randomly selected portfolio of stocks combined with cryptocurrency. The performance of the DRL agents is compared with that of an equal-weighted agent and of agents that invest all the money in a single stock. All of the DRL agents performed best on the randomly selected portfolio, which shows an overall stable upward trend. In addition, a linear regression model was tested on the randomly selected portfolio, and it performed poorly compared with the other agents.
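The two baseline agents named in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the price-relative representation (each entry is assumed to be the current closing price divided by the previous one), the absence of transaction costs, and the sample numbers are all assumptions made for illustration.

```python
def portfolio_value(price_relatives, weights_per_step, initial=1.0):
    """Compound the portfolio value over time.

    price_relatives[t][i]: assumed ratio close_t / close_(t-1) for asset i.
    weights_per_step[t][i]: fraction of wealth held in asset i during step t.
    Transaction costs are ignored in this sketch.
    """
    value = initial
    for relatives, weights in zip(price_relatives, weights_per_step):
        if abs(sum(weights) - 1.0) > 1e-9:
            raise ValueError("weights must sum to 1")
        # One-step growth factor is the weighted sum of the price relatives.
        value *= sum(w * r for w, r in zip(weights, relatives))
    return value


def equal_weighted(n_assets, n_steps):
    """Baseline: rebalance to 1/n in every asset at every step."""
    return [[1.0 / n_assets] * n_assets for _ in range(n_steps)]


def single_asset(index, n_assets, n_steps):
    """Baseline: keep all the money in one asset for the whole period."""
    row = [1.0 if i == index else 0.0 for i in range(n_assets)]
    return [row[:] for _ in range(n_steps)]


# Made-up price relatives for two assets over three steps (illustration only).
relatives = [[1.10, 0.90], [1.00, 1.20], [0.95, 1.05]]
ew_value = portfolio_value(relatives, equal_weighted(2, 3))
all_in_first = portfolio_value(relatives, single_asset(0, 2, 3))
print(ew_value, all_in_first)
```

A learned agent would plug into the same evaluation by supplying its own weight vector at each step, for example the softmax output of a policy network, so that its final value can be compared directly with these baselines.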


Source journal

EPiC series in computing

CiteScore

1.60

Self-citation rate

0.00%

Number of publications