Alireza Mohammadshafie, Akram Mirzaeinia, Haseebullah Jumakhan, Amir Mirzaeinia
{"title":"Deep Reinforcement Learning Strategies in Finance: Insights into Asset Holding, Trading Behavior, and Purchase Diversity","authors":"Alireza Mohammadshafie, Akram Mirzaeinia, Haseebullah Jumakhan, Amir Mirzaeinia","doi":"arxiv-2407.09557","DOIUrl":null,"url":null,"abstract":"Recent deep reinforcement learning (DRL) methods in finance show promising\noutcomes. However, there is limited research examining the behavior of these\nDRL algorithms. This paper aims to investigate their tendencies towards holding\nor trading financial assets as well as purchase diversity. By analyzing their\ntrading behaviors, we provide insights into the decision-making processes of\nDRL models in finance applications. Our findings reveal that each DRL algorithm\nexhibits unique trading patterns and strategies, with A2C emerging as the top\nperformer in terms of cumulative rewards. While PPO and SAC engage in\nsignificant trades with a limited number of stocks, DDPG and TD3 adopt a more\nbalanced approach. Furthermore, SAC and PPO tend to hold positions for shorter\ndurations, whereas DDPG, A2C, and TD3 display a propensity to remain stationary\nfor extended periods.","PeriodicalId":501478,"journal":{"name":"arXiv - QuantFin - Trading and Market Microstructure","volume":"25 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - Trading and Market Microstructure","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.09557","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Recent deep reinforcement learning (DRL) methods in finance show promising
outcomes. However, there is limited research examining the behavior of these
DRL algorithms. This paper aims to investigate their tendencies towards holding
or trading financial assets as well as purchase diversity. By analyzing their
trading behaviors, we provide insights into the decision-making processes of
DRL models in finance applications. Our findings reveal that each DRL algorithm
exhibits unique trading patterns and strategies, with A2C emerging as the top
performer in terms of cumulative rewards. While PPO and SAC engage in
significant trades with a limited number of stocks, DDPG and TD3 adopt a more
balanced approach. Furthermore, SAC and PPO tend to hold positions for shorter
durations, whereas DDPG, A2C, and TD3 display a propensity to remain stationary
for extended periods.