A novel data-efficient double deep Q-network framework for intelligent financial portfolio management

IF 8 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-09-30 DOI:10.1016/j.engappai.2025.112436

Mahshad Alidousti, Morteza Khakzar Bafruei, Amir Hosein Afshar Sedigh

{"title":"A novel data-efficient double deep Q-network framework for intelligent financial portfolio management","authors":"Mahshad Alidousti, Morteza Khakzar Bafruei, Amir Hosein Afshar Sedigh","doi":"10.1016/j.engappai.2025.112436","DOIUrl":null,"url":null,"abstract":"<div><div>Navigating the complexities of dynamic and uncertain financial markets demands intelligent systems capable of learning profitable strategies amidst risk and volatility. While Deep Q-Networks (DQN) offer a foundation for such systems, they often suffer from overestimation bias, training instability, and poor generalization in noisy financial environments. To address these challenges, this work introduces Portfolio Double Deep Q-Network (PDQN), a novel architecture inspired by recent advancements in reinforcement learning. PDQN enhances portfolio management by integrating Double Q-Learning to reduce overestimation, alongside Leaky ReLU activation, Xavier initialization, Huber loss, and dropout regularization to improve learning stability and generalization. Unlike prior methods that rely on large datasets and heavy computational infrastructure, PDQN achieves competitive—and often superior—performance using substantially less training data and lightweight infrastructure, making it well-suited for real-world, resource-constrained financial applications. Distinct from conventional approaches, PDQN uses separate networks to adapt portfolio decisions across varying market conditions. Empirical results across multiple market years show that PDQN often outperforms baseline strategies, including classic DQN and Buy-and-Hold, across key metrics such as Sharpe ratio, Sterling ratio, and cumulative return. PDQN—like all data-driven models—exhibits room for improvement under highly irregular or extreme financial scenarios. These observations suggest promising directions for future refinement and increased robustness, without detracting from the model's practical effectiveness and competitive edge.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"162 ","pages":"Article 112436"},"PeriodicalIF":8.0000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625024674","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Navigating the complexities of dynamic and uncertain financial markets demands intelligent systems capable of learning profitable strategies amidst risk and volatility. While Deep Q-Networks (DQN) offer a foundation for such systems, they often suffer from overestimation bias, training instability, and poor generalization in noisy financial environments. To address these challenges, this work introduces Portfolio Double Deep Q-Network (PDQN), a novel architecture inspired by recent advancements in reinforcement learning. PDQN enhances portfolio management by integrating Double Q-Learning to reduce overestimation, alongside Leaky ReLU activation, Xavier initialization, Huber loss, and dropout regularization to improve learning stability and generalization. Unlike prior methods that rely on large datasets and heavy computational infrastructure, PDQN achieves competitive—and often superior—performance using substantially less training data and lightweight infrastructure, making it well-suited for real-world, resource-constrained financial applications. Distinct from conventional approaches, PDQN uses separate networks to adapt portfolio decisions across varying market conditions. Empirical results across multiple market years show that PDQN often outperforms baseline strategies, including classic DQN and Buy-and-Hold, across key metrics such as Sharpe ratio, Sterling ratio, and cumulative return. PDQN—like all data-driven models—exhibits room for improvement under highly irregular or extreme financial scenarios. These observations suggest promising directions for future refinement and increased robustness, without detracting from the model's practical effectiveness and competitive edge.

查看原文本刊更多论文

一种新型的数据高效双深度q网络框架，用于智能金融投资组合管理

驾驭动态和不确定金融市场的复杂性需要能够在风险和波动中学习盈利策略的智能系统。虽然深度q -网络（DQN）为这样的系统提供了基础，但它们经常受到高估偏差、训练不稳定以及在嘈杂的金融环境中泛化不良的影响。为了解决这些挑战，本工作引入了组合双深度q -网络（PDQN），这是一种受强化学习最新进展启发的新颖架构。PDQN通过集成Double Q-Learning来减少高估，以及Leaky ReLU激活、Xavier初始化、Huber损失和dropout正则化来提高学习的稳定性和泛化，从而增强了投资组合管理。与之前依赖于大型数据集和重型计算基础设施的方法不同，PDQN使用更少的训练数据和轻量级基础设施实现了具有竞争力的性能，并且通常是卓越的性能，使其非常适合现实世界中资源受限的金融应用。与传统方法不同，PDQN使用单独的网络来适应不同市场条件下的投资组合决策。跨越多个市场年份的实证结果表明，在夏普比率、英镑比率和累积回报等关键指标上，PDQN通常优于基准策略，包括经典的DQN和买入并持有。与所有数据驱动的模型一样，pdqn在高度不规则或极端的财务情景下显示出改进的空间。这些观察结果为未来的改进和增强鲁棒性提供了有希望的方向，而不会损害模型的实际有效性和竞争优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.