Pengrui Yu , Siya Liu , Chengneng Jin , Runsheng Gu , Xiaomin Gong
{"title":"Optimization-based spectral end-to-end deep reinforcement learning for equity portfolio management","authors":"Pengrui Yu , Siya Liu , Chengneng Jin , Runsheng Gu , Xiaomin Gong","doi":"10.1016/j.pacfin.2025.102746","DOIUrl":null,"url":null,"abstract":"<div><div>We propose a novel approach to equity portfolio optimization that combines spectral analysis and classical equity portfolio optimization theory with deep reinforcement learning in an end-to-end framework. We introduce the End-to-end Frequency Online Deep Deterministic Policy Gradient (EFO-DDPG) algorithm, which leverages discrete Fourier transform to decompose asset return sequences into frequency components. Unlike traditional methods that treat high-frequency components as noise, EFO-DDPG learns to adjust the influence of different frequency components dynamically. Moreover, the algorithm embeds a mean–variance portfolio optimization problem within a deep learning network, enhancing interpretability compared to black-box approaches. The framework models the investment problem as a Partially Observable Markov Decision Process (POMDP), using a state processing block with transformer encoders to capture complex relationships in the market data. By integrating spectral analysis, portfolio optimization theory, and online deep reinforcement learning, EFO-DDPG aims to adapt to non-stationary financial markets and generate superior investment strategies.</div></div>","PeriodicalId":48074,"journal":{"name":"Pacific-Basin Finance Journal","volume":"91 ","pages":"Article 102746"},"PeriodicalIF":4.8000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pacific-Basin Finance Journal","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0927538X25000836","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 0
Abstract
We propose a novel approach to equity portfolio optimization that combines spectral analysis and classical equity portfolio optimization theory with deep reinforcement learning in an end-to-end framework. We introduce the End-to-end Frequency Online Deep Deterministic Policy Gradient (EFO-DDPG) algorithm, which leverages discrete Fourier transform to decompose asset return sequences into frequency components. Unlike traditional methods that treat high-frequency components as noise, EFO-DDPG learns to adjust the influence of different frequency components dynamically. Moreover, the algorithm embeds a mean–variance portfolio optimization problem within a deep learning network, enhancing interpretability compared to black-box approaches. The framework models the investment problem as a Partially Observable Markov Decision Process (POMDP), using a state processing block with transformer encoders to capture complex relationships in the market data. By integrating spectral analysis, portfolio optimization theory, and online deep reinforcement learning, EFO-DDPG aims to adapt to non-stationary financial markets and generate superior investment strategies.
期刊介绍:
The Pacific-Basin Finance Journal is aimed at providing a specialized forum for the publication of academic research on capital markets of the Asia-Pacific countries. Primary emphasis will be placed on the highest quality empirical and theoretical research in the following areas: • Market Micro-structure; • Investment and Portfolio Management; • Theories of Market Equilibrium; • Valuation of Financial and Real Assets; • Behavior of Asset Prices in Financial Sectors; • Normative Theory of Financial Management; • Capital Markets of Development; • Market Mechanisms.