Lei Yan , Qiulei Wang , Gang Hu , Wenli Chen , Bernd R. Noack
{"title":"Deep reinforcement cross-domain transfer learning of active flow control for three-dimensional bluff body flow","authors":"Lei Yan , Qiulei Wang , Gang Hu , Wenli Chen , Bernd R. Noack","doi":"10.1016/j.jcp.2025.113893","DOIUrl":null,"url":null,"abstract":"<div><div>This paper applies mutual information-based knowledge transfer learning with soft actor-critic (MIKT-SAC) algorithm to address cross-domain issues in state and action dimensions for active flow control (AFC). It explores the potential of deep reinforcement learning (DRL) in discovering novel drag reduction strategies. The algorithm starts with a pretrained agent on a two-dimensional (2D) case, extracting knowledge to mitigate aerodynamic forces acting on a 3D bluff body under high Reynolds number flow conditions. The algorithm is applied to two test cases to demonstrate its capabilities and limits: The first investigates the state dimension mismatch problem using a 3D square cylinder at high Reynolds number <span><math><mi>R</mi><mi>e</mi><mo>=</mo><mn>22000</mn></math></span>, where four jets at the corners of square cylinder as actuators. The second test examines scenarios with both state and action dimension mismatches using a circular cylinder with multiple zero-net-mass-flux jets positioned as two slots on the top and bottom surfaces. The results show that MIKT-SAC method outperforms the vanilla SAC algorithm, significantly reducing 51.1% and 45.1% training time and reducing drag coefficients (<span><math><msub><mrow><mi>C</mi></mrow><mrow><mi>D</mi></mrow></msub></math></span>) by 50.9% and 49.4% for the square and circular cylinders, respectively, while effectively suppressing drag and lift fluctuations. The multi-jet actuation delays vortex shedding on the surface of bluff body, reducing fluctuating lift forces on both cases. These findings highlight the potential of DRL in active flow control, laying a foundation for efficient, robust, and practical implementation of bluff body control technologies in practical engineering applications.</div></div>","PeriodicalId":352,"journal":{"name":"Journal of Computational Physics","volume":"529 ","pages":"Article 113893"},"PeriodicalIF":3.8000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Physics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0021999125001767","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper applies mutual information-based knowledge transfer learning with soft actor-critic (MIKT-SAC) algorithm to address cross-domain issues in state and action dimensions for active flow control (AFC). It explores the potential of deep reinforcement learning (DRL) in discovering novel drag reduction strategies. The algorithm starts with a pretrained agent on a two-dimensional (2D) case, extracting knowledge to mitigate aerodynamic forces acting on a 3D bluff body under high Reynolds number flow conditions. The algorithm is applied to two test cases to demonstrate its capabilities and limits: The first investigates the state dimension mismatch problem using a 3D square cylinder at high Reynolds number , where four jets at the corners of square cylinder as actuators. The second test examines scenarios with both state and action dimension mismatches using a circular cylinder with multiple zero-net-mass-flux jets positioned as two slots on the top and bottom surfaces. The results show that MIKT-SAC method outperforms the vanilla SAC algorithm, significantly reducing 51.1% and 45.1% training time and reducing drag coefficients () by 50.9% and 49.4% for the square and circular cylinders, respectively, while effectively suppressing drag and lift fluctuations. The multi-jet actuation delays vortex shedding on the surface of bluff body, reducing fluctuating lift forces on both cases. These findings highlight the potential of DRL in active flow control, laying a foundation for efficient, robust, and practical implementation of bluff body control technologies in practical engineering applications.
期刊介绍:
Journal of Computational Physics thoroughly treats the computational aspects of physical problems, presenting techniques for the numerical solution of mathematical equations arising in all areas of physics. The journal seeks to emphasize methods that cross disciplinary boundaries.
The Journal of Computational Physics also publishes short notes of 4 pages or less (including figures, tables, and references but excluding title pages). Letters to the Editor commenting on articles already published in this Journal will also be considered. Neither notes nor letters should have an abstract.