Joel Vasanth, Jean Rabault, Francisco Alcántara-Ávila, Mikael Mortensen, Ricardo Vinuesa
{"title":"Multi-agent Reinforcement Learning for the Control of Three-Dimensional Rayleigh–Bénard Convection","authors":"Joel Vasanth, Jean Rabault, Francisco Alcántara-Ávila, Mikael Mortensen, Ricardo Vinuesa","doi":"10.1007/s10494-024-00619-2","DOIUrl":null,"url":null,"abstract":"<div><p>Deep reinforcement learning (DRL) has found application in numerous use-cases pertaining to flow control. Multi-agent RL (MARL), a variant of DRL, has shown to be more effective than single-agent RL in controlling flows exhibiting locality and translational invariance. We present, for the first time, an implementation of MARL-based control of three-dimensional Rayleigh–Bénard convection (RBC). Control is executed by modifying the temperature distribution along the bottom wall divided into multiple control segments, each of which acts as an independent agent. Two regimes of RBC are considered at Rayleigh numbers <span>\\(\\textrm{Ra}=500\\)</span> and 750. Evaluation of the learned control policy reveals a reduction in convection intensity by <span>\\(23.5\\%\\)</span> and <span>\\(8.7\\%\\)</span> at <span>\\(\\textrm{Ra}=500\\)</span> and 750, respectively. The MARL controller converts irregularly shaped convective patterns to regular straight rolls with lower convection that resemble flow in a relatively more stable regime. We draw comparisons with proportional control at both <span>\\(\\textrm{Ra}\\)</span> and show that MARL is able to outperform the proportional controller. The learned control strategy is complex, featuring different non-linear segment-wise actuator delays and actuation magnitudes. We also perform successful evaluations on a larger domain than used for training, demonstrating that the invariant property of MARL allows direct transfer of the learnt policy.</p></div>","PeriodicalId":559,"journal":{"name":"Flow, Turbulence and Combustion","volume":"115 :","pages":"1319 - 1355"},"PeriodicalIF":2.4000,"publicationDate":"2024-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10494-024-00619-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Flow, Turbulence and Combustion","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10494-024-00619-2","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MECHANICS","Score":null,"Total":0}
引用次数: 0
Abstract
Deep reinforcement learning (DRL) has found application in numerous use-cases pertaining to flow control. Multi-agent RL (MARL), a variant of DRL, has shown to be more effective than single-agent RL in controlling flows exhibiting locality and translational invariance. We present, for the first time, an implementation of MARL-based control of three-dimensional Rayleigh–Bénard convection (RBC). Control is executed by modifying the temperature distribution along the bottom wall divided into multiple control segments, each of which acts as an independent agent. Two regimes of RBC are considered at Rayleigh numbers \(\textrm{Ra}=500\) and 750. Evaluation of the learned control policy reveals a reduction in convection intensity by \(23.5\%\) and \(8.7\%\) at \(\textrm{Ra}=500\) and 750, respectively. The MARL controller converts irregularly shaped convective patterns to regular straight rolls with lower convection that resemble flow in a relatively more stable regime. We draw comparisons with proportional control at both \(\textrm{Ra}\) and show that MARL is able to outperform the proportional controller. The learned control strategy is complex, featuring different non-linear segment-wise actuator delays and actuation magnitudes. We also perform successful evaluations on a larger domain than used for training, demonstrating that the invariant property of MARL allows direct transfer of the learnt policy.
期刊介绍:
Flow, Turbulence and Combustion provides a global forum for the publication of original and innovative research results that contribute to the solution of fundamental and applied problems encountered in single-phase, multi-phase and reacting flows, in both idealized and real systems. The scope of coverage encompasses topics in fluid dynamics, scalar transport, multi-physics interactions and flow control. From time to time the journal publishes Special or Theme Issues featuring invited articles.
Contributions may report research that falls within the broad spectrum of analytical, computational and experimental methods. This includes research conducted in academia, industry and a variety of environmental and geophysical sectors. Turbulence, transition and associated phenomena are expected to play a significant role in the majority of studies reported, although non-turbulent flows, typical of those in micro-devices, would be regarded as falling within the scope covered. The emphasis is on originality, timeliness, quality and thematic fit, as exemplified by the title of the journal and the qualifications described above. Relevance to real-world problems and industrial applications are regarded as strengths.