Multi-agent Reinforcement Learning for the Control of Three-Dimensional Rayleigh–Bénard Convection

IF 2.4 3区工程技术 Q3 MECHANICS

Flow, Turbulence and Combustion Pub Date : 2024-12-14 DOI:10.1007/s10494-024-00619-2

Joel Vasanth, Jean Rabault, Francisco Alcántara-Ávila, Mikael Mortensen, Ricardo Vinuesa

{"title":"Multi-agent Reinforcement Learning for the Control of Three-Dimensional Rayleigh–Bénard Convection","authors":"Joel Vasanth, Jean Rabault, Francisco Alcántara-Ávila, Mikael Mortensen, Ricardo Vinuesa","doi":"10.1007/s10494-024-00619-2","DOIUrl":null,"url":null,"abstract":"<div>Deep reinforcement learning (DRL) has found application in numerous use-cases pertaining to flow control. Multi-agent RL (MARL), a variant of DRL, has shown to be more effective than single-agent RL in controlling flows exhibiting locality and translational invariance. We present, for the first time, an implementation of MARL-based control of three-dimensional Rayleigh–Bénard convection (RBC). Control is executed by modifying the temperature distribution along the bottom wall divided into multiple control segments, each of which acts as an independent agent. Two regimes of RBC are considered at Rayleigh numbers \\(\\textrm{Ra}=500\\) and 750. Evaluation of the learned control policy reveals a reduction in convection intensity by \\(23.5\\%\\) and \\(8.7\\%\\) at \\(\\textrm{Ra}=500\\) and 750, respectively. The MARL controller converts irregularly shaped convective patterns to regular straight rolls with lower convection that resemble flow in a relatively more stable regime. We draw comparisons with proportional control at both \\(\\textrm{Ra}\\) and show that MARL is able to outperform the proportional controller. The learned control strategy is complex, featuring different non-linear segment-wise actuator delays and actuation magnitudes. We also perform successful evaluations on a larger domain than used for training, demonstrating that the invariant property of MARL allows direct transfer of the learnt policy.</div>","PeriodicalId":559,"journal":{"name":"Flow, Turbulence and Combustion","volume":"115 :","pages":"1319 - 1355"},"PeriodicalIF":2.4000,"publicationDate":"2024-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10494-024-00619-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Flow, Turbulence and Combustion","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10494-024-00619-2","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MECHANICS","Score":null,"Total":0}

引用次数: 0

Abstract

Deep reinforcement learning (DRL) has found application in numerous use-cases pertaining to flow control. Multi-agent RL (MARL), a variant of DRL, has shown to be more effective than single-agent RL in controlling flows exhibiting locality and translational invariance. We present, for the first time, an implementation of MARL-based control of three-dimensional Rayleigh–Bénard convection (RBC). Control is executed by modifying the temperature distribution along the bottom wall divided into multiple control segments, each of which acts as an independent agent. Two regimes of RBC are considered at Rayleigh numbers \(\textrm{Ra}=500\) and 750. Evaluation of the learned control policy reveals a reduction in convection intensity by \(23.5\%\) and \(8.7\%\) at \(\textrm{Ra}=500\) and 750, respectively. The MARL controller converts irregularly shaped convective patterns to regular straight rolls with lower convection that resemble flow in a relatively more stable regime. We draw comparisons with proportional control at both \(\textrm{Ra}\) and show that MARL is able to outperform the proportional controller. The learned control strategy is complex, featuring different non-linear segment-wise actuator delays and actuation magnitudes. We also perform successful evaluations on a larger domain than used for training, demonstrating that the invariant property of MARL allows direct transfer of the learnt policy.

查看原文本刊更多论文

基于多智能体强化学习的三维rayleigh - b对流控制

深度强化学习（DRL）已经在许多与流量控制相关的用例中得到了应用。多智能体强化学习（MARL）是DRL的一种变体，在控制具有局部性和平移不变性的流方面比单智能体强化学习更有效。我们首次提出了一种基于marl的三维rayleigh - bsamadard对流（RBC）控制的实现。控制是通过修改沿底壁的温度分布来执行的，温度分布分为多个控制段，每个控制段作为一个独立的代理。在瑞利数\(\textrm{Ra}=500\)和750处考虑两种RBC状态。对学习控制策略的评估显示对流强度分别减少\(23.5\%\)和\(8.7\%\)在\(\textrm{Ra}=500\)和750。MARL控制器将不规则形状的对流模式转换为具有较低对流的规则直卷，类似于相对更稳定的流动状态。我们在\(\textrm{Ra}\)与比例控制进行了比较，并表明MARL能够优于比例控制器。学习的控制策略是复杂的，具有不同的非线性分段执行器延迟和驱动量。我们还在比用于训练的更大的域上进行了成功的评估，证明了MARL的不变性允许直接转移学到的策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Flow, Turbulence and Combustion 工程技术-力学

CiteScore

5.70

自引率

8.30%

发文量

审稿时长

2 months

期刊介绍： Flow, Turbulence and Combustion provides a global forum for the publication of original and innovative research results that contribute to the solution of fundamental and applied problems encountered in single-phase, multi-phase and reacting flows, in both idealized and real systems. The scope of coverage encompasses topics in fluid dynamics, scalar transport, multi-physics interactions and flow control. From time to time the journal publishes Special or Theme Issues featuring invited articles. Contributions may report research that falls within the broad spectrum of analytical, computational and experimental methods. This includes research conducted in academia, industry and a variety of environmental and geophysical sectors. Turbulence, transition and associated phenomena are expected to play a significant role in the majority of studies reported, although non-turbulent flows, typical of those in micro-devices, would be regarded as falling within the scope covered. The emphasis is on originality, timeliness, quality and thematic fit, as exemplified by the title of the journal and the qualifications described above. Relevance to real-world problems and industrial applications are regarded as strengths.