Multi-agent Reinforcement Learning for the Control of Three-Dimensional Rayleigh–Bénard Convection

IF 2.4 3区 工程技术 Q3 MECHANICS
Joel Vasanth, Jean Rabault, Francisco Alcántara-Ávila, Mikael Mortensen, Ricardo Vinuesa
{"title":"Multi-agent Reinforcement Learning for the Control of Three-Dimensional Rayleigh–Bénard Convection","authors":"Joel Vasanth,&nbsp;Jean Rabault,&nbsp;Francisco Alcántara-Ávila,&nbsp;Mikael Mortensen,&nbsp;Ricardo Vinuesa","doi":"10.1007/s10494-024-00619-2","DOIUrl":null,"url":null,"abstract":"<div><p>Deep reinforcement learning (DRL) has found application in numerous use-cases pertaining to flow control. Multi-agent RL (MARL), a variant of DRL, has shown to be more effective than single-agent RL in controlling flows exhibiting locality and translational invariance. We present, for the first time, an implementation of MARL-based control of three-dimensional Rayleigh–Bénard convection (RBC). Control is executed by modifying the temperature distribution along the bottom wall divided into multiple control segments, each of which acts as an independent agent. Two regimes of RBC are considered at Rayleigh numbers <span>\\(\\textrm{Ra}=500\\)</span> and 750. Evaluation of the learned control policy reveals a reduction in convection intensity by <span>\\(23.5\\%\\)</span> and <span>\\(8.7\\%\\)</span> at <span>\\(\\textrm{Ra}=500\\)</span> and 750, respectively. The MARL controller converts irregularly shaped convective patterns to regular straight rolls with lower convection that resemble flow in a relatively more stable regime. We draw comparisons with proportional control at both <span>\\(\\textrm{Ra}\\)</span> and show that MARL is able to outperform the proportional controller. The learned control strategy is complex, featuring different non-linear segment-wise actuator delays and actuation magnitudes. We also perform successful evaluations on a larger domain than used for training, demonstrating that the invariant property of MARL allows direct transfer of the learnt policy.</p></div>","PeriodicalId":559,"journal":{"name":"Flow, Turbulence and Combustion","volume":"115 :","pages":"1319 - 1355"},"PeriodicalIF":2.4000,"publicationDate":"2024-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10494-024-00619-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Flow, Turbulence and Combustion","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10494-024-00619-2","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MECHANICS","Score":null,"Total":0}
引用次数: 0

Abstract

Deep reinforcement learning (DRL) has found application in numerous use-cases pertaining to flow control. Multi-agent RL (MARL), a variant of DRL, has shown to be more effective than single-agent RL in controlling flows exhibiting locality and translational invariance. We present, for the first time, an implementation of MARL-based control of three-dimensional Rayleigh–Bénard convection (RBC). Control is executed by modifying the temperature distribution along the bottom wall divided into multiple control segments, each of which acts as an independent agent. Two regimes of RBC are considered at Rayleigh numbers \(\textrm{Ra}=500\) and 750. Evaluation of the learned control policy reveals a reduction in convection intensity by \(23.5\%\) and \(8.7\%\) at \(\textrm{Ra}=500\) and 750, respectively. The MARL controller converts irregularly shaped convective patterns to regular straight rolls with lower convection that resemble flow in a relatively more stable regime. We draw comparisons with proportional control at both \(\textrm{Ra}\) and show that MARL is able to outperform the proportional controller. The learned control strategy is complex, featuring different non-linear segment-wise actuator delays and actuation magnitudes. We also perform successful evaluations on a larger domain than used for training, demonstrating that the invariant property of MARL allows direct transfer of the learnt policy.

基于多智能体强化学习的三维rayleigh - b对流控制
深度强化学习(DRL)已经在许多与流量控制相关的用例中得到了应用。多智能体强化学习(MARL)是DRL的一种变体,在控制具有局部性和平移不变性的流方面比单智能体强化学习更有效。我们首次提出了一种基于marl的三维rayleigh - bsamadard对流(RBC)控制的实现。控制是通过修改沿底壁的温度分布来执行的,温度分布分为多个控制段,每个控制段作为一个独立的代理。在瑞利数\(\textrm{Ra}=500\)和750处考虑两种RBC状态。对学习控制策略的评估显示对流强度分别减少\(23.5\%\)和\(8.7\%\)在\(\textrm{Ra}=500\)和750。MARL控制器将不规则形状的对流模式转换为具有较低对流的规则直卷,类似于相对更稳定的流动状态。我们在\(\textrm{Ra}\)与比例控制进行了比较,并表明MARL能够优于比例控制器。学习的控制策略是复杂的,具有不同的非线性分段执行器延迟和驱动量。我们还在比用于训练的更大的域上进行了成功的评估,证明了MARL的不变性允许直接转移学到的策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Flow, Turbulence and Combustion
Flow, Turbulence and Combustion 工程技术-力学
CiteScore
5.70
自引率
8.30%
发文量
72
审稿时长
2 months
期刊介绍: Flow, Turbulence and Combustion provides a global forum for the publication of original and innovative research results that contribute to the solution of fundamental and applied problems encountered in single-phase, multi-phase and reacting flows, in both idealized and real systems. The scope of coverage encompasses topics in fluid dynamics, scalar transport, multi-physics interactions and flow control. From time to time the journal publishes Special or Theme Issues featuring invited articles. Contributions may report research that falls within the broad spectrum of analytical, computational and experimental methods. This includes research conducted in academia, industry and a variety of environmental and geophysical sectors. Turbulence, transition and associated phenomena are expected to play a significant role in the majority of studies reported, although non-turbulent flows, typical of those in micro-devices, would be regarded as falling within the scope covered. The emphasis is on originality, timeliness, quality and thematic fit, as exemplified by the title of the journal and the qualifications described above. Relevance to real-world problems and industrial applications are regarded as strengths.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信