部分可观测环境下混沌系统控制的强化学习

IF 2.4 3区工程技术 Q3 MECHANICS

Flow, Turbulence and Combustion Pub Date : 2025-01-09 DOI:10.1007/s10494-024-00632-5

Max Weissenbacher, Anastasia Borovykh, Georgios Rigas

{"title":"部分可观测环境下混沌系统控制的强化学习","authors":"Max Weissenbacher, Anastasia Borovykh, Georgios Rigas","doi":"10.1007/s10494-024-00632-5","DOIUrl":null,"url":null,"abstract":"<div><p>Control of chaotic systems has far-reaching implications in engineering, including fluid-based energy and transport systems, among many other fields. In real-world applications, control algorithms typically operate only with partial information about the system (<i>partial observability</i>) due to limited sensing, which leads to sub-optimal performance when compared to the case where a controller has access to the full system state (<i>full observability</i>). While it is well-known that the effect of partial observability can be mediated by introducing a memory component, which allows the controller to keep track of the system’s partial state history, the effect of the type of memory on performance in chaotic regimes is poorly understood. In this study we investigate the use of reinforcement learning for controlling chaotic flows using only partial observations. We use the chaotic Kuramoto–Sivashinsky equation with a forcing term as a model system. In contrast to previous studies, we consider the flow in a variety of dynamic regimes, ranging from mildly to strongly chaotic. We evaluate the loss of performance as the number of sensors available to the controller decreases. We then compare two different frameworks to incorporate memory into the controller, one based on recurrent neural networks and another novel mechanism based on transformers. We demonstrate that the attention-based framework robustly outperforms the alternatives in a range of dynamic regimes. In particular, our method yields improved control in highly chaotic environments, suggesting that attention-based mechanisms may be better suited to the control of chaotic systems.</p></div>","PeriodicalId":559,"journal":{"name":"Flow, Turbulence and Combustion","volume":"115 :","pages":"1357 - 1378"},"PeriodicalIF":2.4000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10494-024-00632-5.pdf","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning of Chaotic Systems Control in Partially Observable Environments\",\"authors\":\"Max Weissenbacher, Anastasia Borovykh, Georgios Rigas\",\"doi\":\"10.1007/s10494-024-00632-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Control of chaotic systems has far-reaching implications in engineering, including fluid-based energy and transport systems, among many other fields. In real-world applications, control algorithms typically operate only with partial information about the system (<i>partial observability</i>) due to limited sensing, which leads to sub-optimal performance when compared to the case where a controller has access to the full system state (<i>full observability</i>). While it is well-known that the effect of partial observability can be mediated by introducing a memory component, which allows the controller to keep track of the system’s partial state history, the effect of the type of memory on performance in chaotic regimes is poorly understood. In this study we investigate the use of reinforcement learning for controlling chaotic flows using only partial observations. We use the chaotic Kuramoto–Sivashinsky equation with a forcing term as a model system. In contrast to previous studies, we consider the flow in a variety of dynamic regimes, ranging from mildly to strongly chaotic. We evaluate the loss of performance as the number of sensors available to the controller decreases. We then compare two different frameworks to incorporate memory into the controller, one based on recurrent neural networks and another novel mechanism based on transformers. We demonstrate that the attention-based framework robustly outperforms the alternatives in a range of dynamic regimes. In particular, our method yields improved control in highly chaotic environments, suggesting that attention-based mechanisms may be better suited to the control of chaotic systems.</p></div>\",\"PeriodicalId\":559,\"journal\":{\"name\":\"Flow, Turbulence and Combustion\",\"volume\":\"115 :\",\"pages\":\"1357 - 1378\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-01-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s10494-024-00632-5.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Flow, Turbulence and Combustion\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10494-024-00632-5\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MECHANICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Flow, Turbulence and Combustion","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10494-024-00632-5","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MECHANICS","Score":null,"Total":0}

引用次数: 0

摘要

混沌系统的控制在工程中具有深远的意义，包括基于流体的能源和运输系统，以及许多其他领域。在现实世界的应用中，由于有限的传感，控制算法通常只使用有关系统的部分信息（部分可观察性）进行操作，与控制器可以访问完整系统状态（完全可观察性）的情况相比，这导致了次优性能。众所周知，部分可观察性的影响可以通过引入内存组件来调节，内存组件允许控制器跟踪系统的部分状态历史，但在混沌状态下，内存类型对性能的影响知之甚少。在本研究中，我们研究了仅使用部分观测来控制混沌流的强化学习的使用。我们使用带强迫项的混沌Kuramoto-Sivashinsky方程作为模型系统。与以往的研究相反，我们考虑了各种动态状态下的流动，从轻度到强混沌。当控制器可用的传感器数量减少时，我们评估性能损失。然后，我们比较了两种不同的框架来将记忆整合到控制器中，一种基于循环神经网络，另一种基于变压器的新机制。我们证明了基于注意力的框架在一系列动态机制中稳健地优于替代方案。特别是，我们的方法在高度混乱的环境中产生了更好的控制，这表明基于注意力的机制可能更适合于混沌系统的控制。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Reinforcement Learning of Chaotic Systems Control in Partially Observable Environments

Control of chaotic systems has far-reaching implications in engineering, including fluid-based energy and transport systems, among many other fields. In real-world applications, control algorithms typically operate only with partial information about the system (partial observability) due to limited sensing, which leads to sub-optimal performance when compared to the case where a controller has access to the full system state (full observability). While it is well-known that the effect of partial observability can be mediated by introducing a memory component, which allows the controller to keep track of the system’s partial state history, the effect of the type of memory on performance in chaotic regimes is poorly understood. In this study we investigate the use of reinforcement learning for controlling chaotic flows using only partial observations. We use the chaotic Kuramoto–Sivashinsky equation with a forcing term as a model system. In contrast to previous studies, we consider the flow in a variety of dynamic regimes, ranging from mildly to strongly chaotic. We evaluate the loss of performance as the number of sensors available to the controller decreases. We then compare two different frameworks to incorporate memory into the controller, one based on recurrent neural networks and another novel mechanism based on transformers. We demonstrate that the attention-based framework robustly outperforms the alternatives in a range of dynamic regimes. In particular, our method yields improved control in highly chaotic environments, suggesting that attention-based mechanisms may be better suited to the control of chaotic systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Flow, Turbulence and Combustion 工程技术-力学

CiteScore

5.70

自引率

8.30%

发文量

审稿时长

2 months

期刊介绍： Flow, Turbulence and Combustion provides a global forum for the publication of original and innovative research results that contribute to the solution of fundamental and applied problems encountered in single-phase, multi-phase and reacting flows, in both idealized and real systems. The scope of coverage encompasses topics in fluid dynamics, scalar transport, multi-physics interactions and flow control. From time to time the journal publishes Special or Theme Issues featuring invited articles. Contributions may report research that falls within the broad spectrum of analytical, computational and experimental methods. This includes research conducted in academia, industry and a variety of environmental and geophysical sectors. Turbulence, transition and associated phenomena are expected to play a significant role in the majority of studies reported, although non-turbulent flows, typical of those in micro-devices, would be regarded as falling within the scope covered. The emphasis is on originality, timeliness, quality and thematic fit, as exemplified by the title of the journal and the qualifications described above. Relevance to real-world problems and industrial applications are regarded as strengths.