{"title":"大时滞下最小输入输出装置剪切流的多步强化学习控制","authors":"Amine Saibi, Lionel Mathelin, Onofrio Semeraro","doi":"10.1007/s10494-025-00697-w","DOIUrl":null,"url":null,"abstract":"<div><p>Flow control has attracted research for its potential role in reducing drag, suppressing turbulence, and enhancing mixing in fluid systems. The emergence of data-driven modeling and machine learning techniques has sparked new interest in designing control strategies that can adapt in real time to complex, high-dimensional flow environments. However, fluid systems remain particularly challenging testbeds for control design due to their nonlinear and convective nature, which introduces large time delays. In active control, additional difficulties arise from practical constraints, such as the use of localized sensors in limited number. In this work, we investigate a reinforcement learning framework based on a suitable actor–critic algorithm designed to address these challenges. Two test cases representative of transitional shear flows are considered: a linearized version of the Kuramoto–Sivashinsky equation and the control of instabilities in a two-dimensional boundary-layer flow over a flat plate, using a minimal but realistic sensor–actuator configuration. This choice reflects our focus on the limitations that arise from plants of experimental interest. Time delays are identified during a pretraining stage, while the control algorithm employs multistep returns during value iteration. This approach improves both the convergence rate and stability of learning. Furthermore, we show that the look-ahead in the multistep formulation provides a non-trivial beneficial effect in plants where the control task is characterized by a severe credit-assignment issue.</p></div>","PeriodicalId":559,"journal":{"name":"Flow, Turbulence and Combustion","volume":"115 :","pages":"1379 - 1402"},"PeriodicalIF":2.4000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Multistep Reinforcement Learning Control of Shear Flows in Minimal Input–Output Plants Under Large Time-delays\",\"authors\":\"Amine Saibi, Lionel Mathelin, Onofrio Semeraro\",\"doi\":\"10.1007/s10494-025-00697-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Flow control has attracted research for its potential role in reducing drag, suppressing turbulence, and enhancing mixing in fluid systems. The emergence of data-driven modeling and machine learning techniques has sparked new interest in designing control strategies that can adapt in real time to complex, high-dimensional flow environments. However, fluid systems remain particularly challenging testbeds for control design due to their nonlinear and convective nature, which introduces large time delays. In active control, additional difficulties arise from practical constraints, such as the use of localized sensors in limited number. In this work, we investigate a reinforcement learning framework based on a suitable actor–critic algorithm designed to address these challenges. Two test cases representative of transitional shear flows are considered: a linearized version of the Kuramoto–Sivashinsky equation and the control of instabilities in a two-dimensional boundary-layer flow over a flat plate, using a minimal but realistic sensor–actuator configuration. This choice reflects our focus on the limitations that arise from plants of experimental interest. Time delays are identified during a pretraining stage, while the control algorithm employs multistep returns during value iteration. This approach improves both the convergence rate and stability of learning. Furthermore, we show that the look-ahead in the multistep formulation provides a non-trivial beneficial effect in plants where the control task is characterized by a severe credit-assignment issue.</p></div>\",\"PeriodicalId\":559,\"journal\":{\"name\":\"Flow, Turbulence and Combustion\",\"volume\":\"115 :\",\"pages\":\"1379 - 1402\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Flow, Turbulence and Combustion\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10494-025-00697-w\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MECHANICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Flow, Turbulence and Combustion","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10494-025-00697-w","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MECHANICS","Score":null,"Total":0}
A Multistep Reinforcement Learning Control of Shear Flows in Minimal Input–Output Plants Under Large Time-delays
Flow control has attracted research for its potential role in reducing drag, suppressing turbulence, and enhancing mixing in fluid systems. The emergence of data-driven modeling and machine learning techniques has sparked new interest in designing control strategies that can adapt in real time to complex, high-dimensional flow environments. However, fluid systems remain particularly challenging testbeds for control design due to their nonlinear and convective nature, which introduces large time delays. In active control, additional difficulties arise from practical constraints, such as the use of localized sensors in limited number. In this work, we investigate a reinforcement learning framework based on a suitable actor–critic algorithm designed to address these challenges. Two test cases representative of transitional shear flows are considered: a linearized version of the Kuramoto–Sivashinsky equation and the control of instabilities in a two-dimensional boundary-layer flow over a flat plate, using a minimal but realistic sensor–actuator configuration. This choice reflects our focus on the limitations that arise from plants of experimental interest. Time delays are identified during a pretraining stage, while the control algorithm employs multistep returns during value iteration. This approach improves both the convergence rate and stability of learning. Furthermore, we show that the look-ahead in the multistep formulation provides a non-trivial beneficial effect in plants where the control task is characterized by a severe credit-assignment issue.
期刊介绍:
Flow, Turbulence and Combustion provides a global forum for the publication of original and innovative research results that contribute to the solution of fundamental and applied problems encountered in single-phase, multi-phase and reacting flows, in both idealized and real systems. The scope of coverage encompasses topics in fluid dynamics, scalar transport, multi-physics interactions and flow control. From time to time the journal publishes Special or Theme Issues featuring invited articles.
Contributions may report research that falls within the broad spectrum of analytical, computational and experimental methods. This includes research conducted in academia, industry and a variety of environmental and geophysical sectors. Turbulence, transition and associated phenomena are expected to play a significant role in the majority of studies reported, although non-turbulent flows, typical of those in micro-devices, would be regarded as falling within the scope covered. The emphasis is on originality, timeliness, quality and thematic fit, as exemplified by the title of the journal and the qualifications described above. Relevance to real-world problems and industrial applications are regarded as strengths.