Kala Agbo bidi , Jean-Michel Coron , Amaury Hayat , Nathan Lichtlé
{"title":"一种基于深度强化学习的反馈控制新方法","authors":"Kala Agbo bidi , Jean-Michel Coron , Amaury Hayat , Nathan Lichtlé","doi":"10.1016/j.sysconle.2025.106102","DOIUrl":null,"url":null,"abstract":"<div><div>We present a novel approach to feedback control design by leveraging the power of deep reinforcement learning (RL). The goal is to blend the RL methodology with mathematical analysis to extract an explicit feedback control for systems where, because of constraints or limited measurements, the classical approaches of control theory (e.g. dynamical programming, optimal control-based feedback, backstepping, etc.) cannot be used.</div><div>We study a dynamical system of mosquito populations for biological pest control using the Sterile Insect Technique (SIT), a method traditionally applied in agriculture that involves releasing large numbers of sterile insects into the wild to reduce pest populations. Our goal is to derive a feedback control that globally stabilizes the system around the zero-mosquito equilibrium using only practical measurements, such as total male and female mosquito counts, rather than detailed counts of sterilized versus potent males or fecund versus unfecund females, which are often not accessible. This physical constraint presents challenges for classical methods control theory, as the full state cannot be measured. To address this, we apply deep reinforcement learning to suggest feedback laws for a discretized system that only rely on these accessible, real-world measurements, obtainable through methods like pheromone traps. Finally, we leverage the trained neural network to extract explicit feedback controls that stabilize the original continuous system over a wide range of initial conditions.</div><div>Many other dynamical systems arising from practical applications are subject to measurement constraints, which render the stabilization problem complex from a mathematical perspective. We believe that this approach could help in finding new solutions to these problems.</div></div>","PeriodicalId":49450,"journal":{"name":"Systems & Control Letters","volume":"202 ","pages":"Article 106102"},"PeriodicalIF":2.1000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel approach to feedback control with deep reinforcement learning\",\"authors\":\"Kala Agbo bidi , Jean-Michel Coron , Amaury Hayat , Nathan Lichtlé\",\"doi\":\"10.1016/j.sysconle.2025.106102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>We present a novel approach to feedback control design by leveraging the power of deep reinforcement learning (RL). The goal is to blend the RL methodology with mathematical analysis to extract an explicit feedback control for systems where, because of constraints or limited measurements, the classical approaches of control theory (e.g. dynamical programming, optimal control-based feedback, backstepping, etc.) cannot be used.</div><div>We study a dynamical system of mosquito populations for biological pest control using the Sterile Insect Technique (SIT), a method traditionally applied in agriculture that involves releasing large numbers of sterile insects into the wild to reduce pest populations. Our goal is to derive a feedback control that globally stabilizes the system around the zero-mosquito equilibrium using only practical measurements, such as total male and female mosquito counts, rather than detailed counts of sterilized versus potent males or fecund versus unfecund females, which are often not accessible. This physical constraint presents challenges for classical methods control theory, as the full state cannot be measured. To address this, we apply deep reinforcement learning to suggest feedback laws for a discretized system that only rely on these accessible, real-world measurements, obtainable through methods like pheromone traps. Finally, we leverage the trained neural network to extract explicit feedback controls that stabilize the original continuous system over a wide range of initial conditions.</div><div>Many other dynamical systems arising from practical applications are subject to measurement constraints, which render the stabilization problem complex from a mathematical perspective. We believe that this approach could help in finding new solutions to these problems.</div></div>\",\"PeriodicalId\":49450,\"journal\":{\"name\":\"Systems & Control Letters\",\"volume\":\"202 \",\"pages\":\"Article 106102\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Systems & Control Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167691125000842\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Systems & Control Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167691125000842","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
A novel approach to feedback control with deep reinforcement learning
We present a novel approach to feedback control design by leveraging the power of deep reinforcement learning (RL). The goal is to blend the RL methodology with mathematical analysis to extract an explicit feedback control for systems where, because of constraints or limited measurements, the classical approaches of control theory (e.g. dynamical programming, optimal control-based feedback, backstepping, etc.) cannot be used.
We study a dynamical system of mosquito populations for biological pest control using the Sterile Insect Technique (SIT), a method traditionally applied in agriculture that involves releasing large numbers of sterile insects into the wild to reduce pest populations. Our goal is to derive a feedback control that globally stabilizes the system around the zero-mosquito equilibrium using only practical measurements, such as total male and female mosquito counts, rather than detailed counts of sterilized versus potent males or fecund versus unfecund females, which are often not accessible. This physical constraint presents challenges for classical methods control theory, as the full state cannot be measured. To address this, we apply deep reinforcement learning to suggest feedback laws for a discretized system that only rely on these accessible, real-world measurements, obtainable through methods like pheromone traps. Finally, we leverage the trained neural network to extract explicit feedback controls that stabilize the original continuous system over a wide range of initial conditions.
Many other dynamical systems arising from practical applications are subject to measurement constraints, which render the stabilization problem complex from a mathematical perspective. We believe that this approach could help in finding new solutions to these problems.
期刊介绍:
Founded in 1981 by two of the pre-eminent control theorists, Roger Brockett and Jan Willems, Systems & Control Letters is one of the leading journals in the field of control theory. The aim of the journal is to allow dissemination of relatively concise but highly original contributions whose high initial quality enables a relatively rapid review process. All aspects of the fields of systems and control are covered, especially mathematically-oriented and theoretical papers that have a clear relevance to engineering, physical and biological sciences, and even economics. Application-oriented papers with sophisticated and rigorous mathematical elements are also welcome.