Christina Schenk, Aditya Vasudevan, Maciej Haranczyk, Ignacio Romero
{"title":"Model‐based reinforcement learning control of reaction‐diffusion problems","authors":"Christina Schenk, Aditya Vasudevan, Maciej Haranczyk, Ignacio Romero","doi":"10.1002/oca.3196","DOIUrl":null,"url":null,"abstract":"Mathematical and computational tools have proven to be reliable in decision‐making processes. In recent times, in particular, machine learning‐based methods are becoming increasingly popular as advanced support tools. When dealing with control problems, reinforcement learning has been applied to decision‐making in several applications, most notably in games. The success of these methods in finding solutions to complex problems motivates the exploration of new areas where they can be employed to overcome current difficulties. In this article, we explore the use of automatic control strategies to initial boundary value problems in thermal and disease transport. Specifically, in this work, we adapt an existing reinforcement learning algorithm using a stochastic policy gradient method and we introduce two novel reward functions to drive the flow of the transported field. The new model‐based framework exploits the interactions between a reaction‐diffusion model and the modified agent. The results show that certain controls can be implemented successfully in these applications, although model simplifications had to be assumed.","PeriodicalId":501055,"journal":{"name":"Optimal Control Applications and Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optimal Control Applications and Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/oca.3196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Mathematical and computational tools have proven to be reliable in decision‐making processes. In recent times, in particular, machine learning‐based methods are becoming increasingly popular as advanced support tools. When dealing with control problems, reinforcement learning has been applied to decision‐making in several applications, most notably in games. The success of these methods in finding solutions to complex problems motivates the exploration of new areas where they can be employed to overcome current difficulties. In this article, we explore the use of automatic control strategies to initial boundary value problems in thermal and disease transport. Specifically, in this work, we adapt an existing reinforcement learning algorithm using a stochastic policy gradient method and we introduce two novel reward functions to drive the flow of the transported field. The new model‐based framework exploits the interactions between a reaction‐diffusion model and the modified agent. The results show that certain controls can be implemented successfully in these applications, although model simplifications had to be assumed.