Application of an Off-Policy Reinforcement Learning Algorithm for
H
∞
${{H}_\infty }$
Control Design of Nonlinear Structural Systems With Completely Unknown Dynamics
M. Amirmojahedi, A. Mojoodi, Saeed Shojaee, Saleh Hamzehei-Javaran
{"title":"Application of an Off-Policy Reinforcement Learning Algorithm for \n \n \n H\n ∞\n \n ${{H}_\\infty }$\n Control Design of Nonlinear Structural Systems With Completely Unknown Dynamics","authors":"M. Amirmojahedi, A. Mojoodi, Saeed Shojaee, Saleh Hamzehei-Javaran","doi":"10.1002/eqe.4299","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This paper proposes a model-free and online off-policy algorithm based on reinforcement learning (RL) for vibration attenuation of earthquake-excited structures, through designing an optimal <span></span><math>\n <semantics>\n <msub>\n <mi>H</mi>\n <mi>∞</mi>\n </msub>\n <annotation>${{H}_\\infty }$</annotation>\n </semantics></math> controller. This design relies on solving a two-player zero-sum game theory with a Hamilton–Jacobi–Isaacs (HJI) equation, which is extremely difficult, or often impossible, to be solved for the value function and the related optimal controller. The proposed strategy uses an actor-critic-disturbance structure to learn the solution of the HJI equation online and forward in time, without requiring any knowledge of the system dynamics. In addition, the control and disturbance policies and value function are approximated by the actor, the disturbance, and the critic neural networks (NNs), respectively.</p>\n <p>Implementing the policy iteration technique, the NNs’ weights of the proposed model are calculated using the least square (LS) method in each iteration. In the present study, the convergence of the proposed algorithm is investigated through two distinct examples. Furthermore, the performance of this off-policy RL strategy is studied in reducing the response of a seismically excited nonlinear structure with an active mass damper (AMD) for two cases of state feedback. The simulation results prove the effectiveness of the proposed algorithm in application to civil engineering structures.</p>\n </div>","PeriodicalId":11390,"journal":{"name":"Earthquake Engineering & Structural Dynamics","volume":"54 4","pages":"1210-1228"},"PeriodicalIF":4.3000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earthquake Engineering & Structural Dynamics","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/eqe.4299","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes a model-free and online off-policy algorithm based on reinforcement learning (RL) for vibration attenuation of earthquake-excited structures, through designing an optimal controller. This design relies on solving a two-player zero-sum game theory with a Hamilton–Jacobi–Isaacs (HJI) equation, which is extremely difficult, or often impossible, to be solved for the value function and the related optimal controller. The proposed strategy uses an actor-critic-disturbance structure to learn the solution of the HJI equation online and forward in time, without requiring any knowledge of the system dynamics. In addition, the control and disturbance policies and value function are approximated by the actor, the disturbance, and the critic neural networks (NNs), respectively.
Implementing the policy iteration technique, the NNs’ weights of the proposed model are calculated using the least square (LS) method in each iteration. In the present study, the convergence of the proposed algorithm is investigated through two distinct examples. Furthermore, the performance of this off-policy RL strategy is studied in reducing the response of a seismically excited nonlinear structure with an active mass damper (AMD) for two cases of state feedback. The simulation results prove the effectiveness of the proposed algorithm in application to civil engineering structures.
期刊介绍:
Earthquake Engineering and Structural Dynamics provides a forum for the publication of papers on several aspects of engineering related to earthquakes. The problems in this field, and their solutions, are international in character and require knowledge of several traditional disciplines; the Journal will reflect this. Papers that may be relevant but do not emphasize earthquake engineering and related structural dynamics are not suitable for the Journal. Relevant topics include the following:
ground motions for analysis and design
geotechnical earthquake engineering
probabilistic and deterministic methods of dynamic analysis
experimental behaviour of structures
seismic protective systems
system identification
risk assessment
seismic code requirements
methods for earthquake-resistant design and retrofit of structures.