运动三柔性耦合梁的强化学习振动控制

IF 4.3 2区 工程技术 Q1 ACOUSTICS
Zhi-cheng Qiu, Jun-fei Hu, Xian-min Zhang
{"title":"运动三柔性耦合梁的强化学习振动控制","authors":"Zhi-cheng Qiu,&nbsp;Jun-fei Hu,&nbsp;Xian-min Zhang","doi":"10.1016/j.jsv.2025.119069","DOIUrl":null,"url":null,"abstract":"<div><div>A multi-agent reinforcement learning (RL) vibration controller is designed to actively suppress the vibration of a three-flexible coupled moving beam system and optimize the translational motion trajectory to minimize the residual vibration after motion. The finite element method is used to model the dynamic equation of the system, and the parameters of the actual model are identified by wavelet transform and intelligent optimization algorithm. A structured light visual vibration detection system is constructed to measure the vibration of three flexible beams. Based on the modified model, a trajectory optimization method of the three-flexible beam coupling system based on neural network optimization algorithm is designed. The QMIX and phasic policy gradient (PPG) controller are trained to obtain excellent nonlinear controllers for vibration control of piezoelectric actuators. The simulation and experimental results show that the optimized trajectory greatly reduces the vibration excitation, and the vibration at the end of the translation and movement is quickly suppressed. For the dual flexible beam coupled system, the designed QMIX controller suppresses the vibration completely in about 7 s, while under large gain proportional and derivative (PD) control, there is still residual vibration in 20 s. For the vibration of the first bending mode of the single flexible beam system, the designed PPG controller completely suppresses the vibration in about 3 s, while the vibration attenuation takes about 10 s for the large gain PD control. For the vibration of the first two bending modes, the PPG controller completely suppresses the vibration in about 4 s, while it takes about 11 s for the large gain PD control. The control effects of the two RL methods are better than that of PD control, especially for suppressing small amplitude vibration.</div></div>","PeriodicalId":17233,"journal":{"name":"Journal of Sound and Vibration","volume":"608 ","pages":"Article 119069"},"PeriodicalIF":4.3000,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement learning vibration control of moving three flexible coupled beams\",\"authors\":\"Zhi-cheng Qiu,&nbsp;Jun-fei Hu,&nbsp;Xian-min Zhang\",\"doi\":\"10.1016/j.jsv.2025.119069\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>A multi-agent reinforcement learning (RL) vibration controller is designed to actively suppress the vibration of a three-flexible coupled moving beam system and optimize the translational motion trajectory to minimize the residual vibration after motion. The finite element method is used to model the dynamic equation of the system, and the parameters of the actual model are identified by wavelet transform and intelligent optimization algorithm. A structured light visual vibration detection system is constructed to measure the vibration of three flexible beams. Based on the modified model, a trajectory optimization method of the three-flexible beam coupling system based on neural network optimization algorithm is designed. The QMIX and phasic policy gradient (PPG) controller are trained to obtain excellent nonlinear controllers for vibration control of piezoelectric actuators. The simulation and experimental results show that the optimized trajectory greatly reduces the vibration excitation, and the vibration at the end of the translation and movement is quickly suppressed. For the dual flexible beam coupled system, the designed QMIX controller suppresses the vibration completely in about 7 s, while under large gain proportional and derivative (PD) control, there is still residual vibration in 20 s. For the vibration of the first bending mode of the single flexible beam system, the designed PPG controller completely suppresses the vibration in about 3 s, while the vibration attenuation takes about 10 s for the large gain PD control. For the vibration of the first two bending modes, the PPG controller completely suppresses the vibration in about 4 s, while it takes about 11 s for the large gain PD control. The control effects of the two RL methods are better than that of PD control, especially for suppressing small amplitude vibration.</div></div>\",\"PeriodicalId\":17233,\"journal\":{\"name\":\"Journal of Sound and Vibration\",\"volume\":\"608 \",\"pages\":\"Article 119069\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Sound and Vibration\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0022460X25001439\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Sound and Vibration","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022460X25001439","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

摘要

设计了一种多智能体强化学习(RL)振动控制器,以主动抑制三柔性耦合动梁系统的振动,并优化平移运动轨迹,使运动后残余振动最小化。采用有限元法对系统动力学方程进行建模,并采用小波变换和智能优化算法对实际模型参数进行辨识。构造了一种结构光视觉振动检测系统,用于测量三根柔性梁的振动。在修正模型的基础上,设计了一种基于神经网络优化算法的三柔性梁耦合系统轨迹优化方法。通过对QMIX和相位策略梯度(PPG)控制器的训练,得到了用于压电致动器振动控制的优秀非线性控制器。仿真和实验结果表明,优化后的轨迹大大降低了振动激励,平移和运动结束时的振动得到了快速抑制。对于双柔性梁耦合系统,所设计的QMIX控制器在7 s左右就能完全抑制振动,而在大增益比例导数(PD)控制下,20 s左右仍有残余振动。对于单柔性梁系统第一弯曲模态的振动,所设计的PPG控制器在3 s左右完全抑制了振动,而对于大增益PD控制,振动衰减大约需要10 s。对于前两种弯曲模态的振动,PPG控制器在约4 s内完全抑制了振动,而大增益PD控制则需要约11 s。两种RL方法的控制效果都优于PD控制,特别是在抑制小振幅振动方面。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Reinforcement learning vibration control of moving three flexible coupled beams

Reinforcement learning vibration control of moving three flexible coupled beams
A multi-agent reinforcement learning (RL) vibration controller is designed to actively suppress the vibration of a three-flexible coupled moving beam system and optimize the translational motion trajectory to minimize the residual vibration after motion. The finite element method is used to model the dynamic equation of the system, and the parameters of the actual model are identified by wavelet transform and intelligent optimization algorithm. A structured light visual vibration detection system is constructed to measure the vibration of three flexible beams. Based on the modified model, a trajectory optimization method of the three-flexible beam coupling system based on neural network optimization algorithm is designed. The QMIX and phasic policy gradient (PPG) controller are trained to obtain excellent nonlinear controllers for vibration control of piezoelectric actuators. The simulation and experimental results show that the optimized trajectory greatly reduces the vibration excitation, and the vibration at the end of the translation and movement is quickly suppressed. For the dual flexible beam coupled system, the designed QMIX controller suppresses the vibration completely in about 7 s, while under large gain proportional and derivative (PD) control, there is still residual vibration in 20 s. For the vibration of the first bending mode of the single flexible beam system, the designed PPG controller completely suppresses the vibration in about 3 s, while the vibration attenuation takes about 10 s for the large gain PD control. For the vibration of the first two bending modes, the PPG controller completely suppresses the vibration in about 4 s, while it takes about 11 s for the large gain PD control. The control effects of the two RL methods are better than that of PD control, especially for suppressing small amplitude vibration.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Sound and Vibration
Journal of Sound and Vibration 工程技术-工程:机械
CiteScore
9.10
自引率
10.60%
发文量
551
审稿时长
69 days
期刊介绍: The Journal of Sound and Vibration (JSV) is an independent journal devoted to the prompt publication of original papers, both theoretical and experimental, that provide new information on any aspect of sound or vibration. There is an emphasis on fundamental work that has potential for practical application. JSV was founded and operates on the premise that the subject of sound and vibration requires a journal that publishes papers of a high technical standard across the various subdisciplines, thus facilitating awareness of techniques and discoveries in one area that may be applicable in others.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信