QMIX Multiple Intelligences Reinforcement Learning Damping Control for Cylindrical Shell

Yang Song, Xu Kai, Su Hua, Zhang Gang
{"title":"QMIX Multiple Intelligences Reinforcement Learning Damping Control for Cylindrical Shell","authors":"Yang Song, Xu Kai, Su Hua, Zhang Gang","doi":"10.1109/WCMEIM56910.2022.10021369","DOIUrl":null,"url":null,"abstract":"One of the fundamental mechanical constructions of ships and navigators is the cylindrical shell structure. Their damping control is difficult to predict and frequently depends on precise control models. For that reason, this work provides a data-driven multi-intelligence reinforcement learning damping control approach that is significance for damping control of massive structures. Firstly, the dynamics equations of cylindrical shell structure are established based on the hypothetical modal method, and modal variables are introduced to derive the state-space equations for damping control of cylindrical shell structure, and an interactive environment for multi-intelligent reinforcement learning is established. Secondly, the damping control strategy of cylindrical shell structure with multiple intelligences is designed based on the value decomposition QMIX algorithm. For a single smart body design vibration displacement, velocity, piezoelectric actuator voltage, smart body operation steps as the state space, quadratic performance indicators with saturation characteristics as the damping effect reward function, greedy strategy as damping action selection method for multi-intelligent body cooperative damping. The QMIX algorithm hybrid network performs fusion evaluation of the joint action value of each intelligence and updates the action value function of a single intelligence. Finally, five sets of hyperparameters are set based on the Grid Search approach for comparative simulation experiments for deep learning network hyperparameter selection. The result of the simulation demonstrate that the current tactic effectively suppresses the vibration of the cylindrical shell construction. Furthermore, the optimal hyperparameter is determined by comparing simulation trials with different values, proving that the approach described in this article has better damping performance under this parameter.","PeriodicalId":202270,"journal":{"name":"2022 5th World Conference on Mechanical Engineering and Intelligent Manufacturing (WCMEIM)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th World Conference on Mechanical Engineering and Intelligent Manufacturing (WCMEIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCMEIM56910.2022.10021369","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

One of the fundamental mechanical constructions of ships and navigators is the cylindrical shell structure. Their damping control is difficult to predict and frequently depends on precise control models. For that reason, this work provides a data-driven multi-intelligence reinforcement learning damping control approach that is significance for damping control of massive structures. Firstly, the dynamics equations of cylindrical shell structure are established based on the hypothetical modal method, and modal variables are introduced to derive the state-space equations for damping control of cylindrical shell structure, and an interactive environment for multi-intelligent reinforcement learning is established. Secondly, the damping control strategy of cylindrical shell structure with multiple intelligences is designed based on the value decomposition QMIX algorithm. For a single smart body design vibration displacement, velocity, piezoelectric actuator voltage, smart body operation steps as the state space, quadratic performance indicators with saturation characteristics as the damping effect reward function, greedy strategy as damping action selection method for multi-intelligent body cooperative damping. The QMIX algorithm hybrid network performs fusion evaluation of the joint action value of each intelligence and updates the action value function of a single intelligence. Finally, five sets of hyperparameters are set based on the Grid Search approach for comparative simulation experiments for deep learning network hyperparameter selection. The result of the simulation demonstrate that the current tactic effectively suppresses the vibration of the cylindrical shell construction. Furthermore, the optimal hyperparameter is determined by comparing simulation trials with different values, proving that the approach described in this article has better damping performance under this parameter.
QMIX多智能强化学习圆柱壳阻尼控制
船舶和导航仪的基本机械结构之一是圆柱壳结构。它们的阻尼控制很难预测,往往依赖于精确的控制模型。因此,本研究提供了一种数据驱动的多智能强化学习阻尼控制方法,对大型结构的阻尼控制具有重要意义。首先,基于假设模态法建立了圆柱壳结构的动力学方程,引入模态变量导出了圆柱壳结构阻尼控制的状态空间方程,建立了多智能强化学习的交互环境;其次,设计了基于值分解QMIX算法的多智能圆柱壳结构阻尼控制策略;针对单个智能体设计振动位移、速度、压电致动器电压、智能体运行步数为状态空间,以具有饱和特征的二次性能指标为阻尼效果奖励函数,以贪婪策略为阻尼动作选择方法,实现多智能体协同阻尼。QMIX算法混合网络对各智能的联合动作值进行融合评估,更新单个智能的动作值函数。最后,基于网格搜索方法设置了5组超参数,进行了深度学习网络超参数选择的对比仿真实验。仿真结果表明,该策略有效地抑制了圆柱壳结构的振动。通过不同数值的仿真试验对比,确定了最优的超参数,证明了本文方法在该参数下具有较好的阻尼性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信