梁上伺服驱动中心旋转球的强化学习控制

Archana Ganesh, Banu Sundareswari Murugesan, M. Panda, T. Ganapathy, Dhanalakshmi Kaliaperumal
{"title":"梁上伺服驱动中心旋转球的强化学习控制","authors":"Archana Ganesh, Banu Sundareswari Murugesan, M. Panda, T. Ganapathy, Dhanalakshmi Kaliaperumal","doi":"10.1109/ICIIS51140.2020.9342690","DOIUrl":null,"url":null,"abstract":"The objective of this work is to devise a controller using Reinforcement Learning (RL) agents, for unstable and complex control systems like the ball beam system. The reinforcement learning agent's job is to keep the ball's position as close as possible to a set point. The Reinforcement Learning agent learns through rewards. Every action is taken such that the reward value is maximized. The reward becomes maximum if setpoint and the current ball position are as close as possible. So, a ball position from the sensor, in terms of reward is taken as feedback to predict the next action. The predicted action is the angle of the beam which needs to be turned by the motor. The action space considered is of a continuous domain, and the Reinforcement Learning algorithms that have been used are Proximal Policy Optimization (PPO) and Deep Deterministic Policy Gradient (DDPG). Once the environment dynamics are defined, hyper-parameters of the reinforcement learning algorithms pertaining to this environment are tuned, and the model is trained. Servo motor is used as the actuation mechanism.","PeriodicalId":352858,"journal":{"name":"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)","volume":"344 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement learning control of servo actuated centrally pivoted ball on a beam\",\"authors\":\"Archana Ganesh, Banu Sundareswari Murugesan, M. Panda, T. Ganapathy, Dhanalakshmi Kaliaperumal\",\"doi\":\"10.1109/ICIIS51140.2020.9342690\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The objective of this work is to devise a controller using Reinforcement Learning (RL) agents, for unstable and complex control systems like the ball beam system. The reinforcement learning agent's job is to keep the ball's position as close as possible to a set point. The Reinforcement Learning agent learns through rewards. Every action is taken such that the reward value is maximized. The reward becomes maximum if setpoint and the current ball position are as close as possible. So, a ball position from the sensor, in terms of reward is taken as feedback to predict the next action. The predicted action is the angle of the beam which needs to be turned by the motor. The action space considered is of a continuous domain, and the Reinforcement Learning algorithms that have been used are Proximal Policy Optimization (PPO) and Deep Deterministic Policy Gradient (DDPG). Once the environment dynamics are defined, hyper-parameters of the reinforcement learning algorithms pertaining to this environment are tuned, and the model is trained. Servo motor is used as the actuation mechanism.\",\"PeriodicalId\":352858,\"journal\":{\"name\":\"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)\",\"volume\":\"344 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIIS51140.2020.9342690\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIS51140.2020.9342690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

这项工作的目的是设计一个使用强化学习(RL)代理的控制器,用于不稳定和复杂的控制系统,如球梁系统。强化学习代理的工作是保持球的位置尽可能接近一个设定点。强化学习代理通过奖励进行学习。玩家采取的每一个行动都是为了最大化奖励价值。如果设定值和当前球的位置尽可能接近,奖励就会达到最大值。所以,从传感器得到的球的位置,就奖励而言,被作为预测下一步动作的反馈。预测的动作是需要由电机转动的光束的角度。考虑的动作空间是一个连续域,使用的强化学习算法是近端策略优化(PPO)和深度确定性策略梯度(DDPG)。一旦定义了环境动力学,就可以调整与该环境相关的强化学习算法的超参数,并训练模型。采用伺服电机作为驱动机构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Reinforcement learning control of servo actuated centrally pivoted ball on a beam
The objective of this work is to devise a controller using Reinforcement Learning (RL) agents, for unstable and complex control systems like the ball beam system. The reinforcement learning agent's job is to keep the ball's position as close as possible to a set point. The Reinforcement Learning agent learns through rewards. Every action is taken such that the reward value is maximized. The reward becomes maximum if setpoint and the current ball position are as close as possible. So, a ball position from the sensor, in terms of reward is taken as feedback to predict the next action. The predicted action is the angle of the beam which needs to be turned by the motor. The action space considered is of a continuous domain, and the Reinforcement Learning algorithms that have been used are Proximal Policy Optimization (PPO) and Deep Deterministic Policy Gradient (DDPG). Once the environment dynamics are defined, hyper-parameters of the reinforcement learning algorithms pertaining to this environment are tuned, and the model is trained. Servo motor is used as the actuation mechanism.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信