梁上伺服驱动中心旋转球的强化学习控制

2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS) Pub Date : 2020-11-26 DOI:10.1109/ICIIS51140.2020.9342690

Archana Ganesh, Banu Sundareswari Murugesan, M. Panda, T. Ganapathy, Dhanalakshmi Kaliaperumal

{"title":"梁上伺服驱动中心旋转球的强化学习控制","authors":"Archana Ganesh, Banu Sundareswari Murugesan, M. Panda, T. Ganapathy, Dhanalakshmi Kaliaperumal","doi":"10.1109/ICIIS51140.2020.9342690","DOIUrl":null,"url":null,"abstract":"The objective of this work is to devise a controller using Reinforcement Learning (RL) agents, for unstable and complex control systems like the ball beam system. The reinforcement learning agent's job is to keep the ball's position as close as possible to a set point. The Reinforcement Learning agent learns through rewards. Every action is taken such that the reward value is maximized. The reward becomes maximum if setpoint and the current ball position are as close as possible. So, a ball position from the sensor, in terms of reward is taken as feedback to predict the next action. The predicted action is the angle of the beam which needs to be turned by the motor. The action space considered is of a continuous domain, and the Reinforcement Learning algorithms that have been used are Proximal Policy Optimization (PPO) and Deep Deterministic Policy Gradient (DDPG). Once the environment dynamics are defined, hyper-parameters of the reinforcement learning algorithms pertaining to this environment are tuned, and the model is trained. Servo motor is used as the actuation mechanism.","PeriodicalId":352858,"journal":{"name":"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)","volume":"344 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement learning control of servo actuated centrally pivoted ball on a beam\",\"authors\":\"Archana Ganesh, Banu Sundareswari Murugesan, M. Panda, T. Ganapathy, Dhanalakshmi Kaliaperumal\",\"doi\":\"10.1109/ICIIS51140.2020.9342690\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The objective of this work is to devise a controller using Reinforcement Learning (RL) agents, for unstable and complex control systems like the ball beam system. The reinforcement learning agent's job is to keep the ball's position as close as possible to a set point. The Reinforcement Learning agent learns through rewards. Every action is taken such that the reward value is maximized. The reward becomes maximum if setpoint and the current ball position are as close as possible. So, a ball position from the sensor, in terms of reward is taken as feedback to predict the next action. The predicted action is the angle of the beam which needs to be turned by the motor. The action space considered is of a continuous domain, and the Reinforcement Learning algorithms that have been used are Proximal Policy Optimization (PPO) and Deep Deterministic Policy Gradient (DDPG). Once the environment dynamics are defined, hyper-parameters of the reinforcement learning algorithms pertaining to this environment are tuned, and the model is trained. Servo motor is used as the actuation mechanism.\",\"PeriodicalId\":352858,\"journal\":{\"name\":\"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)\",\"volume\":\"344 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIIS51140.2020.9342690\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIS51140.2020.9342690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

这项工作的目的是设计一个使用强化学习(RL)代理的控制器，用于不稳定和复杂的控制系统，如球梁系统。强化学习代理的工作是保持球的位置尽可能接近一个设定点。强化学习代理通过奖励进行学习。玩家采取的每一个行动都是为了最大化奖励价值。如果设定值和当前球的位置尽可能接近，奖励就会达到最大值。所以，从传感器得到的球的位置，就奖励而言，被作为预测下一步动作的反馈。预测的动作是需要由电机转动的光束的角度。考虑的动作空间是一个连续域，使用的强化学习算法是近端策略优化(PPO)和深度确定性策略梯度(DDPG)。一旦定义了环境动力学，就可以调整与该环境相关的强化学习算法的超参数，并训练模型。采用伺服电机作为驱动机构。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Reinforcement learning control of servo actuated centrally pivoted ball on a beam

The objective of this work is to devise a controller using Reinforcement Learning (RL) agents, for unstable and complex control systems like the ball beam system. The reinforcement learning agent's job is to keep the ball's position as close as possible to a set point. The Reinforcement Learning agent learns through rewards. Every action is taken such that the reward value is maximized. The reward becomes maximum if setpoint and the current ball position are as close as possible. So, a ball position from the sensor, in terms of reward is taken as feedback to predict the next action. The predicted action is the angle of the beam which needs to be turned by the motor. The action space considered is of a continuous domain, and the Reinforcement Learning algorithms that have been used are Proximal Policy Optimization (PPO) and Deep Deterministic Policy Gradient (DDPG). Once the environment dynamics are defined, hyper-parameters of the reinforcement learning algorithms pertaining to this environment are tuned, and the model is trained. Servo motor is used as the actuation mechanism.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)

自引率

0.00%

发文量