基于PID控制器和深度强化学习的两轮自平衡机器人

2022 22nd International Conference on Control, Automation and Systems (ICCAS) Pub Date : 2022-11-27 DOI:10.23919/ICCAS55662.2022.10003940

G. S. Krishna, D.M Sumith, Garika Akshay

{"title":"基于PID控制器和深度强化学习的两轮自平衡机器人","authors":"G. S. Krishna, D.M Sumith, Garika Akshay","doi":"10.23919/ICCAS55662.2022.10003940","DOIUrl":null,"url":null,"abstract":"A two-wheeled self-balancing robot is an example of an inverse pendulum and is an inherently non-linear, unstable system. The fundamental concept of the proposed framework “Epersist” is to overcome the challenge of counterbalancing an initially unstable system by delivering robust control mechanisms, Proportional Integral Derivative (PID), and Reinforcement Learning (RL). Moreover, the micro-controller NodeMCU ESP32 and inertial sensor in the Epersist employ fewer computational procedures to give accurate instruction regarding the spin of wheels to the motor driver, which helps control the wheels and balance the robot. This framework also consists of the mathematical model of the PID controller and a novel self-trained advantage actor-critic algorithm as the RL agent. After several experiments, control variable calibrations are made as the benchmark values to attain the angle of static equilibrium. This “Epersist” framework proposes PID and RL-assisted functional prototypes and simulations for better utility.","PeriodicalId":129856,"journal":{"name":"2022 22nd International Conference on Control, Automation and Systems (ICCAS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Epersist: A Two-Wheeled Self Balancing Robot Using PID Controller And Deep Reinforcement Learning\",\"authors\":\"G. S. Krishna, D.M Sumith, Garika Akshay\",\"doi\":\"10.23919/ICCAS55662.2022.10003940\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A two-wheeled self-balancing robot is an example of an inverse pendulum and is an inherently non-linear, unstable system. The fundamental concept of the proposed framework “Epersist” is to overcome the challenge of counterbalancing an initially unstable system by delivering robust control mechanisms, Proportional Integral Derivative (PID), and Reinforcement Learning (RL). Moreover, the micro-controller NodeMCU ESP32 and inertial sensor in the Epersist employ fewer computational procedures to give accurate instruction regarding the spin of wheels to the motor driver, which helps control the wheels and balance the robot. This framework also consists of the mathematical model of the PID controller and a novel self-trained advantage actor-critic algorithm as the RL agent. After several experiments, control variable calibrations are made as the benchmark values to attain the angle of static equilibrium. This “Epersist” framework proposes PID and RL-assisted functional prototypes and simulations for better utility.\",\"PeriodicalId\":129856,\"journal\":{\"name\":\"2022 22nd International Conference on Control, Automation and Systems (ICCAS)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 22nd International Conference on Control, Automation and Systems (ICCAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/ICCAS55662.2022.10003940\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 22nd International Conference on Control, Automation and Systems (ICCAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ICCAS55662.2022.10003940","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

两轮自平衡机器人是倒摆的一个例子，它本身就是一个非线性的、不稳定的系统。提出的框架“Epersist”的基本概念是通过提供鲁棒控制机制、比例积分导数(PID)和强化学习(RL)来克服平衡最初不稳定系统的挑战。此外，Epersist中的微控制器NodeMCU ESP32和惯性传感器采用较少的计算程序，向电机驾驶员提供有关车轮旋转的准确指令，从而有助于控制车轮和平衡机器人。该框架还包括PID控制器的数学模型和一种新的自我训练的优势行为者批评算法作为强化学习代理。经过多次实验，对控制变量进行标定作为基准值，得到静平衡角。这个“Epersist”框架提出了PID和rl辅助的功能原型和模拟，以获得更好的效用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Epersist: A Two-Wheeled Self Balancing Robot Using PID Controller And Deep Reinforcement Learning

A two-wheeled self-balancing robot is an example of an inverse pendulum and is an inherently non-linear, unstable system. The fundamental concept of the proposed framework “Epersist” is to overcome the challenge of counterbalancing an initially unstable system by delivering robust control mechanisms, Proportional Integral Derivative (PID), and Reinforcement Learning (RL). Moreover, the micro-controller NodeMCU ESP32 and inertial sensor in the Epersist employ fewer computational procedures to give accurate instruction regarding the spin of wheels to the motor driver, which helps control the wheels and balance the robot. This framework also consists of the mathematical model of the PID controller and a novel self-trained advantage actor-critic algorithm as the RL agent. After several experiments, control variable calibrations are made as the benchmark values to attain the angle of static equilibrium. This “Epersist” framework proposes PID and RL-assisted functional prototypes and simulations for better utility.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 22nd International Conference on Control, Automation and Systems (ICCAS)

自引率

0.00%

发文量