Reinforcement Learning of Whole-Body Control Strategies to Balance a Dynamically Stable Mobile Manipulator

Vighnesh Vatsal, B. Purushothaman
{"title":"Reinforcement Learning of Whole-Body Control Strategies to Balance a Dynamically Stable Mobile Manipulator","authors":"Vighnesh Vatsal, B. Purushothaman","doi":"10.1109/ICC54714.2021.9703140","DOIUrl":null,"url":null,"abstract":"Mobile manipulators consist of a ground robot base and a mounted robotic arm, with the two components typically controlled as separate subsystems. This is enabled by the fact that most mobile bases with three or four-wheeled designs are inherently stable, though lacking in maneuverability. In contrast, dynamically stable mobile bases offer greater agility and safety in crowded human interaction scenarios, though requiring active balancing. In this work, we consider the balancing problem for a Two-Wheeled Inverted Pendulum Mobile Manipulator (TWIP-MM), designed for retail shelf inspection. Using deep reinforcement learning methods (PPO and SAC), we can generate whole-body control strategies that leverage the motion of the robotic arm for in-place stabilization of the base, through a completely model-free approach. In contrast, tuning a standard PID controller requires a model of the robot, and is considered here as a baseline. Compared to PID control in simulation, the RL-based controllers are found to be more robust against changes in initial conditions, variations in inertial parameters, and disturbances applied to the robot.","PeriodicalId":382373,"journal":{"name":"2021 Seventh Indian Control Conference (ICC)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Seventh Indian Control Conference (ICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICC54714.2021.9703140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Mobile manipulators consist of a ground robot base and a mounted robotic arm, with the two components typically controlled as separate subsystems. This is enabled by the fact that most mobile bases with three or four-wheeled designs are inherently stable, though lacking in maneuverability. In contrast, dynamically stable mobile bases offer greater agility and safety in crowded human interaction scenarios, though requiring active balancing. In this work, we consider the balancing problem for a Two-Wheeled Inverted Pendulum Mobile Manipulator (TWIP-MM), designed for retail shelf inspection. Using deep reinforcement learning methods (PPO and SAC), we can generate whole-body control strategies that leverage the motion of the robotic arm for in-place stabilization of the base, through a completely model-free approach. In contrast, tuning a standard PID controller requires a model of the robot, and is considered here as a baseline. Compared to PID control in simulation, the RL-based controllers are found to be more robust against changes in initial conditions, variations in inertial parameters, and disturbances applied to the robot.
动态稳定移动机械臂平衡的全身强化学习控制策略
移动机械臂由地面机器人基座和安装的机械臂组成,这两个部件通常作为单独的子系统进行控制。这是由于大多数带有三轮或四轮设计的移动基地虽然缺乏机动性,但本质上是稳定的。相比之下,动态稳定的移动基地在拥挤的人类互动场景中提供了更大的灵活性和安全性,尽管需要主动平衡。本文研究了用于零售货架检测的两轮倒立摆移动机械手的平衡问题。使用深度强化学习方法(PPO和SAC),我们可以通过完全无模型的方法生成全身控制策略,利用机械臂的运动来实现基座的原位稳定。相比之下,调整标准PID控制器需要机器人的模型,这里将其视为基线。与仿真中的PID控制相比,基于rl的控制器对初始条件的变化、惯性参数的变化和应用于机器人的干扰具有更强的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信