Learning passive policies with virtual energy tanks in robotics

IF 2.2 4区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

IET Control Theory and Applications Pub Date : 2024-01-25 DOI:10.1049/cth2.12558

Riccardo Zanella, Gianluca Palli, Stefano Stramigioli, Federico Califano

{"title":"Learning passive policies with virtual energy tanks in robotics","authors":"Riccardo Zanella, Gianluca Palli, Stefano Stramigioli, Federico Califano","doi":"10.1049/cth2.12558","DOIUrl":null,"url":null,"abstract":"<p>Within a robotic context, the techniques of passivity-based control and reinforcement learning are merged with the goal of eliminating some of their reciprocal weaknesses, as well as inducing novel promising features in the resulting framework. The contribution is framed in a scenario where passivity-based control is implemented by means of virtual energy tanks, a control technique developed to achieve closed-loop passivity for any arbitrary control input. Albeit the latter result is heavily used, it is discussed why its practical application at its current stage remains rather limited, which makes contact with the highly debated claim that passivity-based techniques are associated with a loss of performance. The use of reinforcement learning allows to learn a control policy that can be passivized using the energy tank architecture, combining the versatility of learning approaches and the system theoretic properties which can be inferred due to the energy tanks. Simulations show the validity of the approach, as well as novel interesting research directions in energy-aware robotics.</p>","PeriodicalId":50382,"journal":{"name":"IET Control Theory and Applications","volume":null,"pages":null},"PeriodicalIF":2.2000,"publicationDate":"2024-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cth2.12558","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Control Theory and Applications","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cth2.12558","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Within a robotic context, the techniques of passivity-based control and reinforcement learning are merged with the goal of eliminating some of their reciprocal weaknesses, as well as inducing novel promising features in the resulting framework. The contribution is framed in a scenario where passivity-based control is implemented by means of virtual energy tanks, a control technique developed to achieve closed-loop passivity for any arbitrary control input. Albeit the latter result is heavily used, it is discussed why its practical application at its current stage remains rather limited, which makes contact with the highly debated claim that passivity-based techniques are associated with a loss of performance. The use of reinforcement learning allows to learn a control policy that can be passivized using the energy tank architecture, combining the versatility of learning approaches and the system theoretic properties which can be inferred due to the energy tanks. Simulations show the validity of the approach, as well as novel interesting research directions in energy-aware robotics.

Abstract Image

查看原文本刊更多论文

在机器人中利用虚拟能源罐学习被动策略

在机器人领域，基于被动性的控制技术和强化学习技术被融合在一起，目的是消除它们之间的一些互为因果的弱点，并在由此产生的框架中引入新的有前途的特征。这一贡献的背景是，基于被动性的控制是通过虚拟能量槽来实现的，而虚拟能量槽是为实现任意控制输入的闭环被动性而开发的一种控制技术。尽管后一种结果得到了广泛应用，但本文讨论了其在现阶段的实际应用仍然相当有限的原因，这与基于被动性的技术会导致性能损失这一备受争议的说法不谋而合。通过使用强化学习，可以学习到一种控制策略，这种控制策略可以使用能量槽结构进行钝化，将学习方法的多功能性与能量槽可以推断出的系统理论特性结合起来。模拟结果表明了该方法的有效性，以及在能量感知机器人学方面新颖有趣的研究方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IET Control Theory and Applications 工程技术-工程：电子与电气

CiteScore

5.70

自引率

7.70%

发文量

167

审稿时长

5.1 months

期刊介绍： IET Control Theory & Applications is devoted to control systems in the broadest sense, covering new theoretical results and the applications of new and established control methods. Among the topics of interest are system modelling, identification and simulation, the analysis and design of control systems (including computer-aided design), and practical implementation. The scope encompasses technological, economic, physiological (biomedical) and other systems, including man-machine interfaces. Most of the papers published deal with original work from industrial and government laboratories and universities, but subject reviews and tutorial expositions of current methods are welcomed. Correspondence discussing published papers is also welcomed. Applications papers need not necessarily involve new theory. Papers which describe new realisations of established methods, or control techniques applied in a novel situation, or practical studies which compare various designs, would be of interest. Of particular value are theoretical papers which discuss the applicability of new work or applications which engender new theoretical applications.