用户主导的模块化机器人操作系统交互任务导向的层次逼近最优控制：一个Stackelberg-Pareto微分博弈视角

IF 6.4 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Automation Science and Engineering Pub Date : 2025-07-02 DOI:10.1109/TASE.2025.3585484

Tianjiao An;Xiaogang Dong;Bo Dong;Ruiqi Cong;Lei Liu;Bing Ma

{"title":"用户主导的模块化机器人操作系统交互任务导向的层次逼近最优控制：一个Stackelberg-Pareto微分博弈视角","authors":"Tianjiao An;Xiaogang Dong;Bo Dong;Ruiqi Cong;Lei Liu;Bing Ma","doi":"10.1109/TASE.2025.3585484","DOIUrl":null,"url":null,"abstract":"A Stackelberg-Pareto differential game-based approximate optimal interaction control approach is proposed for user-led modular robot manipulator (MRM) systems modeled by joint torque feedback (JTF) technique. The major objective of optimal control with physical human-robot interaction (pHRI) is evolved into approximating Stackelberg-Pareto equilibrium by adopting cooperative differential game in MRM and Stackelberg differential game between the human and robot. Learning from adaptive dynamic programming (ADP), the approximate optimal interaction control strategy with pHRI is developed by critic neural network (NN) for solving the coupled Hamilton-Jacobian (HJ) and HJ-Bellman (HJB) equations. The position tracking error under pHRI task is ultimately uniformly bounded (UUB) by the concept of Lyapunov theorem. Two distinction experiments demonstrate the superiority of proposed control approach. Note to Practitioners—Major challenges of controlling user-led MRM systems include optimizing MRM system performance as well as layering interaction task between the robot and human. Traditional differential game, such as zero-sum game, nonzero-sum game and cooperative game only consider the same layer in the interaction progress that is not suitable for user-led MRM with leader and follower. Besides, the existed hierarchical game deals with one leader and one follower or followers formulated as nonzero-sum game that ignores the fully cooperative relationship among followers. Therefore, this paper proposes Stackelberg-Pareto differential game-based approximate optimal control for user-led MRM to improve system performance with theoretical analysis and experimental verification.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"17801-17813"},"PeriodicalIF":6.4000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"User-Led Modular Robot Manipulator Systems Interaction Tasks-Oriented Hierarchical Approximate Optimal Control: A Stackelberg-Pareto Differential Game Perspective\",\"authors\":\"Tianjiao An;Xiaogang Dong;Bo Dong;Ruiqi Cong;Lei Liu;Bing Ma\",\"doi\":\"10.1109/TASE.2025.3585484\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A Stackelberg-Pareto differential game-based approximate optimal interaction control approach is proposed for user-led modular robot manipulator (MRM) systems modeled by joint torque feedback (JTF) technique. The major objective of optimal control with physical human-robot interaction (pHRI) is evolved into approximating Stackelberg-Pareto equilibrium by adopting cooperative differential game in MRM and Stackelberg differential game between the human and robot. Learning from adaptive dynamic programming (ADP), the approximate optimal interaction control strategy with pHRI is developed by critic neural network (NN) for solving the coupled Hamilton-Jacobian (HJ) and HJ-Bellman (HJB) equations. The position tracking error under pHRI task is ultimately uniformly bounded (UUB) by the concept of Lyapunov theorem. Two distinction experiments demonstrate the superiority of proposed control approach. Note to Practitioners—Major challenges of controlling user-led MRM systems include optimizing MRM system performance as well as layering interaction task between the robot and human. Traditional differential game, such as zero-sum game, nonzero-sum game and cooperative game only consider the same layer in the interaction progress that is not suitable for user-led MRM with leader and follower. Besides, the existed hierarchical game deals with one leader and one follower or followers formulated as nonzero-sum game that ignores the fully cooperative relationship among followers. Therefore, this paper proposes Stackelberg-Pareto differential game-based approximate optimal control for user-led MRM to improve system performance with theoretical analysis and experimental verification.\",\"PeriodicalId\":51060,\"journal\":{\"name\":\"IEEE Transactions on Automation Science and Engineering\",\"volume\":\"22 \",\"pages\":\"17801-17813\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Automation Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11063305/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11063305/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

针对采用关节力矩反馈（JTF）技术建模的用户主导模块化机械臂（MRM）系统，提出了一种基于Stackelberg-Pareto微分对策的近似最优交互控制方法。通过采用MRM中的合作微分对策和人与机器人之间的Stackelberg微分对策，将具有物理人机交互的最优控制的主要目标演化为逼近Stackelberg- pareto均衡。借鉴自适应动态规划（ADP）方法，利用批评性神经网络（NN）建立了与pHRI的近似最优交互控制策略，用于求解Hamilton-Jacobian （HJ）和HJ- bellman （HJB）耦合方程。pHRI任务下的位置跟踪误差是由李雅普诺夫定理的概念最终一致有界的。两个区分实验证明了所提控制方法的优越性。操作者注意：控制用户主导的MRM系统的主要挑战包括优化MRM系统性能以及机器人和人之间的分层交互任务。传统的微分博弈，如零和博弈、非零和博弈、合作博弈，在交互过程中只考虑同一层，不适合以用户为主导的有领导者和追随者的MRM。此外，现有的等级博弈是一个领导者和一个或多个追随者之间的非零和博弈，忽略了追随者之间的充分合作关系。为此，本文通过理论分析和实验验证，提出了基于Stackelberg-Pareto微分对策的用户主导MRM近似最优控制，以提高系统性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

User-Led Modular Robot Manipulator Systems Interaction Tasks-Oriented Hierarchical Approximate Optimal Control: A Stackelberg-Pareto Differential Game Perspective

A Stackelberg-Pareto differential game-based approximate optimal interaction control approach is proposed for user-led modular robot manipulator (MRM) systems modeled by joint torque feedback (JTF) technique. The major objective of optimal control with physical human-robot interaction (pHRI) is evolved into approximating Stackelberg-Pareto equilibrium by adopting cooperative differential game in MRM and Stackelberg differential game between the human and robot. Learning from adaptive dynamic programming (ADP), the approximate optimal interaction control strategy with pHRI is developed by critic neural network (NN) for solving the coupled Hamilton-Jacobian (HJ) and HJ-Bellman (HJB) equations. The position tracking error under pHRI task is ultimately uniformly bounded (UUB) by the concept of Lyapunov theorem. Two distinction experiments demonstrate the superiority of proposed control approach. Note to Practitioners—Major challenges of controlling user-led MRM systems include optimizing MRM system performance as well as layering interaction task between the robot and human. Traditional differential game, such as zero-sum game, nonzero-sum game and cooperative game only consider the same layer in the interaction progress that is not suitable for user-led MRM with leader and follower. Besides, the existed hierarchical game deals with one leader and one follower or followers formulated as nonzero-sum game that ignores the fully cooperative relationship among followers. Therefore, this paper proposes Stackelberg-Pareto differential game-based approximate optimal control for user-led MRM to improve system performance with theoretical analysis and experimental verification.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统

CiteScore

12.50

自引率

14.30%

发文量

404

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.