用户主导的模块化机器人操作系统交互任务导向的层次逼近最优控制:一个Stackelberg-Pareto微分博弈视角

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Tianjiao An;Xiaogang Dong;Bo Dong;Ruiqi Cong;Lei Liu;Bing Ma
{"title":"用户主导的模块化机器人操作系统交互任务导向的层次逼近最优控制:一个Stackelberg-Pareto微分博弈视角","authors":"Tianjiao An;Xiaogang Dong;Bo Dong;Ruiqi Cong;Lei Liu;Bing Ma","doi":"10.1109/TASE.2025.3585484","DOIUrl":null,"url":null,"abstract":"A Stackelberg-Pareto differential game-based approximate optimal interaction control approach is proposed for user-led modular robot manipulator (MRM) systems modeled by joint torque feedback (JTF) technique. The major objective of optimal control with physical human-robot interaction (pHRI) is evolved into approximating Stackelberg-Pareto equilibrium by adopting cooperative differential game in MRM and Stackelberg differential game between the human and robot. Learning from adaptive dynamic programming (ADP), the approximate optimal interaction control strategy with pHRI is developed by critic neural network (NN) for solving the coupled Hamilton-Jacobian (HJ) and HJ-Bellman (HJB) equations. The position tracking error under pHRI task is ultimately uniformly bounded (UUB) by the concept of Lyapunov theorem. Two distinction experiments demonstrate the superiority of proposed control approach. Note to Practitioners—Major challenges of controlling user-led MRM systems include optimizing MRM system performance as well as layering interaction task between the robot and human. Traditional differential game, such as zero-sum game, nonzero-sum game and cooperative game only consider the same layer in the interaction progress that is not suitable for user-led MRM with leader and follower. Besides, the existed hierarchical game deals with one leader and one follower or followers formulated as nonzero-sum game that ignores the fully cooperative relationship among followers. Therefore, this paper proposes Stackelberg-Pareto differential game-based approximate optimal control for user-led MRM to improve system performance with theoretical analysis and experimental verification.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"17801-17813"},"PeriodicalIF":6.4000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"User-Led Modular Robot Manipulator Systems Interaction Tasks-Oriented Hierarchical Approximate Optimal Control: A Stackelberg-Pareto Differential Game Perspective\",\"authors\":\"Tianjiao An;Xiaogang Dong;Bo Dong;Ruiqi Cong;Lei Liu;Bing Ma\",\"doi\":\"10.1109/TASE.2025.3585484\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A Stackelberg-Pareto differential game-based approximate optimal interaction control approach is proposed for user-led modular robot manipulator (MRM) systems modeled by joint torque feedback (JTF) technique. The major objective of optimal control with physical human-robot interaction (pHRI) is evolved into approximating Stackelberg-Pareto equilibrium by adopting cooperative differential game in MRM and Stackelberg differential game between the human and robot. Learning from adaptive dynamic programming (ADP), the approximate optimal interaction control strategy with pHRI is developed by critic neural network (NN) for solving the coupled Hamilton-Jacobian (HJ) and HJ-Bellman (HJB) equations. The position tracking error under pHRI task is ultimately uniformly bounded (UUB) by the concept of Lyapunov theorem. Two distinction experiments demonstrate the superiority of proposed control approach. Note to Practitioners—Major challenges of controlling user-led MRM systems include optimizing MRM system performance as well as layering interaction task between the robot and human. Traditional differential game, such as zero-sum game, nonzero-sum game and cooperative game only consider the same layer in the interaction progress that is not suitable for user-led MRM with leader and follower. Besides, the existed hierarchical game deals with one leader and one follower or followers formulated as nonzero-sum game that ignores the fully cooperative relationship among followers. Therefore, this paper proposes Stackelberg-Pareto differential game-based approximate optimal control for user-led MRM to improve system performance with theoretical analysis and experimental verification.\",\"PeriodicalId\":51060,\"journal\":{\"name\":\"IEEE Transactions on Automation Science and Engineering\",\"volume\":\"22 \",\"pages\":\"17801-17813\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Automation Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11063305/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11063305/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

针对采用关节力矩反馈(JTF)技术建模的用户主导模块化机械臂(MRM)系统,提出了一种基于Stackelberg-Pareto微分对策的近似最优交互控制方法。通过采用MRM中的合作微分对策和人与机器人之间的Stackelberg微分对策,将具有物理人机交互的最优控制的主要目标演化为逼近Stackelberg- pareto均衡。借鉴自适应动态规划(ADP)方法,利用批评性神经网络(NN)建立了与pHRI的近似最优交互控制策略,用于求解Hamilton-Jacobian (HJ)和HJ- bellman (HJB)耦合方程。pHRI任务下的位置跟踪误差是由李雅普诺夫定理的概念最终一致有界的。两个区分实验证明了所提控制方法的优越性。操作者注意:控制用户主导的MRM系统的主要挑战包括优化MRM系统性能以及机器人和人之间的分层交互任务。传统的微分博弈,如零和博弈、非零和博弈、合作博弈,在交互过程中只考虑同一层,不适合以用户为主导的有领导者和追随者的MRM。此外,现有的等级博弈是一个领导者和一个或多个追随者之间的非零和博弈,忽略了追随者之间的充分合作关系。为此,本文通过理论分析和实验验证,提出了基于Stackelberg-Pareto微分对策的用户主导MRM近似最优控制,以提高系统性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
User-Led Modular Robot Manipulator Systems Interaction Tasks-Oriented Hierarchical Approximate Optimal Control: A Stackelberg-Pareto Differential Game Perspective
A Stackelberg-Pareto differential game-based approximate optimal interaction control approach is proposed for user-led modular robot manipulator (MRM) systems modeled by joint torque feedback (JTF) technique. The major objective of optimal control with physical human-robot interaction (pHRI) is evolved into approximating Stackelberg-Pareto equilibrium by adopting cooperative differential game in MRM and Stackelberg differential game between the human and robot. Learning from adaptive dynamic programming (ADP), the approximate optimal interaction control strategy with pHRI is developed by critic neural network (NN) for solving the coupled Hamilton-Jacobian (HJ) and HJ-Bellman (HJB) equations. The position tracking error under pHRI task is ultimately uniformly bounded (UUB) by the concept of Lyapunov theorem. Two distinction experiments demonstrate the superiority of proposed control approach. Note to Practitioners—Major challenges of controlling user-led MRM systems include optimizing MRM system performance as well as layering interaction task between the robot and human. Traditional differential game, such as zero-sum game, nonzero-sum game and cooperative game only consider the same layer in the interaction progress that is not suitable for user-led MRM with leader and follower. Besides, the existed hierarchical game deals with one leader and one follower or followers formulated as nonzero-sum game that ignores the fully cooperative relationship among followers. Therefore, this paper proposes Stackelberg-Pareto differential game-based approximate optimal control for user-led MRM to improve system performance with theoretical analysis and experimental verification.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信