Tianjiao An;Xiaogang Dong;Bo Dong;Ruiqi Cong;Lei Liu;Bing Ma
{"title":"用户主导的模块化机器人操作系统交互任务导向的层次逼近最优控制:一个Stackelberg-Pareto微分博弈视角","authors":"Tianjiao An;Xiaogang Dong;Bo Dong;Ruiqi Cong;Lei Liu;Bing Ma","doi":"10.1109/TASE.2025.3585484","DOIUrl":null,"url":null,"abstract":"A Stackelberg-Pareto differential game-based approximate optimal interaction control approach is proposed for user-led modular robot manipulator (MRM) systems modeled by joint torque feedback (JTF) technique. The major objective of optimal control with physical human-robot interaction (pHRI) is evolved into approximating Stackelberg-Pareto equilibrium by adopting cooperative differential game in MRM and Stackelberg differential game between the human and robot. Learning from adaptive dynamic programming (ADP), the approximate optimal interaction control strategy with pHRI is developed by critic neural network (NN) for solving the coupled Hamilton-Jacobian (HJ) and HJ-Bellman (HJB) equations. The position tracking error under pHRI task is ultimately uniformly bounded (UUB) by the concept of Lyapunov theorem. Two distinction experiments demonstrate the superiority of proposed control approach. Note to Practitioners—Major challenges of controlling user-led MRM systems include optimizing MRM system performance as well as layering interaction task between the robot and human. Traditional differential game, such as zero-sum game, nonzero-sum game and cooperative game only consider the same layer in the interaction progress that is not suitable for user-led MRM with leader and follower. Besides, the existed hierarchical game deals with one leader and one follower or followers formulated as nonzero-sum game that ignores the fully cooperative relationship among followers. Therefore, this paper proposes Stackelberg-Pareto differential game-based approximate optimal control for user-led MRM to improve system performance with theoretical analysis and experimental verification.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"17801-17813"},"PeriodicalIF":6.4000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"User-Led Modular Robot Manipulator Systems Interaction Tasks-Oriented Hierarchical Approximate Optimal Control: A Stackelberg-Pareto Differential Game Perspective\",\"authors\":\"Tianjiao An;Xiaogang Dong;Bo Dong;Ruiqi Cong;Lei Liu;Bing Ma\",\"doi\":\"10.1109/TASE.2025.3585484\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A Stackelberg-Pareto differential game-based approximate optimal interaction control approach is proposed for user-led modular robot manipulator (MRM) systems modeled by joint torque feedback (JTF) technique. The major objective of optimal control with physical human-robot interaction (pHRI) is evolved into approximating Stackelberg-Pareto equilibrium by adopting cooperative differential game in MRM and Stackelberg differential game between the human and robot. Learning from adaptive dynamic programming (ADP), the approximate optimal interaction control strategy with pHRI is developed by critic neural network (NN) for solving the coupled Hamilton-Jacobian (HJ) and HJ-Bellman (HJB) equations. The position tracking error under pHRI task is ultimately uniformly bounded (UUB) by the concept of Lyapunov theorem. Two distinction experiments demonstrate the superiority of proposed control approach. Note to Practitioners—Major challenges of controlling user-led MRM systems include optimizing MRM system performance as well as layering interaction task between the robot and human. Traditional differential game, such as zero-sum game, nonzero-sum game and cooperative game only consider the same layer in the interaction progress that is not suitable for user-led MRM with leader and follower. Besides, the existed hierarchical game deals with one leader and one follower or followers formulated as nonzero-sum game that ignores the fully cooperative relationship among followers. Therefore, this paper proposes Stackelberg-Pareto differential game-based approximate optimal control for user-led MRM to improve system performance with theoretical analysis and experimental verification.\",\"PeriodicalId\":51060,\"journal\":{\"name\":\"IEEE Transactions on Automation Science and Engineering\",\"volume\":\"22 \",\"pages\":\"17801-17813\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Automation Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11063305/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11063305/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
User-Led Modular Robot Manipulator Systems Interaction Tasks-Oriented Hierarchical Approximate Optimal Control: A Stackelberg-Pareto Differential Game Perspective
A Stackelberg-Pareto differential game-based approximate optimal interaction control approach is proposed for user-led modular robot manipulator (MRM) systems modeled by joint torque feedback (JTF) technique. The major objective of optimal control with physical human-robot interaction (pHRI) is evolved into approximating Stackelberg-Pareto equilibrium by adopting cooperative differential game in MRM and Stackelberg differential game between the human and robot. Learning from adaptive dynamic programming (ADP), the approximate optimal interaction control strategy with pHRI is developed by critic neural network (NN) for solving the coupled Hamilton-Jacobian (HJ) and HJ-Bellman (HJB) equations. The position tracking error under pHRI task is ultimately uniformly bounded (UUB) by the concept of Lyapunov theorem. Two distinction experiments demonstrate the superiority of proposed control approach. Note to Practitioners—Major challenges of controlling user-led MRM systems include optimizing MRM system performance as well as layering interaction task between the robot and human. Traditional differential game, such as zero-sum game, nonzero-sum game and cooperative game only consider the same layer in the interaction progress that is not suitable for user-led MRM with leader and follower. Besides, the existed hierarchical game deals with one leader and one follower or followers formulated as nonzero-sum game that ignores the fully cooperative relationship among followers. Therefore, this paper proposes Stackelberg-Pareto differential game-based approximate optimal control for user-led MRM to improve system performance with theoretical analysis and experimental verification.
期刊介绍:
The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.