肌肉骨骼机器人广义抓取操作的持续学习方法

IF 6.4 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Automation Science and Engineering Pub Date : 2025-03-12 DOI:10.1109/TASE.2025.3569402

Bo Jiang;Ci Song;Siyuan Liu;Shuai Gan;Jiahao Chen

{"title":"肌肉骨骼机器人广义抓取操作的持续学习方法","authors":"Bo Jiang;Ci Song;Siyuan Liu;Shuai Gan;Jiahao Chen","doi":"10.1109/TASE.2025.3569402","DOIUrl":null,"url":null,"abstract":"Musculoskeletal robotic systems offer structural advantages while presenting significant control challenges. Current research on their manipulation capabilities, particularly multi-object grasping scenarios, remains insufficient. Furthermore, as robots operate in dynamic environments with evolving task requirements, developing their ability to grasp novel objects while maintaining existing manipulation skills becomes crucial. To address these challenges, we propose a novel continual learning method for generalized grasping manipulation in a musculoskeletal upper limb robot. First, we propose an end-to-end learning method for multi-object grasping that leverages object-specific latent features and multi-level states of a musculoskeletal robot. Additionally, a novel continual learning method for grasping tasks is proposed with a bio-inspired object-preference-based experience replay selector, which optimizes learning efficiency while mitigating catastrophic forgetting. Our approach successfully masters the grasping manipulation task of 10 objects in the first phase and continually learns 23 additional objects in the second phase, outperforming existing methods in both multi-object grasping and continual learning. Furthermore, our analyses of muscle synergy and methodological robustness demonstrate that the proposed approach generates biologically plausible muscle synergies and exhibits strong robustness against object observation bias and neural excitation noise. Our experiments, conducted both in simulation and on a hardware system, demonstrate the practical transferability of our method to an articulated dexterous hand grasping task. <italic>Note to Practitioners</i>—Musculoskeletal robots, which incorporate human-like joints, muscles, and actuation mechanisms, exhibit exceptional robustness, dexterity, and compliance, offering promising applications and significant control challenges. While current research predominantly addresses lower limb locomotion and upper limb movements, studies on multi-object grasping and continual learning remain limited. This article presents a novel continual learning method for generalized grasping manipulation in a musculoskeletal upper limb, enabling end-to-end multi-object grasping without the dependency on predefined or generated grasping postures. The proposed methodology encompasses two key components: 1) the extraction and processing of object point cloud features as partial observations for the controller and 2) the implementation of a bio-inspired object-preference-based experience replay selector that balances new object learning and existing skill retention. Experimental evaluations reveal that the proposed approach exhibits effective muscle synergy patterns and demonstrates robust performance under diverse noise conditions and magnitudes, indicating significant potential for real-world deployment and cross-domain adaptation. Nevertheless, future work should address current limitations, including the extension from fixed-point to arbitrary position grasping, and the advancement from single-step grasping to complex multi-stage manipulation and assembly tasks.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"15671-15686"},"PeriodicalIF":6.4000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Continual Learning Method for Generalized Grasping Manipulation in a Musculoskeletal Robot\",\"authors\":\"Bo Jiang;Ci Song;Siyuan Liu;Shuai Gan;Jiahao Chen\",\"doi\":\"10.1109/TASE.2025.3569402\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Musculoskeletal robotic systems offer structural advantages while presenting significant control challenges. Current research on their manipulation capabilities, particularly multi-object grasping scenarios, remains insufficient. Furthermore, as robots operate in dynamic environments with evolving task requirements, developing their ability to grasp novel objects while maintaining existing manipulation skills becomes crucial. To address these challenges, we propose a novel continual learning method for generalized grasping manipulation in a musculoskeletal upper limb robot. First, we propose an end-to-end learning method for multi-object grasping that leverages object-specific latent features and multi-level states of a musculoskeletal robot. Additionally, a novel continual learning method for grasping tasks is proposed with a bio-inspired object-preference-based experience replay selector, which optimizes learning efficiency while mitigating catastrophic forgetting. Our approach successfully masters the grasping manipulation task of 10 objects in the first phase and continually learns 23 additional objects in the second phase, outperforming existing methods in both multi-object grasping and continual learning. Furthermore, our analyses of muscle synergy and methodological robustness demonstrate that the proposed approach generates biologically plausible muscle synergies and exhibits strong robustness against object observation bias and neural excitation noise. Our experiments, conducted both in simulation and on a hardware system, demonstrate the practical transferability of our method to an articulated dexterous hand grasping task. <italic>Note to Practitioners</i>—Musculoskeletal robots, which incorporate human-like joints, muscles, and actuation mechanisms, exhibit exceptional robustness, dexterity, and compliance, offering promising applications and significant control challenges. While current research predominantly addresses lower limb locomotion and upper limb movements, studies on multi-object grasping and continual learning remain limited. This article presents a novel continual learning method for generalized grasping manipulation in a musculoskeletal upper limb, enabling end-to-end multi-object grasping without the dependency on predefined or generated grasping postures. The proposed methodology encompasses two key components: 1) the extraction and processing of object point cloud features as partial observations for the controller and 2) the implementation of a bio-inspired object-preference-based experience replay selector that balances new object learning and existing skill retention. Experimental evaluations reveal that the proposed approach exhibits effective muscle synergy patterns and demonstrates robust performance under diverse noise conditions and magnitudes, indicating significant potential for real-world deployment and cross-domain adaptation. Nevertheless, future work should address current limitations, including the extension from fixed-point to arbitrary position grasping, and the advancement from single-step grasping to complex multi-stage manipulation and assembly tasks.\",\"PeriodicalId\":51060,\"journal\":{\"name\":\"IEEE Transactions on Automation Science and Engineering\",\"volume\":\"22 \",\"pages\":\"15671-15686\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2025-03-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Automation Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11002571/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11002571/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

肌肉骨骼机器人系统在提供结构优势的同时，也提出了重大的控制挑战。目前对它们的操作能力，特别是多目标抓取场景的研究仍然不足。此外，随着机器人在不断变化的任务要求的动态环境中操作，在保持现有操作技能的同时，发展它们掌握新物体的能力变得至关重要。为了解决这些挑战，我们提出了一种新的持续学习方法，用于肌肉骨骼上肢机器人的广义抓取操作。首先，我们提出了一种端到端的多目标抓取学习方法，该方法利用了肌肉骨骼机器人的特定对象潜在特征和多层次状态。此外，提出了一种基于生物启发的基于对象偏好的经验重放选择器的抓取任务持续学习方法，该方法在优化学习效率的同时减轻了灾难性遗忘。我们的方法在第一阶段成功地掌握了10个物体的抓取操作任务，在第二阶段持续学习了23个额外的物体，在多物体抓取和持续学习方面都优于现有的方法。此外，我们对肌肉协同作用和方法鲁棒性的分析表明，所提出的方法产生生物学上合理的肌肉协同作用，并对物体观察偏差和神经兴奋噪声具有很强的鲁棒性。我们在仿真和硬件系统上进行的实验证明了我们的方法在关节灵巧手抓取任务中的实际可移植性。从业人员注意：肌肉骨骼机器人结合了类似人类的关节、肌肉和驱动机构，具有出色的鲁棒性、灵活性和顺应性，提供了有前途的应用和重大的控制挑战。虽然目前的研究主要针对下肢运动和上肢运动，但对多物体抓取和持续学习的研究仍然有限。本文提出了一种新的连续学习方法，用于肌肉骨骼上肢的广义抓取操作，实现端到端多目标抓取，而不依赖于预定义或生成的抓取姿势。提出的方法包括两个关键组成部分：1)提取和处理对象点云特征，作为控制器的部分观察；2)实现基于生物的基于对象偏好的经验重放选择器，平衡新对象学习和现有技能保留。实验评估表明，该方法显示出有效的肌肉协同模式，并在不同噪声条件和大小下表现出稳健的性能，表明在现实世界中部署和跨域适应的巨大潜力。然而，未来的工作应该解决当前的局限性，包括从定点抓取到任意位置抓取的扩展，以及从单步抓取到复杂的多阶段操作和装配任务的进展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Continual Learning Method for Generalized Grasping Manipulation in a Musculoskeletal Robot

Musculoskeletal robotic systems offer structural advantages while presenting significant control challenges. Current research on their manipulation capabilities, particularly multi-object grasping scenarios, remains insufficient. Furthermore, as robots operate in dynamic environments with evolving task requirements, developing their ability to grasp novel objects while maintaining existing manipulation skills becomes crucial. To address these challenges, we propose a novel continual learning method for generalized grasping manipulation in a musculoskeletal upper limb robot. First, we propose an end-to-end learning method for multi-object grasping that leverages object-specific latent features and multi-level states of a musculoskeletal robot. Additionally, a novel continual learning method for grasping tasks is proposed with a bio-inspired object-preference-based experience replay selector, which optimizes learning efficiency while mitigating catastrophic forgetting. Our approach successfully masters the grasping manipulation task of 10 objects in the first phase and continually learns 23 additional objects in the second phase, outperforming existing methods in both multi-object grasping and continual learning. Furthermore, our analyses of muscle synergy and methodological robustness demonstrate that the proposed approach generates biologically plausible muscle synergies and exhibits strong robustness against object observation bias and neural excitation noise. Our experiments, conducted both in simulation and on a hardware system, demonstrate the practical transferability of our method to an articulated dexterous hand grasping task. Note to Practitioners—Musculoskeletal robots, which incorporate human-like joints, muscles, and actuation mechanisms, exhibit exceptional robustness, dexterity, and compliance, offering promising applications and significant control challenges. While current research predominantly addresses lower limb locomotion and upper limb movements, studies on multi-object grasping and continual learning remain limited. This article presents a novel continual learning method for generalized grasping manipulation in a musculoskeletal upper limb, enabling end-to-end multi-object grasping without the dependency on predefined or generated grasping postures. The proposed methodology encompasses two key components: 1) the extraction and processing of object point cloud features as partial observations for the controller and 2) the implementation of a bio-inspired object-preference-based experience replay selector that balances new object learning and existing skill retention. Experimental evaluations reveal that the proposed approach exhibits effective muscle synergy patterns and demonstrates robust performance under diverse noise conditions and magnitudes, indicating significant potential for real-world deployment and cross-domain adaptation. Nevertheless, future work should address current limitations, including the extension from fixed-point to arbitrary position grasping, and the advancement from single-step grasping to complex multi-stage manipulation and assembly tasks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统

CiteScore

12.50

自引率

14.30%

发文量

404

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.