{"title":"Curriculum-guided skill learning for long-horizon robot manipulation tasks","authors":"João Bernardo Alves, Nuno Lau, Filipe Silva","doi":"10.1016/j.robot.2025.105032","DOIUrl":null,"url":null,"abstract":"<div><div>Robotic tasks often involve solving long-horizon problems. Seen under the reinforcement learning framework, the rewards provided in these problems are often sparse, which can be problematic for the learning process. In this context, dividing the long-horizon task into smaller ones represents a viable strategy to alleviate the credit assignment problem. Another approach generally used to help with this problem is curriculum learning. This paper combines both with a new skill-chaining learning algorithm that provides transition policies to bridge the gap between skills. Our approach begins by extracting meaningful skills from the states of an expert trajectory, using a heuristic method, which are subsequently used by the skill learning and the skill chaining algorithms. By leveraging the sequential order of the skills inside the demonstration, we propose a method to learn inter-skill transition policies to ensure the skills are appropriately chained. Our curriculum-based training approach enables an agent to learn action sequences that generalize inside a specific sub-task context. Using the information of a single demonstration, we show that our approach can solve a robotic manipulation task with similar performance to methods that rely on a large amount of data. Because our skill segmentation method detects which skills are present across demonstrations, we also show that our approach can reuse skills already learned in a zero-shot way.</div></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"192 ","pages":"Article 105032"},"PeriodicalIF":4.3000,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889025001186","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Robotic tasks often involve solving long-horizon problems. Seen under the reinforcement learning framework, the rewards provided in these problems are often sparse, which can be problematic for the learning process. In this context, dividing the long-horizon task into smaller ones represents a viable strategy to alleviate the credit assignment problem. Another approach generally used to help with this problem is curriculum learning. This paper combines both with a new skill-chaining learning algorithm that provides transition policies to bridge the gap between skills. Our approach begins by extracting meaningful skills from the states of an expert trajectory, using a heuristic method, which are subsequently used by the skill learning and the skill chaining algorithms. By leveraging the sequential order of the skills inside the demonstration, we propose a method to learn inter-skill transition policies to ensure the skills are appropriately chained. Our curriculum-based training approach enables an agent to learn action sequences that generalize inside a specific sub-task context. Using the information of a single demonstration, we show that our approach can solve a robotic manipulation task with similar performance to methods that rely on a large amount of data. Because our skill segmentation method detects which skills are present across demonstrations, we also show that our approach can reuse skills already learned in a zero-shot way.
期刊介绍:
Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems.
Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.