{"title":"A survey of demonstration learning","authors":"","doi":"10.1016/j.robot.2024.104812","DOIUrl":"10.1016/j.robot.2024.104812","url":null,"abstract":"<div><p>With the fast improvement of machine learning, reinforcement learning (RL) has been used to automate human tasks in different areas. However, training such agents is difficult and restricted to expert users. Moreover, it is mostly limited to simulation environments due to the high cost and safety concerns of interactions in the real-world. Demonstration Learning is a paradigm in which an agent learns to perform a task by imitating the behavior of an expert shown in demonstrations. Learning from demonstration accelerates the learning process by improving sample efficiency, while also reducing the effort of the programmer. Because the task is learned without interacting with the environment, demonstration learning allows the automation of a wide range of real-world applications such as robotics and healthcare. This paper provides a survey of demonstration learning, where we formally introduce the demonstration problem along with its main challenges and provide a comprehensive overview of the process of learning from demonstrations from the creation of the demonstration data set, to learning methods from demonstrations, and optimization by combining demonstration learning with different machine learning methods. We also review the existing benchmarks and identify their strengths and limitations. Additionally, we discuss the advantages and disadvantages of the paradigm as well as its main applications. Lastly, we discuss the open problems and future research directions of the field.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0921889024001969/pdfft?md5=ca55398684a5261baba6c83e357cba9b&pid=1-s2.0-S0921889024001969-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142229289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Model-less optimal visual control of tendon-driven continuum robots using recurrent neural network-based neurodynamic optimization","authors":"","doi":"10.1016/j.robot.2024.104811","DOIUrl":"10.1016/j.robot.2024.104811","url":null,"abstract":"<div><p>Tendon-driven continuum robots (TDCRs) have infinite degrees of freedom and high flexibility, posing challenges for accurate modeling and autonomous control, especially in confined environments. This paper presents a model-less optimal visual control (MLOVC) method using neurodynamic optimization to enable autonomous target tracking of TDCRs in confined environments. The TDCR’s kinematics are estimated online from sensory data, establishing a connection between the actuator input and visual features. An optimal visual servoing method based on quadratic programming (QP) is developed to ensure precise target tracking without violating the robot’s physical constraints. An inverse-free recurrent neural network (RNN)-based neurodynamic optimization method is designed to solve the complex QP problem. Comparative simulations and experiments demonstrate that the proposed method outperforms existing methods in target tracking accuracy and computational efficiency. The RNN-based controller successfully achieves target tracking within constraints in confined environments.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142169031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"GSC: A graph-based skill composition framework for robot learning","authors":"","doi":"10.1016/j.robot.2024.104787","DOIUrl":"10.1016/j.robot.2024.104787","url":null,"abstract":"<div><p>Humans excel at performing a wide range of sophisticated tasks by leveraging skills acquired from prior experiences. This characteristic is especially essential in robotics empowered by deep reinforcement learning, as learning every skill from scratch is time-consuming and may not always be feasible. With the prior skills incorporated, skill composition aims to accelerate the learning process on new robotic tasks. Previous works have given insight into combining pre-trained task-agnostic skills, whereas skills are transformed into fixed order representation, resulting in poor capturing of potential complex skill relations. In this paper, we novelly propose a <u>G</u>raph-based framework for <u>S</u>kill <u>C</u>omposition (GSC). To learn rich structural information, a carefully designed skill graph is constructed, where skill representations are taken as nodes and skill relations are utilized as edges. Furthermore, to allow it trained efficiently on large-scale skill set, a transformer-style graph updating method is employed to achieve comprehensive information aggregation. Our simulation experiments indicate that GSC outperforms the state-of-the-art methods on various challenging tasks. Additionally, we successfully apply the technique to the navigation task on a real quadruped robot. The project homepage can be found at <span><span>Graph Skill Composition</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142157945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DewROS2: A platform for informed Dew Robotics in ROS","authors":"","doi":"10.1016/j.robot.2024.104800","DOIUrl":"10.1016/j.robot.2024.104800","url":null,"abstract":"<div><p>With the shift from Cloud to Fog and Dew Robotics a lot of emphasis of the research community has been devoted to task offloading. Effective and efficient resource monitoring is however necessary for such offloading and it is also fundamental for other important safety and security tasks. Despite this, robot monitoring has received little attention in general and also for Robot Operating System (ROS) the most employed framework in robotics. In this paper DewROS2 is presented, a platform for Dew Robotics that comprises entities to monitor the system status and to share it with interested applications. The design and implementation of the platform is presented together with the monitoring entities created. DewROS2 has been deployed on different real devices, including an unmanned aerial vehicle and an industrial router, to move from theory to practice and to analyze the impact of monitoring on robot resources. DewROS2 has also been tested in a search and rescue use case where robots are used to collect and transmit videos to spot signs of humans in trouble. Results in controlled and uncontrolled conditions show that the monitoring nodes do not have a significant impact on the performance while providing important and measurable benefits to the applications. Accurately monitoring of robot resources, for example, allows the search and rescue application to almost double the utilization of the network, therefore collecting video at a much higher resolution.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0921889024001842/pdfft?md5=5c297fef383277cab8dfb1d34ab6da82&pid=1-s2.0-S0921889024001842-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142243072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust quadruped jumping via deep reinforcement learning","authors":"","doi":"10.1016/j.robot.2024.104799","DOIUrl":"10.1016/j.robot.2024.104799","url":null,"abstract":"<div><p>In this paper, we consider a general task of jumping varying distances and heights for a quadrupedal robot in noisy environments, such as off of uneven terrain and with variable robot dynamics parameters. To accurately jump in such conditions, we propose a framework using deep reinforcement learning that leverages and augments the complex solution of nonlinear trajectory optimization for quadrupedal jumping. While the standalone optimization limits jumping to take-off from flat ground and requires accurate assumptions of robot dynamics, our proposed approach improves the robustness to allow jumping off of significantly uneven terrain with variable robot dynamical parameters and environmental conditions. Compared with walking and running, the realization of aggressive jumping on hardware necessitates accounting for the motors’ torque-speed relationship as well as the robot’s total power limits. By incorporating these constraints into our learning framework, we successfully deploy our policy sim-to-real without further tuning, fully exploiting the available onboard power supply and motors. We demonstrate robustness to environment noise of foot disturbances of up to 6 cm in height, or 33% of the robot’s nominal standing height, while jumping 2<em>x</em> the body length in distance.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0921889024001830/pdfft?md5=120dc8f6d5e39f92c0fe9b96f499ae52&pid=1-s2.0-S0921889024001830-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142243073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Incorporating shape dependent power law in motion planning for drawing robots","authors":"","doi":"10.1016/j.robot.2024.104801","DOIUrl":"10.1016/j.robot.2024.104801","url":null,"abstract":"<div><p>Incorporating human natural features into the algorithmic functions of robots can enhance performance efficiency. One of the most popular features of human movements is the <em>power law</em> that defines the connection between movement speed and path curvature. To contribute to this area, by discussing how the <em>power law</em> can provide a reasonable balance between velocity and efficiency, we propose a novel method to design motion profiles based on the <em>power law</em>. The novelty of this solution lies in the adjustment approach for the <em>power law</em>. In this work, inspired by features of human hand movements, the overall curvature of non-circular shapes is considered as the shape-dependent criterion for motion planning. Also, a framework to apply the proposed approach to any open non-circular curvy contours is presented. To investigate the efficiency of the approach, we considered a simple drawing robot. The simulation and experimental results verify the efficacy of the proposed motion planning method.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0921889024001854/pdfft?md5=01b8e10ad852b88f3d13ec63d56cfd9c&pid=1-s2.0-S0921889024001854-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-objective QoS optimization in swarm robotics","authors":"","doi":"10.1016/j.robot.2024.104796","DOIUrl":"10.1016/j.robot.2024.104796","url":null,"abstract":"<div><p>The “Internet of Robotic Things” (IoRT) is a concept that connects sensors and robotic objects. One of the practical applications of IoRT is swarm robotics, where multiple robots collaborate in a shared workspace to accomplish assigned tasks that may be challenging or impossible for a single robot to conquer. Swarm robots are particularly useful in critical situations, such as post-earthquake scenarios, where they can locate survivors and provide assistance in areas inaccessible to humans. In these life-saving situations, reliable and prompt communication among swarm robots is of utmost importance. To address the need for highly dependable and low-latency communication in swarm robotics, this research introduces a novel hybrid approach called Multi-objective QoS optimization based on Support vector regression and Genetic algorithm (MQSG). The MQSG method consists of two main phases: Parameter Relationship Identification and Parameter Optimization. In the Parameter Relationship Identification phase, the relationship between network inputs (Packet inter-arrival time, Packet size, Transmission power, Distance between sender and receiver) and outputs (quality of service (QoS) parameters) is established using support vector regression. In the parameter optimization phase, a multi-objective function is created based on the obtained relationships from the Parameter Relationship Identification phase. By solving this multi-objective function, optimal values for each QoS parameter are determined, leading to enhanced network performance. Simulation results demonstrate that the MQSG method outperforms other similar algorithms in terms of transmission latency, packet delivery rate, and the number of retransmitted packets.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142157947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Lane-changing and overtaking trajectory planning for autonomous vehicles with multi-performance optimization considering static and dynamic obstacles","authors":"","doi":"10.1016/j.robot.2024.104797","DOIUrl":"10.1016/j.robot.2024.104797","url":null,"abstract":"<div><p>Affected by the complex traffic environment, lane-changing and overtaking have become daily driving operations of autonomous vehicles, and providing a drivable trajectory is one of the critical tasks of planning processes. To this end, this paper aims to propose an optimization-algorithm-based double quintic polynomial trajectory planning model considering static and dynamic obstacles for lane-changing and overtaking maneuvers of the autonomous vehicle. Firstly, an improved double quintic polynomial planning model considering different motion states and sizes of obstacles is constructed by introducing the lane change transition state to ensure the autonomous vehicle’s driving safety. Secondly, a multi-objective performance function considering various influencing factors is established to improve the driving performances of the autonomous vehicle during lane-changing and overtaking. Finally, a particle swarm optimization (PSO) algorithm is used to optimize parameters of the proposed planning model, such as the lane change time, transition speed, and longitudinal displacement, to generate a driveability trajectory that meets the driving safety, comfort, stability, and low emission requirements of the autonomous vehicle during lane-changing and overtaking. The effectiveness and advantages of the proposed planning model are verified by comparing it with several existing planning models under different driving conditions.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142157946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Motion planning in underactuated systems with impulsive phenomenon via dynamic shaping of virtual holonomic constraints","authors":"","doi":"10.1016/j.robot.2024.104798","DOIUrl":"10.1016/j.robot.2024.104798","url":null,"abstract":"<div><p>Rhythmic motions are traditionally achieved by developing predetermined paths for the states of the space system to follow. Since these paths are obtained offline, the dynamic behavior fails to adapt to changes in environmental conditions or user command desires. The solution we propose is a new strategy called dynamic shaping, in which the paths are formed online to allow the system to create an orbit with the characteristics we need. Hereupon, this paper focuses on applying this strategy to Dynamics with One Degree of Under-actuation and Impulsive Phenomenon (DSODUIP) to adapt the characteristics of outcomes to be in line with the demands.</p><p>This research was conducted by leveraging the advantages of virtual holonomic constraints (VHCs) to establish these paths. Therefore, a novel two-level hierarchical control method is designed considering a stability criterion to avoid divergence. At the Low-Level, the controllers stabilize the output of system to follow the VHCs on the system. At the High-Level, the VHCs are modified to shape an orbit with our desired characteristic in the motion. As an illustrative example, the algorithm is implemented to adjust the average angular velocity of a devil stick and the hip velocity of a Three-Link biped robot. Their results vividly demonstrate smooth adjustments and efficient performance in achieving our desired outcomes.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enhancing radioactive environment exploration with bio-inspired swarm robotics: A comparative analysis of Lévy flight and stigmergy methods","authors":"","doi":"10.1016/j.robot.2024.104794","DOIUrl":"10.1016/j.robot.2024.104794","url":null,"abstract":"<div><p>Utilizing swarm robotics techniques can significantly enhance the efficiency of exploring and mapping hazardous environments, such as nuclear sites. Instead of relying on a single robot for exploration, employing multiple robots working in coordination allows for fast coverage and more comprehensive data collection. In this study, bio-inspired algorithms, specifically Lévy flight and stigmergy, are utilized to guide the robots' movements. The Lévy flight algorithm mimics the movement patterns observed in animals like sharks and honeybees during their search for food, while stigmergy involves indirect communication between agents through environmental traces. By integrating these algorithms with swarm robotics, the robots effectively explore radioactive environments, gather data, and generate detailed maps of the area. Our research delves into various aspects of exploration, including the influence of the number of deployed robots and their exposure to radiation. Comparative analysis reveals the efficacy of stigmergy as a superior approach for guiding swarm robot movements in radioactive environments. This study underscores the significant potential of employing collective robotics for exploration tasks in nuclear scenarios, highlighting the promising applications of swarm intelligence in enhancing safety and efficiency in hazardous environments.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3,"publicationDate":"2024-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}