{"title":"Deep reinforcement learning-based trajectory planning for double pendulum cranes: Design and experiments","authors":"Weili Ding , Heng Zhang , Changchun Hua , Biao Lu","doi":"10.1016/j.ymssp.2025.112780","DOIUrl":null,"url":null,"abstract":"<div><div>Since cranes usually possess double pendulum dynamics, the mass of the payload often changes and there are frequently lifting/lowering operations simultaneously. Moreover, most crane systems are driven by motors, whose velocities and accelerations are often limited. To solve the above problems, this paper proposes a deep reinforcement learning (DRL) reference trajectory generation method based on virtual–physical joint training. Firstly, a DRL module based on deep deterministic policy gradient (DDPG), along with a double pendulum crane dynamic model and an adaptive controller, are established within the virtual environment for training the reference trajectory. When the reward reaches the threshold, the training is switched to the physical environment to further optimize the reference trajectory, realizing the swing suppression of the hook and the payload. In addition, to satisfy the performance of the drive motors, velocity and acceleration thresholds are set to constrain the performance of the drive motors. Finally, in view of the fact that the operation process of the double pendulum crane is divided into payload transportation and payload loading/unloading, an event-triggering mechanism is designed to switch different control policies in accordance with different operation processes, thus reducing the consumption of computing resources. Through experiments on the actual double pendulum crane and comparison with the existing reference trajectories and input shapers, the superiority of this method is demonstrated.</div></div>","PeriodicalId":51124,"journal":{"name":"Mechanical Systems and Signal Processing","volume":"234 ","pages":"Article 112780"},"PeriodicalIF":7.9000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mechanical Systems and Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0888327025004819","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Since cranes usually possess double pendulum dynamics, the mass of the payload often changes and there are frequently lifting/lowering operations simultaneously. Moreover, most crane systems are driven by motors, whose velocities and accelerations are often limited. To solve the above problems, this paper proposes a deep reinforcement learning (DRL) reference trajectory generation method based on virtual–physical joint training. Firstly, a DRL module based on deep deterministic policy gradient (DDPG), along with a double pendulum crane dynamic model and an adaptive controller, are established within the virtual environment for training the reference trajectory. When the reward reaches the threshold, the training is switched to the physical environment to further optimize the reference trajectory, realizing the swing suppression of the hook and the payload. In addition, to satisfy the performance of the drive motors, velocity and acceleration thresholds are set to constrain the performance of the drive motors. Finally, in view of the fact that the operation process of the double pendulum crane is divided into payload transportation and payload loading/unloading, an event-triggering mechanism is designed to switch different control policies in accordance with different operation processes, thus reducing the consumption of computing resources. Through experiments on the actual double pendulum crane and comparison with the existing reference trajectories and input shapers, the superiority of this method is demonstrated.
期刊介绍:
Journal Name: Mechanical Systems and Signal Processing (MSSP)
Interdisciplinary Focus:
Mechanical, Aerospace, and Civil Engineering
Purpose:Reporting scientific advancements of the highest quality
Arising from new techniques in sensing, instrumentation, signal processing, modelling, and control of dynamic systems