基于双延迟深度确定性策略梯度的外骨骼自抗扰控制

IF 5.8 3区计算机科学 Q1 ENGINEERING, MULTIDISCIPLINARY

Journal of Bionic Engineering Pub Date : 2025-03-21 DOI:10.1007/s42235-025-00676-x

Zhong Li, Xiaorong Guan, Chunyang Liu, Dingzhe Li, Long He, Yanfeng Cao, Yi Long

{"title":"基于双延迟深度确定性策略梯度的外骨骼自抗扰控制","authors":"Zhong Li, Xiaorong Guan, Chunyang Liu, Dingzhe Li, Long He, Yanfeng Cao, Yi Long","doi":"10.1007/s42235-025-00676-x","DOIUrl":null,"url":null,"abstract":"<div><p>The study of exoskeletons has been a popular topic worldwide. However, there is still a long way to go before exoskeletons can be widely used. One of the major challenges is control, and there is no specific research trend for controlling exoskeletons. In this paper, we propose a novel exoskeleton control strategy that combines Active Disturbance Rejection Control (ADRC) and Deep Reinforcement Learning (DRL). The dynamic model of the exoskeleton is constructed, followed with the design of the ADRC. To automatically adjust the control parameters of the ADRC, the Twin-Delayed Deep Deterministic Policy Gradient (TD3) is utilized. Then a reward function is defined in terms of the joint angle, angular velocity, and their errors to the desired values, to maximize the accuracy of the joint angle. In the simulations and experiments, a conventional ADRC, and ADRC based on Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) were carried out for comparison with the proposed control method. The results of the tests show that TD3-ADRC has a rapid response, small overshoot, and low Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) followed with the desired, demonstrating the superiority of the proposed control method for the self-learning control of exoskeleton.</p></div>","PeriodicalId":614,"journal":{"name":"Journal of Bionic Engineering","volume":"22 3","pages":"1211 - 1230"},"PeriodicalIF":5.8000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Active Disturbance Rejection Control Based on Twin-Delayed Deep Deterministic Policy Gradient for an Exoskeleton\",\"authors\":\"Zhong Li, Xiaorong Guan, Chunyang Liu, Dingzhe Li, Long He, Yanfeng Cao, Yi Long\",\"doi\":\"10.1007/s42235-025-00676-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The study of exoskeletons has been a popular topic worldwide. However, there is still a long way to go before exoskeletons can be widely used. One of the major challenges is control, and there is no specific research trend for controlling exoskeletons. In this paper, we propose a novel exoskeleton control strategy that combines Active Disturbance Rejection Control (ADRC) and Deep Reinforcement Learning (DRL). The dynamic model of the exoskeleton is constructed, followed with the design of the ADRC. To automatically adjust the control parameters of the ADRC, the Twin-Delayed Deep Deterministic Policy Gradient (TD3) is utilized. Then a reward function is defined in terms of the joint angle, angular velocity, and their errors to the desired values, to maximize the accuracy of the joint angle. In the simulations and experiments, a conventional ADRC, and ADRC based on Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) were carried out for comparison with the proposed control method. The results of the tests show that TD3-ADRC has a rapid response, small overshoot, and low Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) followed with the desired, demonstrating the superiority of the proposed control method for the self-learning control of exoskeleton.</p></div>\",\"PeriodicalId\":614,\"journal\":{\"name\":\"Journal of Bionic Engineering\",\"volume\":\"22 3\",\"pages\":\"1211 - 1230\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Bionic Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s42235-025-00676-x\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Bionic Engineering","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s42235-025-00676-x","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

外骨骼的研究一直是世界范围内的热门话题。然而，外骨骼的广泛应用还有很长的路要走。其中一个主要的挑战是控制，目前还没有具体的研究趋势来控制外骨骼。本文提出一种结合自抗扰控制（ADRC）和深度强化学习（DRL）的新型外骨骼控制策略。建立了外骨骼的动力学模型，进行了自抗扰控制器的设计。为了自动调整自抗扰控制器的控制参数，采用了双延迟深度确定性策略梯度（TD3）。然后根据关节角度、角速度及其误差定义奖励函数，使关节角度的精度最大化。通过仿真和实验，对传统自适应控制方法、基于遗传算法（GA）和粒子群优化（PSO）的自适应控制方法进行了比较。实验结果表明，TD3-ADRC具有响应速度快、超调量小、平均绝对误差（MAE）和均方根误差（RMSE）均低于预期的特点，证明了所提出的外骨骼自学习控制方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Active Disturbance Rejection Control Based on Twin-Delayed Deep Deterministic Policy Gradient for an Exoskeleton

The study of exoskeletons has been a popular topic worldwide. However, there is still a long way to go before exoskeletons can be widely used. One of the major challenges is control, and there is no specific research trend for controlling exoskeletons. In this paper, we propose a novel exoskeleton control strategy that combines Active Disturbance Rejection Control (ADRC) and Deep Reinforcement Learning (DRL). The dynamic model of the exoskeleton is constructed, followed with the design of the ADRC. To automatically adjust the control parameters of the ADRC, the Twin-Delayed Deep Deterministic Policy Gradient (TD3) is utilized. Then a reward function is defined in terms of the joint angle, angular velocity, and their errors to the desired values, to maximize the accuracy of the joint angle. In the simulations and experiments, a conventional ADRC, and ADRC based on Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) were carried out for comparison with the proposed control method. The results of the tests show that TD3-ADRC has a rapid response, small overshoot, and low Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) followed with the desired, demonstrating the superiority of the proposed control method for the self-learning control of exoskeleton.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Bionic Engineering 工程技术-材料科学：生物材料

CiteScore

7.10

自引率

10.00%

发文量

162

审稿时长

10.0 months

期刊介绍： The Journal of Bionic Engineering (JBE) is a peer-reviewed journal that publishes original research papers and reviews that apply the knowledge learned from nature and biological systems to solve concrete engineering problems. The topics that JBE covers include but are not limited to: Mechanisms, kinematical mechanics and control of animal locomotion, development of mobile robots with walking (running and crawling), swimming or flying abilities inspired by animal locomotion. Structures, morphologies, composition and physical properties of natural and biomaterials; fabrication of new materials mimicking the properties and functions of natural and biomaterials. Biomedical materials, artificial organs and tissue engineering for medical applications; rehabilitation equipment and devices. Development of bioinspired computation methods and artificial intelligence for engineering applications.