基于双延迟深度确定性策略梯度的外骨骼自抗扰控制

IF 5.8 3区 计算机科学 Q1 ENGINEERING, MULTIDISCIPLINARY
Zhong Li, Xiaorong Guan, Chunyang Liu, Dingzhe Li, Long He, Yanfeng Cao, Yi Long
{"title":"基于双延迟深度确定性策略梯度的外骨骼自抗扰控制","authors":"Zhong Li,&nbsp;Xiaorong Guan,&nbsp;Chunyang Liu,&nbsp;Dingzhe Li,&nbsp;Long He,&nbsp;Yanfeng Cao,&nbsp;Yi Long","doi":"10.1007/s42235-025-00676-x","DOIUrl":null,"url":null,"abstract":"<div><p>The study of exoskeletons has been a popular topic worldwide. However, there is still a long way to go before exoskeletons can be widely used. One of the major challenges is control, and there is no specific research trend for controlling exoskeletons. In this paper, we propose a novel exoskeleton control strategy that combines Active Disturbance Rejection Control (ADRC) and Deep Reinforcement Learning (DRL). The dynamic model of the exoskeleton is constructed, followed with the design of the ADRC. To automatically adjust the control parameters of the ADRC, the Twin-Delayed Deep Deterministic Policy Gradient (TD3) is utilized. Then a reward function is defined in terms of the joint angle, angular velocity, and their errors to the desired values, to maximize the accuracy of the joint angle. In the simulations and experiments, a conventional ADRC, and ADRC based on Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) were carried out for comparison with the proposed control method. The results of the tests show that TD3-ADRC has a rapid response, small overshoot, and low Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) followed with the desired, demonstrating the superiority of the proposed control method for the self-learning control of exoskeleton.</p></div>","PeriodicalId":614,"journal":{"name":"Journal of Bionic Engineering","volume":"22 3","pages":"1211 - 1230"},"PeriodicalIF":5.8000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Active Disturbance Rejection Control Based on Twin-Delayed Deep Deterministic Policy Gradient for an Exoskeleton\",\"authors\":\"Zhong Li,&nbsp;Xiaorong Guan,&nbsp;Chunyang Liu,&nbsp;Dingzhe Li,&nbsp;Long He,&nbsp;Yanfeng Cao,&nbsp;Yi Long\",\"doi\":\"10.1007/s42235-025-00676-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The study of exoskeletons has been a popular topic worldwide. However, there is still a long way to go before exoskeletons can be widely used. One of the major challenges is control, and there is no specific research trend for controlling exoskeletons. In this paper, we propose a novel exoskeleton control strategy that combines Active Disturbance Rejection Control (ADRC) and Deep Reinforcement Learning (DRL). The dynamic model of the exoskeleton is constructed, followed with the design of the ADRC. To automatically adjust the control parameters of the ADRC, the Twin-Delayed Deep Deterministic Policy Gradient (TD3) is utilized. Then a reward function is defined in terms of the joint angle, angular velocity, and their errors to the desired values, to maximize the accuracy of the joint angle. In the simulations and experiments, a conventional ADRC, and ADRC based on Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) were carried out for comparison with the proposed control method. The results of the tests show that TD3-ADRC has a rapid response, small overshoot, and low Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) followed with the desired, demonstrating the superiority of the proposed control method for the self-learning control of exoskeleton.</p></div>\",\"PeriodicalId\":614,\"journal\":{\"name\":\"Journal of Bionic Engineering\",\"volume\":\"22 3\",\"pages\":\"1211 - 1230\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Bionic Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s42235-025-00676-x\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Bionic Engineering","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s42235-025-00676-x","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

外骨骼的研究一直是世界范围内的热门话题。然而,外骨骼的广泛应用还有很长的路要走。其中一个主要的挑战是控制,目前还没有具体的研究趋势来控制外骨骼。本文提出一种结合自抗扰控制(ADRC)和深度强化学习(DRL)的新型外骨骼控制策略。建立了外骨骼的动力学模型,进行了自抗扰控制器的设计。为了自动调整自抗扰控制器的控制参数,采用了双延迟深度确定性策略梯度(TD3)。然后根据关节角度、角速度及其误差定义奖励函数,使关节角度的精度最大化。通过仿真和实验,对传统自适应控制方法、基于遗传算法(GA)和粒子群优化(PSO)的自适应控制方法进行了比较。实验结果表明,TD3-ADRC具有响应速度快、超调量小、平均绝对误差(MAE)和均方根误差(RMSE)均低于预期的特点,证明了所提出的外骨骼自学习控制方法的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Active Disturbance Rejection Control Based on Twin-Delayed Deep Deterministic Policy Gradient for an Exoskeleton

The study of exoskeletons has been a popular topic worldwide. However, there is still a long way to go before exoskeletons can be widely used. One of the major challenges is control, and there is no specific research trend for controlling exoskeletons. In this paper, we propose a novel exoskeleton control strategy that combines Active Disturbance Rejection Control (ADRC) and Deep Reinforcement Learning (DRL). The dynamic model of the exoskeleton is constructed, followed with the design of the ADRC. To automatically adjust the control parameters of the ADRC, the Twin-Delayed Deep Deterministic Policy Gradient (TD3) is utilized. Then a reward function is defined in terms of the joint angle, angular velocity, and their errors to the desired values, to maximize the accuracy of the joint angle. In the simulations and experiments, a conventional ADRC, and ADRC based on Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) were carried out for comparison with the proposed control method. The results of the tests show that TD3-ADRC has a rapid response, small overshoot, and low Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) followed with the desired, demonstrating the superiority of the proposed control method for the self-learning control of exoskeleton.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Bionic Engineering
Journal of Bionic Engineering 工程技术-材料科学:生物材料
CiteScore
7.10
自引率
10.00%
发文量
162
审稿时长
10.0 months
期刊介绍: The Journal of Bionic Engineering (JBE) is a peer-reviewed journal that publishes original research papers and reviews that apply the knowledge learned from nature and biological systems to solve concrete engineering problems. The topics that JBE covers include but are not limited to: Mechanisms, kinematical mechanics and control of animal locomotion, development of mobile robots with walking (running and crawling), swimming or flying abilities inspired by animal locomotion. Structures, morphologies, composition and physical properties of natural and biomaterials; fabrication of new materials mimicking the properties and functions of natural and biomaterials. Biomedical materials, artificial organs and tissue engineering for medical applications; rehabilitation equipment and devices. Development of bioinspired computation methods and artificial intelligence for engineering applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信