Emil Mühlbradt Sveen, Jing Zhou, Morten Kjeld Ebbesen, Mohammad Poursina
{"title":"未知参数机器人机械臂的分散自适应学习控制","authors":"Emil Mühlbradt Sveen, Jing Zhou, Morten Kjeld Ebbesen, Mohammad Poursina","doi":"10.1016/j.jai.2025.02.002","DOIUrl":null,"url":null,"abstract":"<div><div>This paper studies motor joint control of a 4-degree-of-freedom (DoF) robotic manipulator using learning-based Adaptive Dynamic Programming (ADP) approach. The manipulator’s dynamics are modelled as an open-loop 4-link serial kinematic chain with 4 Degrees of Freedom (DoF). Decentralised optimal controllers are designed for each link using ADP approach based on a set of cost matrices and data collected from exploration trajectories. The proposed control strategy employs an off-line, off-policy iterative approach to derive four optimal control policies, one for each joint, under exploration strategies. The objective of the controller is to control the position of each joint. Simulation and experimental results show that four independent optimal controllers are found, each under similar exploration strategies, and the proposed ADP approach successfully yields optimal linear control policies despite the presence of these complexities. The experimental results conducted on the Quanser Qarm robotic platform demonstrate the effectiveness of the proposed ADP controllers in handling significant dynamic nonlinearities, such as actuation limitations, output saturation, and filter delays.</div></div>","PeriodicalId":100755,"journal":{"name":"Journal of Automation and Intelligence","volume":"4 2","pages":"Pages 136-144"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Decentralised adaptive learning-based control of robot manipulators with unknown parameters\",\"authors\":\"Emil Mühlbradt Sveen, Jing Zhou, Morten Kjeld Ebbesen, Mohammad Poursina\",\"doi\":\"10.1016/j.jai.2025.02.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper studies motor joint control of a 4-degree-of-freedom (DoF) robotic manipulator using learning-based Adaptive Dynamic Programming (ADP) approach. The manipulator’s dynamics are modelled as an open-loop 4-link serial kinematic chain with 4 Degrees of Freedom (DoF). Decentralised optimal controllers are designed for each link using ADP approach based on a set of cost matrices and data collected from exploration trajectories. The proposed control strategy employs an off-line, off-policy iterative approach to derive four optimal control policies, one for each joint, under exploration strategies. The objective of the controller is to control the position of each joint. Simulation and experimental results show that four independent optimal controllers are found, each under similar exploration strategies, and the proposed ADP approach successfully yields optimal linear control policies despite the presence of these complexities. The experimental results conducted on the Quanser Qarm robotic platform demonstrate the effectiveness of the proposed ADP controllers in handling significant dynamic nonlinearities, such as actuation limitations, output saturation, and filter delays.</div></div>\",\"PeriodicalId\":100755,\"journal\":{\"name\":\"Journal of Automation and Intelligence\",\"volume\":\"4 2\",\"pages\":\"Pages 136-144\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Automation and Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2949855425000073\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Automation and Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949855425000073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Decentralised adaptive learning-based control of robot manipulators with unknown parameters
This paper studies motor joint control of a 4-degree-of-freedom (DoF) robotic manipulator using learning-based Adaptive Dynamic Programming (ADP) approach. The manipulator’s dynamics are modelled as an open-loop 4-link serial kinematic chain with 4 Degrees of Freedom (DoF). Decentralised optimal controllers are designed for each link using ADP approach based on a set of cost matrices and data collected from exploration trajectories. The proposed control strategy employs an off-line, off-policy iterative approach to derive four optimal control policies, one for each joint, under exploration strategies. The objective of the controller is to control the position of each joint. Simulation and experimental results show that four independent optimal controllers are found, each under similar exploration strategies, and the proposed ADP approach successfully yields optimal linear control policies despite the presence of these complexities. The experimental results conducted on the Quanser Qarm robotic platform demonstrate the effectiveness of the proposed ADP controllers in handling significant dynamic nonlinearities, such as actuation limitations, output saturation, and filter delays.