一种五维三层数字孪生体训练强化学习代理用于青少年特发性脊柱侧凸康复机器人外骨骼的交互控制

IF 3.6 Q1 ENGINEERING, MECHANICAL

国际机械系统动力学学报(英文) Pub Date : 2025-05-11 DOI:10.1002/msd2.70020

Farhad Farhadiyadkuri, Xuping Zhang

{"title":"一种五维三层数字孪生体训练强化学习代理用于青少年特发性脊柱侧凸康复机器人外骨骼的交互控制","authors":"Farhad Farhadiyadkuri, Xuping Zhang","doi":"10.1002/msd2.70020","DOIUrl":null,"url":null,"abstract":"<p>Adolescent idiopathic scoliosis (AIS) is a sideway curvature of the spinal column combined with a vertebral rotation that usually occurs in adolescents without any known causes. Bracing, the most common conservative treatment of AIS, has not fully exploited the benefits of the active control approaches powered by artificial intelligence (AI), although AI has entered a wide range of applications. The correction forces exerted by the brace are controlled passively by regulating the tightness of the brace's strap. Besides, training the learning-based control methods using a virtual model is of high importance in the AIS brace treatment, since training using trial and error on human subjects may result in unexpected pressure and injuries on the patient's torso. However, digital twin (DT) modeling, an emerging technology, has not been implemented into the AIS brace treatment yet. In this paper, reinforcement learning-based position-based impedance control (RLPIC) is proposed to enable a robotic brace to learn the desired physical interaction between the robotic brace and the human torso. A five-dimensional (5D) three-layer DT is also developed to be used for training the RLPIC in a simulated environment. The 5D three-layer DT consists of a physical system, a three-layer digital model of the physical system, including the robotic brace, human torso, and the physical human–robot interaction (HRI), a bidirectional connection between them, and an optimization dimension. A neural network-based regression model is also proposed to estimate the unknown parameters of the digital model. Numerical simulations and real-time experiments are performed to validate the 5D three-layer DT model. The proposed RLPIC trained using the 5D three-layer DT is verified using numerical simulations in terms of position tracking, velocity tracking, and HRI control. It is concluded that the proposed learning-based interaction control approach can improve the HRI control by learning the desired interaction in a simulated environment.</p>","PeriodicalId":60486,"journal":{"name":"国际机械系统动力学学报(英文)","volume":"5 3","pages":"385-400"},"PeriodicalIF":3.6000,"publicationDate":"2025-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/msd2.70020","citationCount":"0","resultStr":"{\"title\":\"A Five-Dimensional Three-Layer Digital Twin to Train a Reinforcement Learning Agent for Interaction Control of a Robotic Exoskeleton in Adolescent Idiopathic Scoliosis Rehabilitation\",\"authors\":\"Farhad Farhadiyadkuri, Xuping Zhang\",\"doi\":\"10.1002/msd2.70020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Adolescent idiopathic scoliosis (AIS) is a sideway curvature of the spinal column combined with a vertebral rotation that usually occurs in adolescents without any known causes. Bracing, the most common conservative treatment of AIS, has not fully exploited the benefits of the active control approaches powered by artificial intelligence (AI), although AI has entered a wide range of applications. The correction forces exerted by the brace are controlled passively by regulating the tightness of the brace's strap. Besides, training the learning-based control methods using a virtual model is of high importance in the AIS brace treatment, since training using trial and error on human subjects may result in unexpected pressure and injuries on the patient's torso. However, digital twin (DT) modeling, an emerging technology, has not been implemented into the AIS brace treatment yet. In this paper, reinforcement learning-based position-based impedance control (RLPIC) is proposed to enable a robotic brace to learn the desired physical interaction between the robotic brace and the human torso. A five-dimensional (5D) three-layer DT is also developed to be used for training the RLPIC in a simulated environment. The 5D three-layer DT consists of a physical system, a three-layer digital model of the physical system, including the robotic brace, human torso, and the physical human–robot interaction (HRI), a bidirectional connection between them, and an optimization dimension. A neural network-based regression model is also proposed to estimate the unknown parameters of the digital model. Numerical simulations and real-time experiments are performed to validate the 5D three-layer DT model. The proposed RLPIC trained using the 5D three-layer DT is verified using numerical simulations in terms of position tracking, velocity tracking, and HRI control. It is concluded that the proposed learning-based interaction control approach can improve the HRI control by learning the desired interaction in a simulated environment.</p>\",\"PeriodicalId\":60486,\"journal\":{\"name\":\"国际机械系统动力学学报(英文)\",\"volume\":\"5 3\",\"pages\":\"385-400\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-05-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/msd2.70020\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"国际机械系统动力学学报(英文)\",\"FirstCategoryId\":\"1087\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/msd2.70020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MECHANICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"国际机械系统动力学学报(英文)","FirstCategoryId":"1087","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/msd2.70020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}

引用次数: 0

摘要

青少年特发性脊柱侧弯（AIS）是一种脊柱侧弯合并椎体旋转，通常发生在青少年，没有任何已知的原因。尽管人工智能（AI）已经进入了广泛的应用领域，但作为AIS最常见的保守治疗方法，支撑并没有充分利用由人工智能（AI）驱动的主动控制方法的优势。通过调节支撑带的松紧度来被动地控制支撑施加的校正力。此外，使用虚拟模型训练基于学习的控制方法在AIS支架治疗中非常重要，因为在人类受试者上使用试错法训练可能会对患者的躯干造成意想不到的压力和伤害。然而，数字孪生（DT）建模这一新兴技术尚未应用于AIS支架治疗中。本文提出了基于强化学习的基于位置的阻抗控制（RLPIC），使机器人支架能够学习机器人支架与人体躯干之间所需的物理相互作用。一个五维（5D）三层DT也被开发用于在模拟环境中训练RLPIC。5D三层DT包括一个物理系统，一个物理系统的三层数字模型，包括机器人支架、人体躯干和物理人机交互（HRI），它们之间的双向连接，以及一个优化维度。提出了一种基于神经网络的回归模型来估计数字模型的未知参数。通过数值模拟和实时实验对5D三层DT模型进行了验证。利用5D三层DT训练的RLPIC在位置跟踪、速度跟踪和HRI控制方面进行了数值模拟验证。研究结果表明，基于学习的交互控制方法可以通过在模拟环境中学习期望的交互来改善HRI控制。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

A Five-Dimensional Three-Layer Digital Twin to Train a Reinforcement Learning Agent for Interaction Control of a Robotic Exoskeleton in Adolescent Idiopathic Scoliosis Rehabilitation

查看原文本刊更多论文

A Five-Dimensional Three-Layer Digital Twin to Train a Reinforcement Learning Agent for Interaction Control of a Robotic Exoskeleton in Adolescent Idiopathic Scoliosis Rehabilitation

Adolescent idiopathic scoliosis (AIS) is a sideway curvature of the spinal column combined with a vertebral rotation that usually occurs in adolescents without any known causes. Bracing, the most common conservative treatment of AIS, has not fully exploited the benefits of the active control approaches powered by artificial intelligence (AI), although AI has entered a wide range of applications. The correction forces exerted by the brace are controlled passively by regulating the tightness of the brace's strap. Besides, training the learning-based control methods using a virtual model is of high importance in the AIS brace treatment, since training using trial and error on human subjects may result in unexpected pressure and injuries on the patient's torso. However, digital twin (DT) modeling, an emerging technology, has not been implemented into the AIS brace treatment yet. In this paper, reinforcement learning-based position-based impedance control (RLPIC) is proposed to enable a robotic brace to learn the desired physical interaction between the robotic brace and the human torso. A five-dimensional (5D) three-layer DT is also developed to be used for training the RLPIC in a simulated environment. The 5D three-layer DT consists of a physical system, a three-layer digital model of the physical system, including the robotic brace, human torso, and the physical human–robot interaction (HRI), a bidirectional connection between them, and an optimization dimension. A neural network-based regression model is also proposed to estimate the unknown parameters of the digital model. Numerical simulations and real-time experiments are performed to validate the 5D three-layer DT model. The proposed RLPIC trained using the 5D three-layer DT is verified using numerical simulations in terms of position tracking, velocity tracking, and HRI control. It is concluded that the proposed learning-based interaction control approach can improve the HRI control by learning the desired interaction in a simulated environment.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

国际机械系统动力学学报(英文)

CiteScore

3.50

自引率

0.00%

发文量