Comparative Analysis of Temporal Difference Learning Methods to Learn General Value Functions of Lower-Limb Signals.

IEEE ... International Conference on Rehabilitation Robotics : [proceedings] Pub Date : 2025-05-01 DOI:10.1109/ICORR66766.2025.11063114

Sonny T Jones, Grange M Simpson, Wyatt M J Young, Kylee North, Patrick M Pilarski, Ashley N Dalrymple

{"title":"Comparative Analysis of Temporal Difference Learning Methods to Learn General Value Functions of Lower-Limb Signals.","authors":"Sonny T Jones, Grange M Simpson, Wyatt M J Young, Kylee North, Patrick M Pilarski, Ashley N Dalrymple","doi":"10.1109/ICORR66766.2025.11063114","DOIUrl":null,"url":null,"abstract":"<p><p>Millions of people in the United States suffer from paralysis, resulting in significant deficits in motor function. Restricted mobility due to these deficits and the lack of adaptive rehabilitative solutions make traversing complex and challenging terrains unsafe. Exoskeletons offer a promising solution, but their effectiveness could be greatly enhanced by incorporating reinforcement learning algorithms for real-time adaptation to changing environments and the user's unique gait biomechanics. This study explored different temporal difference learning methods for predicting signals recorded from sensors on the lower-limbs, including muscle activation from electromyography, underfoot pressure, and joint angles from goniometers. Specifically, the performance of the temporal difference learning methods TD $(\\lambda)$, TOTD, and SwiftTD to quickly and accurately predict these signals was examined. From initial findings, SwiftTD generally converged faster, while TOTD typically achieved lower convergence errors. These outcomes varied depending on the specific signal that was being predicted, highlighting the need for careful consideration of algorithm choice depending on the signal, accuracy, and speed. The results, therefore, support the informed selection of specific algorithms for providing predictive knowledge to adaptive, machine learning-controlled assistive rehabilitative technologies. These findings will enable the selection of appropriate predictive algorithms, leading to the development of better exoskeletons and other assistive devices to enhance the mobility and quality of life of individuals with motor paralysis.</p>","PeriodicalId":73276,"journal":{"name":"IEEE ... International Conference on Rehabilitation Robotics : [proceedings]","volume":"2025 ","pages":"1209-1214"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE ... International Conference on Rehabilitation Robotics : [proceedings]","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICORR66766.2025.11063114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Millions of people in the United States suffer from paralysis, resulting in significant deficits in motor function. Restricted mobility due to these deficits and the lack of adaptive rehabilitative solutions make traversing complex and challenging terrains unsafe. Exoskeletons offer a promising solution, but their effectiveness could be greatly enhanced by incorporating reinforcement learning algorithms for real-time adaptation to changing environments and the user's unique gait biomechanics. This study explored different temporal difference learning methods for predicting signals recorded from sensors on the lower-limbs, including muscle activation from electromyography, underfoot pressure, and joint angles from goniometers. Specifically, the performance of the temporal difference learning methods TD $(\lambda)$, TOTD, and SwiftTD to quickly and accurately predict these signals was examined. From initial findings, SwiftTD generally converged faster, while TOTD typically achieved lower convergence errors. These outcomes varied depending on the specific signal that was being predicted, highlighting the need for careful consideration of algorithm choice depending on the signal, accuracy, and speed. The results, therefore, support the informed selection of specific algorithms for providing predictive knowledge to adaptive, machine learning-controlled assistive rehabilitative technologies. These findings will enable the selection of appropriate predictive algorithms, leading to the development of better exoskeletons and other assistive devices to enhance the mobility and quality of life of individuals with motor paralysis.

查看原文本刊更多论文

学习下肢信号一般值函数的时间差分学习方法的比较分析。

在美国，数以百万计的人患有瘫痪，导致运动功能严重缺陷。由于这些缺陷和缺乏适应性恢复解决方案而限制了行动，使得穿越复杂和具有挑战性的地形变得不安全。外骨骼提供了一个很有前途的解决方案，但通过结合强化学习算法来实时适应不断变化的环境和用户独特的步态生物力学，它们的有效性可以大大提高。本研究探索了不同的时间差异学习方法，用于预测下肢传感器记录的信号，包括肌电图记录的肌肉激活、足下压力和测角仪记录的关节角度。具体而言，考察了时间差分学习方法TD $(\lambda)$、TOTD和SwiftTD快速准确预测这些信号的性能。从最初的研究结果来看，SwiftTD通常收敛更快，而TOTD通常收敛误差更低。这些结果取决于所预测的特定信号，这突出了根据信号、精度和速度仔细考虑算法选择的必要性。因此，研究结果支持对特定算法的明智选择，为自适应、机器学习控制的辅助康复技术提供预测知识。这些发现将有助于选择合适的预测算法，从而开发出更好的外骨骼和其他辅助设备，以提高运动瘫痪患者的行动能力和生活质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE ... International Conference on Rehabilitation Robotics : [proceedings]

CiteScore

0.50

自引率

0.00%

发文量