Diffusion models enable zero-shot pose estimation for lower-limb prosthetic users.

PLOS digital health Pub Date : 2025-03-31 eCollection Date: 2025-03-01 DOI:10.1371/journal.pdig.0000745

Tianxun Zhou, Muhammad Nur Shahril Iskandar, Keng-Hwee Chiam

{"title":"Diffusion models enable zero-shot pose estimation for lower-limb prosthetic users.","authors":"Tianxun Zhou, Muhammad Nur Shahril Iskandar, Keng-Hwee Chiam","doi":"10.1371/journal.pdig.0000745","DOIUrl":null,"url":null,"abstract":"<p><p>Quantitative gait analysis is important for assessing and rehabilitating lower-limb prosthetic users, but markerless motion capture has been challenging for this population due to the difficulty in detecting prosthetic joints using models trained primarily on able-bodied individuals. This study proposes a zero-shot method leveraging generative diffusion models to transform prosthetic limb images into able-bodied representations that standard pose estimation models can detect, eliminating the need for additional data collection or model retraining. Videos of unilateral transfemoral and transtibial amputees walking were obtained publicly from YouTube. For each video frame, an edge map was generated and used as input to a ControlNet diffusion model, generating a synthetic image resembling an able-bodied person while preserving the person's original pose. These synthetic images were then passed through OpenPose. The zero-shot approach achieved substantial reductions in keypoint coordinate errors of 37% for transtibial and 76% for transfemoral prosthetic limbs compared to OpenPose on the original videos. The method enabled the identification and quantification of key gait deviations such as reduced knee flexion and altered kinematics timing between prosthetic and intact limbs. While the results demonstrate the feasibility of markerless gait analysis for lower-limb prosthetic users, the study's findings are based on a limited dataset of publicly available videos, and caution should be exercised in generalizing the results to broader populations due to the varying nature of prosthetic designs. Nonetheless, this approach has the potential to facilitate personalized rehabilitation using standard consumer cameras and existing pose estimation models.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"4 3","pages":"e0000745"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11957558/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLOS digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1371/journal.pdig.0000745","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Quantitative gait analysis is important for assessing and rehabilitating lower-limb prosthetic users, but markerless motion capture has been challenging for this population due to the difficulty in detecting prosthetic joints using models trained primarily on able-bodied individuals. This study proposes a zero-shot method leveraging generative diffusion models to transform prosthetic limb images into able-bodied representations that standard pose estimation models can detect, eliminating the need for additional data collection or model retraining. Videos of unilateral transfemoral and transtibial amputees walking were obtained publicly from YouTube. For each video frame, an edge map was generated and used as input to a ControlNet diffusion model, generating a synthetic image resembling an able-bodied person while preserving the person's original pose. These synthetic images were then passed through OpenPose. The zero-shot approach achieved substantial reductions in keypoint coordinate errors of 37% for transtibial and 76% for transfemoral prosthetic limbs compared to OpenPose on the original videos. The method enabled the identification and quantification of key gait deviations such as reduced knee flexion and altered kinematics timing between prosthetic and intact limbs. While the results demonstrate the feasibility of markerless gait analysis for lower-limb prosthetic users, the study's findings are based on a limited dataset of publicly available videos, and caution should be exercised in generalizing the results to broader populations due to the varying nature of prosthetic designs. Nonetheless, this approach has the potential to facilitate personalized rehabilitation using standard consumer cameras and existing pose estimation models.

查看原文本刊更多论文

扩散模型使零射击姿态估计下肢假肢用户。

定量步态分析对于评估和康复下肢假肢使用者是很重要的，但是无标记运动捕捉对于这一人群来说是一个挑战，因为使用主要在健全个体上训练的模型来检测假肢关节是困难的。本研究提出了一种零射击方法，利用生成扩散模型将假肢图像转换为标准姿态估计模型可以检测到的健全表征，从而消除了额外的数据收集或模型再训练的需要。单侧经股骨和经胫骨截肢者行走的视频从YouTube上公开获得。对于每个视频帧，生成一个边缘图，并将其作为ControlNet扩散模型的输入，生成一个类似于健全人的合成图像，同时保留人的原始姿势。然后通过OpenPose传递这些合成图像。与原始视频上的OpenPose相比，零射击方法使经胫骨和经股骨假肢的关键点坐标误差大幅降低了37%和76%。该方法能够识别和量化关键的步态偏差，如膝关节屈曲减少和假肢与完整肢体之间运动时间的改变。虽然研究结果证明了下肢假肢使用者无标记步态分析的可行性，但研究结果是基于有限的公开视频数据集，由于假肢设计的不同性质，在将结果推广到更广泛的人群时应谨慎行事。尽管如此，这种方法有可能促进使用标准消费者相机和现有姿势估计模型的个性化康复。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

PLOS digital health

自引率

0.00%

发文量