Tianxun Zhou, Muhammad Nur Shahril Iskandar, Keng-Hwee Chiam
{"title":"Diffusion models enable zero-shot pose estimation for lower-limb prosthetic users.","authors":"Tianxun Zhou, Muhammad Nur Shahril Iskandar, Keng-Hwee Chiam","doi":"10.1371/journal.pdig.0000745","DOIUrl":null,"url":null,"abstract":"<p><p>Quantitative gait analysis is important for assessing and rehabilitating lower-limb prosthetic users, but markerless motion capture has been challenging for this population due to the difficulty in detecting prosthetic joints using models trained primarily on able-bodied individuals. This study proposes a zero-shot method leveraging generative diffusion models to transform prosthetic limb images into able-bodied representations that standard pose estimation models can detect, eliminating the need for additional data collection or model retraining. Videos of unilateral transfemoral and transtibial amputees walking were obtained publicly from YouTube. For each video frame, an edge map was generated and used as input to a ControlNet diffusion model, generating a synthetic image resembling an able-bodied person while preserving the person's original pose. These synthetic images were then passed through OpenPose. The zero-shot approach achieved substantial reductions in keypoint coordinate errors of 37% for transtibial and 76% for transfemoral prosthetic limbs compared to OpenPose on the original videos. The method enabled the identification and quantification of key gait deviations such as reduced knee flexion and altered kinematics timing between prosthetic and intact limbs. While the results demonstrate the feasibility of markerless gait analysis for lower-limb prosthetic users, the study's findings are based on a limited dataset of publicly available videos, and caution should be exercised in generalizing the results to broader populations due to the varying nature of prosthetic designs. Nonetheless, this approach has the potential to facilitate personalized rehabilitation using standard consumer cameras and existing pose estimation models.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"4 3","pages":"e0000745"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11957558/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLOS digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1371/journal.pdig.0000745","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Quantitative gait analysis is important for assessing and rehabilitating lower-limb prosthetic users, but markerless motion capture has been challenging for this population due to the difficulty in detecting prosthetic joints using models trained primarily on able-bodied individuals. This study proposes a zero-shot method leveraging generative diffusion models to transform prosthetic limb images into able-bodied representations that standard pose estimation models can detect, eliminating the need for additional data collection or model retraining. Videos of unilateral transfemoral and transtibial amputees walking were obtained publicly from YouTube. For each video frame, an edge map was generated and used as input to a ControlNet diffusion model, generating a synthetic image resembling an able-bodied person while preserving the person's original pose. These synthetic images were then passed through OpenPose. The zero-shot approach achieved substantial reductions in keypoint coordinate errors of 37% for transtibial and 76% for transfemoral prosthetic limbs compared to OpenPose on the original videos. The method enabled the identification and quantification of key gait deviations such as reduced knee flexion and altered kinematics timing between prosthetic and intact limbs. While the results demonstrate the feasibility of markerless gait analysis for lower-limb prosthetic users, the study's findings are based on a limited dataset of publicly available videos, and caution should be exercised in generalizing the results to broader populations due to the varying nature of prosthetic designs. Nonetheless, this approach has the potential to facilitate personalized rehabilitation using standard consumer cameras and existing pose estimation models.