Yun-Ru Lai , Chih-Cheng Huang , Chia-Yi Lien , Yi-Fang Chiang , Chi-Ping Ting , Chien-Feng Kung , Cheng-Hsien Lu
{"title":"利用三维kinect V2探测器建立帕金森病运动表型的综合深度学习模型","authors":"Yun-Ru Lai , Chih-Cheng Huang , Chia-Yi Lien , Yi-Fang Chiang , Chi-Ping Ting , Chien-Feng Kung , Cheng-Hsien Lu","doi":"10.1016/j.gaitpost.2025.110000","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Gait impairments are common for Parkinson’s disease (PD). With the development of artificial intelligence (AI) technology and three-dimensional Kinect V2 Detectors, it is possible to enable more accurate characterization of gait impairment. We develop a comprehensive prediction model by combining skeleton gait energy image with relative distance and angle for PD motor phenotypes.</div></div><div><h3>Research question</h3><div>Does the hybrid convolutional neural network-long short-term memory (CNN-LSTM) deep learning model improve diagnostic accuracy and outperform CNN or LSTM models in diagnosing different motor phenotypes of PD?</div></div><div><h3>Method</h3><div>We implemented and compared three deep learning architectures—CNN, LSTM, and a hybrid CNN-LSTM model. To mitigate class imbalance and enhance classification accuracy, the Synthetic Minority Oversampling Technique was applied. Feature relevance was determined using Random Forest (RF) and SHapley Additive exPlanations (SHAP), facilitating the identification of key predictors. Participants were stratified into three groups—healthy controls, non-postural instability, and gait disturbance (non-PIGD), and PIGD—based on mean scores from selected items of the Unified Parkinson’s Disease Rating Scale.</div></div><div><h3>Results</h3><div>The CNN–LSTM model demonstrated the highest predictive performance for PIGD classification during straight and turning walking in the off-medication state (AUC = 0.94 for both), followed by the CNN (AUC = 0.85 and 0.88) and LSTM models (AUC = 0.81 and 0.72). Moreover, the CNN–LSTM model achieved the highest classification accuracy across both on- and off-medication conditions. Using the DeLong test, we compared ROC curves of the CNN, LSTM, and hybrid CNN–LSTM models for PIGD classification across straight and turning walking tasks under both on- and off-medication conditions. The hybrid CNN–LSTM model consistently achieved significantly higher AUCs than the CNN and LSTM models in all settings.</div></div><div><h3>Conclusion</h3><div>Our study demonstrated that using a hybrid CNN-LSTM deep learning model in combination with RF and/or SHAP-based feature analysis, can achieve high classification performance.</div></div>","PeriodicalId":12496,"journal":{"name":"Gait & posture","volume":"123 ","pages":"Article 110000"},"PeriodicalIF":2.4000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comprehensive deep learning model for motor phenotypes of Parkinson's disease using three-dimensional kinect V2 detectors\",\"authors\":\"Yun-Ru Lai , Chih-Cheng Huang , Chia-Yi Lien , Yi-Fang Chiang , Chi-Ping Ting , Chien-Feng Kung , Cheng-Hsien Lu\",\"doi\":\"10.1016/j.gaitpost.2025.110000\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Gait impairments are common for Parkinson’s disease (PD). With the development of artificial intelligence (AI) technology and three-dimensional Kinect V2 Detectors, it is possible to enable more accurate characterization of gait impairment. We develop a comprehensive prediction model by combining skeleton gait energy image with relative distance and angle for PD motor phenotypes.</div></div><div><h3>Research question</h3><div>Does the hybrid convolutional neural network-long short-term memory (CNN-LSTM) deep learning model improve diagnostic accuracy and outperform CNN or LSTM models in diagnosing different motor phenotypes of PD?</div></div><div><h3>Method</h3><div>We implemented and compared three deep learning architectures—CNN, LSTM, and a hybrid CNN-LSTM model. To mitigate class imbalance and enhance classification accuracy, the Synthetic Minority Oversampling Technique was applied. Feature relevance was determined using Random Forest (RF) and SHapley Additive exPlanations (SHAP), facilitating the identification of key predictors. Participants were stratified into three groups—healthy controls, non-postural instability, and gait disturbance (non-PIGD), and PIGD—based on mean scores from selected items of the Unified Parkinson’s Disease Rating Scale.</div></div><div><h3>Results</h3><div>The CNN–LSTM model demonstrated the highest predictive performance for PIGD classification during straight and turning walking in the off-medication state (AUC = 0.94 for both), followed by the CNN (AUC = 0.85 and 0.88) and LSTM models (AUC = 0.81 and 0.72). Moreover, the CNN–LSTM model achieved the highest classification accuracy across both on- and off-medication conditions. Using the DeLong test, we compared ROC curves of the CNN, LSTM, and hybrid CNN–LSTM models for PIGD classification across straight and turning walking tasks under both on- and off-medication conditions. The hybrid CNN–LSTM model consistently achieved significantly higher AUCs than the CNN and LSTM models in all settings.</div></div><div><h3>Conclusion</h3><div>Our study demonstrated that using a hybrid CNN-LSTM deep learning model in combination with RF and/or SHAP-based feature analysis, can achieve high classification performance.</div></div>\",\"PeriodicalId\":12496,\"journal\":{\"name\":\"Gait & posture\",\"volume\":\"123 \",\"pages\":\"Article 110000\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Gait & posture\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0966636225007271\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gait & posture","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0966636225007271","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
A comprehensive deep learning model for motor phenotypes of Parkinson's disease using three-dimensional kinect V2 detectors
Background
Gait impairments are common for Parkinson’s disease (PD). With the development of artificial intelligence (AI) technology and three-dimensional Kinect V2 Detectors, it is possible to enable more accurate characterization of gait impairment. We develop a comprehensive prediction model by combining skeleton gait energy image with relative distance and angle for PD motor phenotypes.
Research question
Does the hybrid convolutional neural network-long short-term memory (CNN-LSTM) deep learning model improve diagnostic accuracy and outperform CNN or LSTM models in diagnosing different motor phenotypes of PD?
Method
We implemented and compared three deep learning architectures—CNN, LSTM, and a hybrid CNN-LSTM model. To mitigate class imbalance and enhance classification accuracy, the Synthetic Minority Oversampling Technique was applied. Feature relevance was determined using Random Forest (RF) and SHapley Additive exPlanations (SHAP), facilitating the identification of key predictors. Participants were stratified into three groups—healthy controls, non-postural instability, and gait disturbance (non-PIGD), and PIGD—based on mean scores from selected items of the Unified Parkinson’s Disease Rating Scale.
Results
The CNN–LSTM model demonstrated the highest predictive performance for PIGD classification during straight and turning walking in the off-medication state (AUC = 0.94 for both), followed by the CNN (AUC = 0.85 and 0.88) and LSTM models (AUC = 0.81 and 0.72). Moreover, the CNN–LSTM model achieved the highest classification accuracy across both on- and off-medication conditions. Using the DeLong test, we compared ROC curves of the CNN, LSTM, and hybrid CNN–LSTM models for PIGD classification across straight and turning walking tasks under both on- and off-medication conditions. The hybrid CNN–LSTM model consistently achieved significantly higher AUCs than the CNN and LSTM models in all settings.
Conclusion
Our study demonstrated that using a hybrid CNN-LSTM deep learning model in combination with RF and/or SHAP-based feature analysis, can achieve high classification performance.
期刊介绍:
Gait & Posture is a vehicle for the publication of up-to-date basic and clinical research on all aspects of locomotion and balance.
The topics covered include: Techniques for the measurement of gait and posture, and the standardization of results presentation; Studies of normal and pathological gait; Treatment of gait and postural abnormalities; Biomechanical and theoretical approaches to gait and posture; Mathematical models of joint and muscle mechanics; Neurological and musculoskeletal function in gait and posture; The evolution of upright posture and bipedal locomotion; Adaptations of carrying loads, walking on uneven surfaces, climbing stairs etc; spinal biomechanics only if they are directly related to gait and/or posture and are of general interest to our readers; The effect of aging and development on gait and posture; Psychological and cultural aspects of gait; Patient education.