Development and validation of machine learning models for classifying cancer-related sarcopenia using Kinect-based mixed-reality exercises in breast cancer survivors.
{"title":"Development and validation of machine learning models for classifying cancer-related sarcopenia using Kinect-based mixed-reality exercises in breast cancer survivors.","authors":"Byunggul Lim, Wook Song","doi":"10.21037/tcr-2024-2337","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Sarcopenia in cancer survivors is often underdiagnosed due to limited access to imaging-based diagnostic tools such as computed tomography (CT) or dual-energy X-ray absorptiometry (DXA). Indirect classification using movement data may offer a practical, scalable alternative. This study aimed to develop and validate machine learning (ML)-based classification models for cancer-related sarcopenia using joint angle data obtained from Kinect-based mixed-reality (KMR) devices, aiming to improve classification accuracy and identify key movement-related predictors.</p><p><strong>Methods: </strong>Overall, 77 breast cancer survivors (mean age, 48.9±5.4 years) were included based on stage I-III diagnosis, treatment completion ≥6 months prior, no metastasis, low physical activity, and no major comorbidities. Sarcopenia was diagnosed using skeletal muscle index (SMI) (<5.7 kg/m<sup>2</sup>) and handgrip strength (HGS) (<18 kg). KMR device data were collected during 8 weeks of exercise. After preprocessing, the dataset was randomly split (8:2) for training and testing. Four ML models-support vector machine (SVM), K-nearest neighbor (KNN), random forest (RF), and XGBoost (XGB)-were trained. Five-fold cross-validation was used for tuning, and feature importance was analyzed.</p><p><strong>Results: </strong>Of the 38 participants in the exercise group included in the final analysis, 12 (31.5%) were initially diagnosed with sarcopenia. After the 8-week KMR device exercise intervention, 3 participants showed recovery from sarcopenia, resulting in 9 (23.6%) remaining classified with the condition. In the test set, the XGB model demonstrated the highest performance, achieving 94.7% accuracy, 91.2% recall, 95.8% precision, 93.4% F1 score, and 96.2% area under the curve (AUC). Feature importance analysis using RF and XGB consistently identified right \"knee flexion (right)\" as the most influential predictor.</p><p><strong>Conclusions: </strong>Among ML classification models trained on KMR device joint data, XGB demonstrated the best performance. Right knee flexion emerged as the most influential feature in sarcopenia classification. These findings suggest that KMR device movement analysis may serve as a practical, non-invasive screening tool for sarcopenia, enabling early detection and personalized intervention strategies for breast cancer survivors in both clinical and remote settings.</p>","PeriodicalId":23216,"journal":{"name":"Translational cancer research","volume":"14 7","pages":"4208-4218"},"PeriodicalIF":1.7000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12335685/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tcr-2024-2337","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/22 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Sarcopenia in cancer survivors is often underdiagnosed due to limited access to imaging-based diagnostic tools such as computed tomography (CT) or dual-energy X-ray absorptiometry (DXA). Indirect classification using movement data may offer a practical, scalable alternative. This study aimed to develop and validate machine learning (ML)-based classification models for cancer-related sarcopenia using joint angle data obtained from Kinect-based mixed-reality (KMR) devices, aiming to improve classification accuracy and identify key movement-related predictors.
Methods: Overall, 77 breast cancer survivors (mean age, 48.9±5.4 years) were included based on stage I-III diagnosis, treatment completion ≥6 months prior, no metastasis, low physical activity, and no major comorbidities. Sarcopenia was diagnosed using skeletal muscle index (SMI) (<5.7 kg/m2) and handgrip strength (HGS) (<18 kg). KMR device data were collected during 8 weeks of exercise. After preprocessing, the dataset was randomly split (8:2) for training and testing. Four ML models-support vector machine (SVM), K-nearest neighbor (KNN), random forest (RF), and XGBoost (XGB)-were trained. Five-fold cross-validation was used for tuning, and feature importance was analyzed.
Results: Of the 38 participants in the exercise group included in the final analysis, 12 (31.5%) were initially diagnosed with sarcopenia. After the 8-week KMR device exercise intervention, 3 participants showed recovery from sarcopenia, resulting in 9 (23.6%) remaining classified with the condition. In the test set, the XGB model demonstrated the highest performance, achieving 94.7% accuracy, 91.2% recall, 95.8% precision, 93.4% F1 score, and 96.2% area under the curve (AUC). Feature importance analysis using RF and XGB consistently identified right "knee flexion (right)" as the most influential predictor.
Conclusions: Among ML classification models trained on KMR device joint data, XGB demonstrated the best performance. Right knee flexion emerged as the most influential feature in sarcopenia classification. These findings suggest that KMR device movement analysis may serve as a practical, non-invasive screening tool for sarcopenia, enabling early detection and personalized intervention strategies for breast cancer survivors in both clinical and remote settings.
期刊介绍:
Translational Cancer Research (Transl Cancer Res TCR; Print ISSN: 2218-676X; Online ISSN 2219-6803; http://tcr.amegroups.com/) is an Open Access, peer-reviewed journal, indexed in Science Citation Index Expanded (SCIE). TCR publishes laboratory studies of novel therapeutic interventions as well as clinical trials which evaluate new treatment paradigms for cancer; results of novel research investigations which bridge the laboratory and clinical settings including risk assessment, cellular and molecular characterization, prevention, detection, diagnosis and treatment of human cancers with the overall goal of improving the clinical care of cancer patients. The focus of TCR is original, peer-reviewed, science-based research that successfully advances clinical medicine toward the goal of improving patients'' quality of life. The editors and an international advisory group of scientists and clinician-scientists as well as other experts will hold TCR articles to the high-quality standards. We accept Original Articles as well as Review Articles, Editorials and Brief Articles.