Explainable machine learning-driven models for predicting Parkinson's disease and its prognosis: obesity patterns associations and models development using NHANES 1999-2018 data.
Jiaxin Fan, Shuai Cao, Hang Peng, Yuanjie Zhi, Shuqin Zhan, Rui Li
{"title":"Explainable machine learning-driven models for predicting Parkinson's disease and its prognosis: obesity patterns associations and models development using NHANES 1999-2018 data.","authors":"Jiaxin Fan, Shuai Cao, Hang Peng, Yuanjie Zhi, Shuqin Zhan, Rui Li","doi":"10.1186/s12944-025-02664-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Parkinson's disease (PD) is a prevalent neurodegenerative condition, the effect of obesity on PD remains controversial. We aimed to investigate the associations of obesity patterns on PD and all-cause mortality, while developing machine learning (ML)-driven predictive and prognostic models for PD.</p><p><strong>Methods: </strong>Fifty-one thousand, three hundred ninety-four adults from the National Health and Nutrition Examination Survey (NHANES) 1999-2018 were classified into four obesity patterns via body mass index (BMI) and waist circumference (WC). Associations of obesity patterns with PD risk and all-cause mortality were evaluated via multivariable logistic and Cox proportional hazards regression across three adjusted models. Subgroup, sensitivity, and restricted cubic spline (RCS) analyses examined stability, robustness, and nonlinearity. An integrative ML-driven architecture identified key features to develop predictive and prognostic nomograms, validated by the area under the receiver operating characteristic curves (AUCROCs) and calibration curves. Survival differences were analyzed using Kaplan-Meier curves. Shapley additive explanations (SHAP) enhanced model explanation.</p><p><strong>Results: </strong>Compound obesity significantly increased PD risk (Model 1: OR = 1.83, P < 0.001; Model 2: OR = 1.70, P = 0.002; Model 3: OR = 1.71, P = 0.006) yet correlated with reduced all-cause mortality in PD patients (Model 1: HR = 0.43, P = 0.003; Model 2: HR = 0.75, P = 0.428; Model 3: HR = 0.41, P = 0.033). Subgroup analysis revealed only HbA1c-modified association of compound obesity with PD (P<sub>interaction</sub> = 0.031). Sensitivity analyses confirmed robustness (pooled OR = 1.83, P < 0.001; pooled HR = 0.43, P = 0.003). RCS analyses revealed BMI-dependent PD risk escalation (P<sub>nonlinearity</sub> = 0.008, BMI < 45.0 kg/m<sup>2</sup>), inverted U-shaped WC-PD link (P<sub>nonlinearity</sub> < 0.001), and inverse dose-response BMI-mortality relationship (P<sub>nonlinearity</sub> = 0.003), along with multiphasic WC-mortality association (P<sub>Threshold</sub> = 0.555 at 95 cm and P<sub>Threshold</sub> = 0.091 at 118 cm). LASSO + RF identified eight features, achieving moderate performance in PD prediction (SMOTE set: AUCROC = 0.75, Brier = 0.20) and prognosis (train set: AUCROC = 0.72, Brier = 0.22) nomograms, with similar results in the test set (AUCROC = 0.70, Brier = 0.01 for prediction, 0.87 and 0.18 for prognosis). No 24-month survival differences were observed across four obesity patterns (train set: P<sub>log-rank</sub> = 0.73; test set: P<sub>log-rank</sub> = 0.32).</p><p><strong>Conclusions: </strong>This study preliminarily reveals that compound obesity significantly increases PD risk yet paradoxically associates with reduced all-cause mortality in PD patients. Validated predictive and prognostic nomograms for PD achieve relatively robust performances. Nonetheless, extensive longitudinal studies are required to validate these exploratory findings more comprehensively.</p>","PeriodicalId":18073,"journal":{"name":"Lipids in Health and Disease","volume":"24 1","pages":"241"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12273281/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lipids in Health and Disease","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12944-025-02664-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Parkinson's disease (PD) is a prevalent neurodegenerative condition, the effect of obesity on PD remains controversial. We aimed to investigate the associations of obesity patterns on PD and all-cause mortality, while developing machine learning (ML)-driven predictive and prognostic models for PD.
Methods: Fifty-one thousand, three hundred ninety-four adults from the National Health and Nutrition Examination Survey (NHANES) 1999-2018 were classified into four obesity patterns via body mass index (BMI) and waist circumference (WC). Associations of obesity patterns with PD risk and all-cause mortality were evaluated via multivariable logistic and Cox proportional hazards regression across three adjusted models. Subgroup, sensitivity, and restricted cubic spline (RCS) analyses examined stability, robustness, and nonlinearity. An integrative ML-driven architecture identified key features to develop predictive and prognostic nomograms, validated by the area under the receiver operating characteristic curves (AUCROCs) and calibration curves. Survival differences were analyzed using Kaplan-Meier curves. Shapley additive explanations (SHAP) enhanced model explanation.
Results: Compound obesity significantly increased PD risk (Model 1: OR = 1.83, P < 0.001; Model 2: OR = 1.70, P = 0.002; Model 3: OR = 1.71, P = 0.006) yet correlated with reduced all-cause mortality in PD patients (Model 1: HR = 0.43, P = 0.003; Model 2: HR = 0.75, P = 0.428; Model 3: HR = 0.41, P = 0.033). Subgroup analysis revealed only HbA1c-modified association of compound obesity with PD (Pinteraction = 0.031). Sensitivity analyses confirmed robustness (pooled OR = 1.83, P < 0.001; pooled HR = 0.43, P = 0.003). RCS analyses revealed BMI-dependent PD risk escalation (Pnonlinearity = 0.008, BMI < 45.0 kg/m2), inverted U-shaped WC-PD link (Pnonlinearity < 0.001), and inverse dose-response BMI-mortality relationship (Pnonlinearity = 0.003), along with multiphasic WC-mortality association (PThreshold = 0.555 at 95 cm and PThreshold = 0.091 at 118 cm). LASSO + RF identified eight features, achieving moderate performance in PD prediction (SMOTE set: AUCROC = 0.75, Brier = 0.20) and prognosis (train set: AUCROC = 0.72, Brier = 0.22) nomograms, with similar results in the test set (AUCROC = 0.70, Brier = 0.01 for prediction, 0.87 and 0.18 for prognosis). No 24-month survival differences were observed across four obesity patterns (train set: Plog-rank = 0.73; test set: Plog-rank = 0.32).
Conclusions: This study preliminarily reveals that compound obesity significantly increases PD risk yet paradoxically associates with reduced all-cause mortality in PD patients. Validated predictive and prognostic nomograms for PD achieve relatively robust performances. Nonetheless, extensive longitudinal studies are required to validate these exploratory findings more comprehensively.
期刊介绍:
Lipids in Health and Disease is an open access, peer-reviewed, journal that publishes articles on all aspects of lipids: their biochemistry, pharmacology, toxicology, role in health and disease, and the synthesis of new lipid compounds.
Lipids in Health and Disease is aimed at all scientists, health professionals and physicians interested in the area of lipids. Lipids are defined here in their broadest sense, to include: cholesterol, essential fatty acids, saturated fatty acids, phospholipids, inositol lipids, second messenger lipids, enzymes and synthetic machinery that is involved in the metabolism of various lipids in the cells and tissues, and also various aspects of lipid transport, etc. In addition, the journal also publishes research that investigates and defines the role of lipids in various physiological processes, pathology and disease. In particular, the journal aims to bridge the gap between the bench and the clinic by publishing articles that are particularly relevant to human diseases and the role of lipids in the management of various diseases.