{"title":"Construction and validation of a urinary stone composition prediction model based on machine learning.","authors":"Jiangkun Guo, Jinxiao Zhang, Jinhang Zhang, Changbao Xu, Xikun Wang, Changwei Liu","doi":"10.1007/s00240-025-01828-8","DOIUrl":null,"url":null,"abstract":"<p><p>The composition of urinary calculi serves as a critical determinant for personalized surgical strategies; however, such compositional data are often unavailable preoperatively. This study aims to develop a machine learning-based preoperative prediction model for stone composition and evaluate its clinical utility. A retrospective cohort study design was employed to include patients with urinary calculi admitted to the Department of Urology at the Second Affiliated Hospital of Zhengzhou University from 2019 to 2024. Feature selection was performed using least absolute shrinkage and selection operator (LASSO) regression combined with multivariate logistic regression, and a binary prediction model for urinary calculi was subsequently constructed. Model validation was conducted using metrics such as the area under the curve (AUC), while Shapley Additive Explanations(SHAP) values were applied to interpret the predictive outcomes. Among 708 eligible patients, distinct prediction models were established for four stone types: calcium oxalate stones: Logistic regression achieved optimal performance (AUC = 0.845), with maximum stone CT value, 24-hour urinary oxalate, and stone size as top predictors (SHAP-ranked); infection stones: Logistic regression (AUC = 0.864) prioritized stone size, urinary pH, and recurrence history; uric acid stones: LASSO-ridge-elastic net model demonstrated exceptional accuracy (AUC = 0.961), driven by maximum CT value, 24-hour oxalate, and urinary calcium; calcium-containing stones: Logistic regression attained better prediction (AUC = 0.953), relying on CT value, 24-hour calcium, and stone size. This study developed a machine learning prediction model based on multi-algorithm integration, achieving accurate preoperative discrimination of urinary stone composition. The integration of key imaging features with metabolic indicators enhanced the model's predictive performance.</p>","PeriodicalId":23411,"journal":{"name":"Urolithiasis","volume":"53 1","pages":"154"},"PeriodicalIF":2.2000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Urolithiasis","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00240-025-01828-8","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The composition of urinary calculi serves as a critical determinant for personalized surgical strategies; however, such compositional data are often unavailable preoperatively. This study aims to develop a machine learning-based preoperative prediction model for stone composition and evaluate its clinical utility. A retrospective cohort study design was employed to include patients with urinary calculi admitted to the Department of Urology at the Second Affiliated Hospital of Zhengzhou University from 2019 to 2024. Feature selection was performed using least absolute shrinkage and selection operator (LASSO) regression combined with multivariate logistic regression, and a binary prediction model for urinary calculi was subsequently constructed. Model validation was conducted using metrics such as the area under the curve (AUC), while Shapley Additive Explanations(SHAP) values were applied to interpret the predictive outcomes. Among 708 eligible patients, distinct prediction models were established for four stone types: calcium oxalate stones: Logistic regression achieved optimal performance (AUC = 0.845), with maximum stone CT value, 24-hour urinary oxalate, and stone size as top predictors (SHAP-ranked); infection stones: Logistic regression (AUC = 0.864) prioritized stone size, urinary pH, and recurrence history; uric acid stones: LASSO-ridge-elastic net model demonstrated exceptional accuracy (AUC = 0.961), driven by maximum CT value, 24-hour oxalate, and urinary calcium; calcium-containing stones: Logistic regression attained better prediction (AUC = 0.953), relying on CT value, 24-hour calcium, and stone size. This study developed a machine learning prediction model based on multi-algorithm integration, achieving accurate preoperative discrimination of urinary stone composition. The integration of key imaging features with metabolic indicators enhanced the model's predictive performance.
期刊介绍:
Official Journal of the International Urolithiasis Society
The journal aims to publish original articles in the fields of clinical and experimental investigation only within the sphere of urolithiasis and its related areas of research. The journal covers all aspects of urolithiasis research including the diagnosis, epidemiology, pathogenesis, genetics, clinical biochemistry, open and non-invasive surgical intervention, nephrological investigation, chemistry and prophylaxis of the disorder. The Editor welcomes contributions on topics of interest to urologists, nephrologists, radiologists, clinical biochemists, epidemiologists, nutritionists, basic scientists and nurses working in that field.
Contributions may be submitted as full-length articles or as rapid communications in the form of Letters to the Editor. Articles should be original and should contain important new findings from carefully conducted studies designed to produce statistically significant data. Please note that we no longer publish articles classified as Case Reports. Editorials and review articles may be published by invitation from the Editorial Board. All submissions are peer-reviewed. Through an electronic system for the submission and review of manuscripts, the Editor and Associate Editors aim to make publication accessible as quickly as possible to a large number of readers throughout the world.