Caroline W Grant, Jerry Li, Swan Lin, Dana Nickens, Daniele Ouellet, Mohamed H Shahin
{"title":"Application of Machine Learning for Predicting Progression-Free and Overall Survival in Patients With Renal Cell Carcinoma.","authors":"Caroline W Grant, Jerry Li, Swan Lin, Dana Nickens, Daniele Ouellet, Mohamed H Shahin","doi":"10.1111/cts.70348","DOIUrl":null,"url":null,"abstract":"<p><p>Patient outcomes in advanced renal cell carcinoma (RCC) remain poor, with five-year survival rates ranging from ~10% to 30%. Early projections of therapeutic outcomes could optimize precision medicine and accelerate drug development. While machine learning (ML) models integrating tumor growth inhibition (TGI) metrics have improved survival predictions over traditional models, their application in RCC remains unexplored. Herein, we used TGI metrics and baseline data to evaluate parametric (PM) and semi-parametric (SPM) survival models alongside ML approaches for predicting progression-free (PFS) and overall survival (OS) in 1839 RCC patients from four trials (evaluating sunitinib, axitinib, sorafenib, interferon-alpha, and avelumab + axitinib). Data were split into training (70%) and testing (30%), and feature selection was used to determine parsimonious and robust models. Bootstrap resampling (n = 100) was employed for models' validation, and performance was assessed using C-index and Integrated Brier Score. In brief, training data results demonstrated that tree-based ML models (random survival forest (RSF) and XGBoost) outperformed PM and SPM models in predicting PFS (C-index: 0.783-0.785 vs. 0.725-0.738 for PM and SPM; p < 0.05) and OS (C-index: 0.77-0.867 vs. 0.750-0.758 for PM and SPM; p < 0.05), with RSF achieving better prediction of PFS and OS using only 3-5 covariates, compared to 9-35 with other tested methods. Tree-based methods were also superior in the testing data. SHapley Additive exPlanations revealed nonlinear relationships among top predictors, including TGI metrics, underscoring the ability of tree-based methods to capture complex prognostic interactions. Further validation is required to confirm models' generalizability to additional therapies and patients with differing tumor severity.</p>","PeriodicalId":50610,"journal":{"name":"Cts-Clinical and Translational Science","volume":"18 10","pages":"e70348"},"PeriodicalIF":2.8000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cts-Clinical and Translational Science","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/cts.70348","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Patient outcomes in advanced renal cell carcinoma (RCC) remain poor, with five-year survival rates ranging from ~10% to 30%. Early projections of therapeutic outcomes could optimize precision medicine and accelerate drug development. While machine learning (ML) models integrating tumor growth inhibition (TGI) metrics have improved survival predictions over traditional models, their application in RCC remains unexplored. Herein, we used TGI metrics and baseline data to evaluate parametric (PM) and semi-parametric (SPM) survival models alongside ML approaches for predicting progression-free (PFS) and overall survival (OS) in 1839 RCC patients from four trials (evaluating sunitinib, axitinib, sorafenib, interferon-alpha, and avelumab + axitinib). Data were split into training (70%) and testing (30%), and feature selection was used to determine parsimonious and robust models. Bootstrap resampling (n = 100) was employed for models' validation, and performance was assessed using C-index and Integrated Brier Score. In brief, training data results demonstrated that tree-based ML models (random survival forest (RSF) and XGBoost) outperformed PM and SPM models in predicting PFS (C-index: 0.783-0.785 vs. 0.725-0.738 for PM and SPM; p < 0.05) and OS (C-index: 0.77-0.867 vs. 0.750-0.758 for PM and SPM; p < 0.05), with RSF achieving better prediction of PFS and OS using only 3-5 covariates, compared to 9-35 with other tested methods. Tree-based methods were also superior in the testing data. SHapley Additive exPlanations revealed nonlinear relationships among top predictors, including TGI metrics, underscoring the ability of tree-based methods to capture complex prognostic interactions. Further validation is required to confirm models' generalizability to additional therapies and patients with differing tumor severity.
期刊介绍:
Clinical and Translational Science (CTS), an official journal of the American Society for Clinical Pharmacology and Therapeutics, highlights original translational medicine research that helps bridge laboratory discoveries with the diagnosis and treatment of human disease. Translational medicine is a multi-faceted discipline with a focus on translational therapeutics. In a broad sense, translational medicine bridges across the discovery, development, regulation, and utilization spectrum. Research may appear as Full Articles, Brief Reports, Commentaries, Phase Forwards (clinical trials), Reviews, or Tutorials. CTS also includes invited didactic content that covers the connections between clinical pharmacology and translational medicine. Best-in-class methodologies and best practices are also welcomed as Tutorials. These additional features provide context for research articles and facilitate understanding for a wide array of individuals interested in clinical and translational science. CTS welcomes high quality, scientifically sound, original manuscripts focused on clinical pharmacology and translational science, including animal, in vitro, in silico, and clinical studies supporting the breadth of drug discovery, development, regulation and clinical use of both traditional drugs and innovative modalities.