Hojin Moon, Phan N Nguyen, Jaehee Park, Minho Lee, Sohyul Ahn
{"title":"AI-Guided Chemotherapy Optimization in Lung Cancer Using Genomic and Survival Data.","authors":"Hojin Moon, Phan N Nguyen, Jaehee Park, Minho Lee, Sohyul Ahn","doi":"10.3390/jpm15060218","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background</b>: Adjuvant chemotherapy (ACT) can improve survival outcomes for patients with early-stage non-small cell lung cancer (NSCLC), but its benefit varies significantly across individuals. Identifying patients who are likely to benefit from ACT remains a critical challenge in precision oncology. <b>Methods</b>: We constructed a meta-database from two publicly available NSCLC gene expression datasets (GSE37745 and GSE29013) to address population heterogeneity. Feature selection was performed using Cox-based univariate screening with leave-one-out cross-validation. We then developed and compared three survival modeling frameworks: bagging with elastic net penalized Cox regression, Random Survival Forests (RSF), and DeepSurv neural survival networks. All models incorporated clinical covariates and selected genomic features to predict survival and recommend ACT versus observation (OBS). <b>Results</b>: Across 155 patients, RSF achieved the highest predictive performance, with a test concordance index (C-index) of0.885. Model-based recommendations were associated with improved survival in both training and test datasets, as confirmed by Kaplan-Meier analysis. Key genomic features identified included TTR, MTURN, and ETV3, suggesting their potential relevance in treatment response stratification. DeepSurv demonstrated strong predictive accuracy (C-index = 0.982) but less distinct survival curve separation compared to RSF. <b>Conclusions</b>: Our findings demonstrate that machine learning-driven survival models, particularly RSF, can effectively identify NSCLC patients who may benefit from ACT. This approach supports data-driven, individualized chemotherapy decision-making and contributes to advancing personalized treatment strategies in early-stage NSCLC.</p>","PeriodicalId":16722,"journal":{"name":"Journal of Personalized Medicine","volume":"15 6","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12194319/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Personalized Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/jpm15060218","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Adjuvant chemotherapy (ACT) can improve survival outcomes for patients with early-stage non-small cell lung cancer (NSCLC), but its benefit varies significantly across individuals. Identifying patients who are likely to benefit from ACT remains a critical challenge in precision oncology. Methods: We constructed a meta-database from two publicly available NSCLC gene expression datasets (GSE37745 and GSE29013) to address population heterogeneity. Feature selection was performed using Cox-based univariate screening with leave-one-out cross-validation. We then developed and compared three survival modeling frameworks: bagging with elastic net penalized Cox regression, Random Survival Forests (RSF), and DeepSurv neural survival networks. All models incorporated clinical covariates and selected genomic features to predict survival and recommend ACT versus observation (OBS). Results: Across 155 patients, RSF achieved the highest predictive performance, with a test concordance index (C-index) of0.885. Model-based recommendations were associated with improved survival in both training and test datasets, as confirmed by Kaplan-Meier analysis. Key genomic features identified included TTR, MTURN, and ETV3, suggesting their potential relevance in treatment response stratification. DeepSurv demonstrated strong predictive accuracy (C-index = 0.982) but less distinct survival curve separation compared to RSF. Conclusions: Our findings demonstrate that machine learning-driven survival models, particularly RSF, can effectively identify NSCLC patients who may benefit from ACT. This approach supports data-driven, individualized chemotherapy decision-making and contributes to advancing personalized treatment strategies in early-stage NSCLC.
期刊介绍:
Journal of Personalized Medicine (JPM; ISSN 2075-4426) is an international, open access journal aimed at bringing all aspects of personalized medicine to one platform. JPM publishes cutting edge, innovative preclinical and translational scientific research and technologies related to personalized medicine (e.g., pharmacogenomics/proteomics, systems biology). JPM recognizes that personalized medicine—the assessment of genetic, environmental and host factors that cause variability of individuals—is a challenging, transdisciplinary topic that requires discussions from a range of experts. For a comprehensive perspective of personalized medicine, JPM aims to integrate expertise from the molecular and translational sciences, therapeutics and diagnostics, as well as discussions of regulatory, social, ethical and policy aspects. We provide a forum to bring together academic and clinical researchers, biotechnology, diagnostic and pharmaceutical companies, health professionals, regulatory and ethical experts, and government and regulatory authorities.