An explainable AI approach to surgical and radiotherapy interventions for optimized treatment decision-making in early-stage non-small cell lung cancer.
Qunzhe Ding, Chendong Wang, Zhe Zhang, Junjie Liao, Lufan Tang, Jiade Jay Lu, Zhibo Tan
{"title":"An explainable AI approach to surgical and radiotherapy interventions for optimized treatment decision-making in early-stage non-small cell lung cancer.","authors":"Qunzhe Ding, Chendong Wang, Zhe Zhang, Junjie Liao, Lufan Tang, Jiade Jay Lu, Zhibo Tan","doi":"10.21037/tlcr-2025-152","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>For individual patients with early-stage non-small cell lung cancer (NSCLC), robust evidence to guide treatment selection between surgery and stereotactic body radiotherapy (SBRT) remains limited. This study aimed to develop machine learning-driven predictive models using the Surveillance, Epidemiology, and End Results (SEER) database to evaluate the efficacy of these treatments, thereby providing a data-driven foundation for personalized treatment decisions.</p><p><strong>Methods: </strong>Stage I or IIA NSCLC patients diagnosed between 2012 and 2018 were identified from the SEER database. Six machine learning models, spanning from classical to advanced approaches, were employed to predict 1-, 3-, and 5-year survival, with their performance assessed using seven metrics. The SHAP (SHapley Additive exPlanations) interpretability method was employed to explain the optimal predictive model, focusing on analyzing the differences between surgical and radiotherapy treatments under various factors, providing valuable insights to optimizing treatment strategies. Patients diagnosed between 2019 and 2021 were selected as an external validation cohort to assess the generalizability and robustness of the previously developed models.</p><p><strong>Results: </strong>A total of 26,566 patients were included in the training and internal testing cohort of the study. LightGBM (light gradient boosting machine) outperformed other models across most metrics for survival predictions. The SHAP interpretability analysis revealed that tumor location, tumor size, pathology, and treatment type were significant factors for 3- and 5-year predictions. Furthermore, at 3- and 5-year intervals, the efficacy of radiotherapy was comparable to surgery for left upper lobe tumors, while radiotherapy appeared slightly inferior to surgery for right lower lobe tumors. Meanwhile, for tumors <1.5 cm or 3.5-5 cm, lobectomy exhibited the best efficacy, while for tumors measuring 1.5-3.5 cm, the efficacy of lobectomy seemed to be slightly inferior to radiotherapy and sublobar resection. For adenocarcinoma and squamous cell carcinoma, radiotherapy and lobectomy could be regarded as the preferred treatment methods, respectively. Besides, for patients <45 or >75 years old, sublobar resection showed the best efficacy at the 5-year interval. The external validation cohort of 11,927 patients further confirmed the effectiveness of the models in predicting 1-, 3-, and 5-year survival outcomes, reinforcing their reliability and applicability in clinical decision-making.</p><p><strong>Conclusions: </strong>This study provides valuable insights into treatment decision-making for stages I and IIA NSCLC. The LightGBM model is a reliable tool for survival prediction for early-stage NSCLC. By utilizing this model, it can be concluded that tumor location, tumor size, pathological type and age are vital factors significantly influencing the choice of treatment methods.</p>","PeriodicalId":23271,"journal":{"name":"Translational lung cancer research","volume":"14 6","pages":"2011-2030"},"PeriodicalIF":3.5000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12261364/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational lung cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tlcr-2025-152","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/26 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: For individual patients with early-stage non-small cell lung cancer (NSCLC), robust evidence to guide treatment selection between surgery and stereotactic body radiotherapy (SBRT) remains limited. This study aimed to develop machine learning-driven predictive models using the Surveillance, Epidemiology, and End Results (SEER) database to evaluate the efficacy of these treatments, thereby providing a data-driven foundation for personalized treatment decisions.
Methods: Stage I or IIA NSCLC patients diagnosed between 2012 and 2018 were identified from the SEER database. Six machine learning models, spanning from classical to advanced approaches, were employed to predict 1-, 3-, and 5-year survival, with their performance assessed using seven metrics. The SHAP (SHapley Additive exPlanations) interpretability method was employed to explain the optimal predictive model, focusing on analyzing the differences between surgical and radiotherapy treatments under various factors, providing valuable insights to optimizing treatment strategies. Patients diagnosed between 2019 and 2021 were selected as an external validation cohort to assess the generalizability and robustness of the previously developed models.
Results: A total of 26,566 patients were included in the training and internal testing cohort of the study. LightGBM (light gradient boosting machine) outperformed other models across most metrics for survival predictions. The SHAP interpretability analysis revealed that tumor location, tumor size, pathology, and treatment type were significant factors for 3- and 5-year predictions. Furthermore, at 3- and 5-year intervals, the efficacy of radiotherapy was comparable to surgery for left upper lobe tumors, while radiotherapy appeared slightly inferior to surgery for right lower lobe tumors. Meanwhile, for tumors <1.5 cm or 3.5-5 cm, lobectomy exhibited the best efficacy, while for tumors measuring 1.5-3.5 cm, the efficacy of lobectomy seemed to be slightly inferior to radiotherapy and sublobar resection. For adenocarcinoma and squamous cell carcinoma, radiotherapy and lobectomy could be regarded as the preferred treatment methods, respectively. Besides, for patients <45 or >75 years old, sublobar resection showed the best efficacy at the 5-year interval. The external validation cohort of 11,927 patients further confirmed the effectiveness of the models in predicting 1-, 3-, and 5-year survival outcomes, reinforcing their reliability and applicability in clinical decision-making.
Conclusions: This study provides valuable insights into treatment decision-making for stages I and IIA NSCLC. The LightGBM model is a reliable tool for survival prediction for early-stage NSCLC. By utilizing this model, it can be concluded that tumor location, tumor size, pathological type and age are vital factors significantly influencing the choice of treatment methods.
期刊介绍:
Translational Lung Cancer Research(TLCR, Transl Lung Cancer Res, Print ISSN 2218-6751; Online ISSN 2226-4477) is an international, peer-reviewed, open-access journal, which was founded in March 2012. TLCR is indexed by PubMed/PubMed Central and the Chemical Abstracts Service (CAS) Databases. It is published quarterly the first year, and published bimonthly since February 2013. It provides practical up-to-date information on prevention, early detection, diagnosis, and treatment of lung cancer. Specific areas of its interest include, but not limited to, multimodality therapy, markers, imaging, tumor biology, pathology, chemoprevention, and technical advances related to lung cancer.