An explainable AI approach to surgical and radiotherapy interventions for optimized treatment decision-making in early-stage non-small cell lung cancer.

IF 3.5 2区 医学 Q2 ONCOLOGY
Translational lung cancer research Pub Date : 2025-06-30 Epub Date: 2025-06-26 DOI:10.21037/tlcr-2025-152
Qunzhe Ding, Chendong Wang, Zhe Zhang, Junjie Liao, Lufan Tang, Jiade Jay Lu, Zhibo Tan
{"title":"An explainable AI approach to surgical and radiotherapy interventions for optimized treatment decision-making in early-stage non-small cell lung cancer.","authors":"Qunzhe Ding, Chendong Wang, Zhe Zhang, Junjie Liao, Lufan Tang, Jiade Jay Lu, Zhibo Tan","doi":"10.21037/tlcr-2025-152","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>For individual patients with early-stage non-small cell lung cancer (NSCLC), robust evidence to guide treatment selection between surgery and stereotactic body radiotherapy (SBRT) remains limited. This study aimed to develop machine learning-driven predictive models using the Surveillance, Epidemiology, and End Results (SEER) database to evaluate the efficacy of these treatments, thereby providing a data-driven foundation for personalized treatment decisions.</p><p><strong>Methods: </strong>Stage I or IIA NSCLC patients diagnosed between 2012 and 2018 were identified from the SEER database. Six machine learning models, spanning from classical to advanced approaches, were employed to predict 1-, 3-, and 5-year survival, with their performance assessed using seven metrics. The SHAP (SHapley Additive exPlanations) interpretability method was employed to explain the optimal predictive model, focusing on analyzing the differences between surgical and radiotherapy treatments under various factors, providing valuable insights to optimizing treatment strategies. Patients diagnosed between 2019 and 2021 were selected as an external validation cohort to assess the generalizability and robustness of the previously developed models.</p><p><strong>Results: </strong>A total of 26,566 patients were included in the training and internal testing cohort of the study. LightGBM (light gradient boosting machine) outperformed other models across most metrics for survival predictions. The SHAP interpretability analysis revealed that tumor location, tumor size, pathology, and treatment type were significant factors for 3- and 5-year predictions. Furthermore, at 3- and 5-year intervals, the efficacy of radiotherapy was comparable to surgery for left upper lobe tumors, while radiotherapy appeared slightly inferior to surgery for right lower lobe tumors. Meanwhile, for tumors <1.5 cm or 3.5-5 cm, lobectomy exhibited the best efficacy, while for tumors measuring 1.5-3.5 cm, the efficacy of lobectomy seemed to be slightly inferior to radiotherapy and sublobar resection. For adenocarcinoma and squamous cell carcinoma, radiotherapy and lobectomy could be regarded as the preferred treatment methods, respectively. Besides, for patients <45 or >75 years old, sublobar resection showed the best efficacy at the 5-year interval. The external validation cohort of 11,927 patients further confirmed the effectiveness of the models in predicting 1-, 3-, and 5-year survival outcomes, reinforcing their reliability and applicability in clinical decision-making.</p><p><strong>Conclusions: </strong>This study provides valuable insights into treatment decision-making for stages I and IIA NSCLC. The LightGBM model is a reliable tool for survival prediction for early-stage NSCLC. By utilizing this model, it can be concluded that tumor location, tumor size, pathological type and age are vital factors significantly influencing the choice of treatment methods.</p>","PeriodicalId":23271,"journal":{"name":"Translational lung cancer research","volume":"14 6","pages":"2011-2030"},"PeriodicalIF":3.5000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12261364/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational lung cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tlcr-2025-152","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/26 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: For individual patients with early-stage non-small cell lung cancer (NSCLC), robust evidence to guide treatment selection between surgery and stereotactic body radiotherapy (SBRT) remains limited. This study aimed to develop machine learning-driven predictive models using the Surveillance, Epidemiology, and End Results (SEER) database to evaluate the efficacy of these treatments, thereby providing a data-driven foundation for personalized treatment decisions.

Methods: Stage I or IIA NSCLC patients diagnosed between 2012 and 2018 were identified from the SEER database. Six machine learning models, spanning from classical to advanced approaches, were employed to predict 1-, 3-, and 5-year survival, with their performance assessed using seven metrics. The SHAP (SHapley Additive exPlanations) interpretability method was employed to explain the optimal predictive model, focusing on analyzing the differences between surgical and radiotherapy treatments under various factors, providing valuable insights to optimizing treatment strategies. Patients diagnosed between 2019 and 2021 were selected as an external validation cohort to assess the generalizability and robustness of the previously developed models.

Results: A total of 26,566 patients were included in the training and internal testing cohort of the study. LightGBM (light gradient boosting machine) outperformed other models across most metrics for survival predictions. The SHAP interpretability analysis revealed that tumor location, tumor size, pathology, and treatment type were significant factors for 3- and 5-year predictions. Furthermore, at 3- and 5-year intervals, the efficacy of radiotherapy was comparable to surgery for left upper lobe tumors, while radiotherapy appeared slightly inferior to surgery for right lower lobe tumors. Meanwhile, for tumors <1.5 cm or 3.5-5 cm, lobectomy exhibited the best efficacy, while for tumors measuring 1.5-3.5 cm, the efficacy of lobectomy seemed to be slightly inferior to radiotherapy and sublobar resection. For adenocarcinoma and squamous cell carcinoma, radiotherapy and lobectomy could be regarded as the preferred treatment methods, respectively. Besides, for patients <45 or >75 years old, sublobar resection showed the best efficacy at the 5-year interval. The external validation cohort of 11,927 patients further confirmed the effectiveness of the models in predicting 1-, 3-, and 5-year survival outcomes, reinforcing their reliability and applicability in clinical decision-making.

Conclusions: This study provides valuable insights into treatment decision-making for stages I and IIA NSCLC. The LightGBM model is a reliable tool for survival prediction for early-stage NSCLC. By utilizing this model, it can be concluded that tumor location, tumor size, pathological type and age are vital factors significantly influencing the choice of treatment methods.

一种可解释的人工智能方法在早期非小细胞肺癌的手术和放疗干预中优化治疗决策。
背景:对于个体早期非小细胞肺癌(NSCLC)患者,指导手术和立体定向放疗(SBRT)治疗选择的有力证据仍然有限。本研究旨在利用监测、流行病学和最终结果(SEER)数据库开发机器学习驱动的预测模型,以评估这些治疗的疗效,从而为个性化治疗决策提供数据驱动的基础。方法:从SEER数据库中确定2012年至2018年诊断的I期或IIA期NSCLC患者。从经典到先进的六种机器学习模型被用于预测1年、3年和5年的生存,并使用七个指标对其性能进行评估。采用SHAP (SHapley Additive exPlanations)可解释性方法解释最优预测模型,重点分析各种因素下手术与放疗治疗的差异,为优化治疗策略提供有价值的见解。选择2019年至2021年间诊断的患者作为外部验证队列,以评估先前开发的模型的普遍性和稳健性。结果:该研究的培训和内测队列共纳入26,566例患者。LightGBM(光梯度增强机)在大多数生存预测指标上优于其他模型。SHAP可解释性分析显示,肿瘤位置、肿瘤大小、病理和治疗类型是3年和5年预测的重要因素。此外,在3年和5年的间隔时间内,放射治疗对左上叶肿瘤的疗效与手术相当,而放射治疗对右下叶肿瘤的疗效略低于手术。同时,对于75岁的肿瘤,叶下切除术在5年的间隔时间内效果最好。11,927例患者的外部验证队列进一步证实了模型在预测1年、3年和5年生存结局方面的有效性,增强了模型在临床决策中的可靠性和适用性。结论:本研究为I期和IIA期NSCLC的治疗决策提供了有价值的见解。LightGBM模型是预测早期非小细胞肺癌生存的可靠工具。利用该模型可以得出肿瘤位置、肿瘤大小、病理类型和年龄是影响治疗方法选择的重要因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
2.50%
发文量
137
期刊介绍: Translational Lung Cancer Research(TLCR, Transl Lung Cancer Res, Print ISSN 2218-6751; Online ISSN 2226-4477) is an international, peer-reviewed, open-access journal, which was founded in March 2012. TLCR is indexed by PubMed/PubMed Central and the Chemical Abstracts Service (CAS) Databases. It is published quarterly the first year, and published bimonthly since February 2013. It provides practical up-to-date information on prevention, early detection, diagnosis, and treatment of lung cancer. Specific areas of its interest include, but not limited to, multimodality therapy, markers, imaging, tumor biology, pathology, chemoprevention, and technical advances related to lung cancer.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信