An interpretable machine learning model using multimodal pretreatment features predicts pathological complete response to neoadjuvant immunochemotherapy in esophageal squamous cell carcinoma.

IF 5.9 2区 医学 Q1 IMMUNOLOGY
Frontiers in Immunology Pub Date : 2025-09-16 eCollection Date: 2025-01-01 DOI:10.3389/fimmu.2025.1660897
Xueping Wang, Wencheng Tan, Hui Sheng, Wenjia Zhou, Hailin Zheng, Kewei Huang, Jinfei Lin, Songhe Guo, Minjie Mao
{"title":"An interpretable machine learning model using multimodal pretreatment features predicts pathological complete response to neoadjuvant immunochemotherapy in esophageal squamous cell carcinoma.","authors":"Xueping Wang, Wencheng Tan, Hui Sheng, Wenjia Zhou, Hailin Zheng, Kewei Huang, Jinfei Lin, Songhe Guo, Minjie Mao","doi":"10.3389/fimmu.2025.1660897","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Although neoadjuvant immunochemotherapy (nICT) has revolutionized the management of locally advanced esophageal squamous cell carcinoma (ESCC), the inability to accurately predict pathological complete response (pCR) remains a major barrier to treatment personalization. We aimed to develop and validate an interpretable machine learning (ML) model using pretreatment multimodal features to predict pCR prior to nICT initiation.</p><p><strong>Methods: </strong>In this retrospective study, 114 ESCC patients receiving nICT were randomly allocated into training (n=81) and validation (n=33) cohorts (7:3 ratio). Predictors of pCR were identified from pretreatment clinical variables, endoscopic ultrasonography, and hematological biomarkers via least absolute shrinkage and selection operator (LASSO) regression. Eight machine learning algorithms were implemented to construct prediction models. Model performance was assessed by area under the receiver operating characteristic curve (AUC), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Shapley Additive Explanations (SHAP) provided feature importance and model interpretability.</p><p><strong>Results: </strong>Following feature selection, 17 variables were incorporated into model construction. The Random Forest (RF) model demonstrated perfect discrimination in the training cohort (AUC = 1.000, sensitivity = 1.000, specificity = 1.000, PPV = 1.000, NPV = 1.000), while maintaining robust predictive ability in the independent validation cohort (AUC = 0.913, sensitivity = 0.733, specificity = 0.889, PPV = 0.846, NPV = 0.800). Decision curve analysis (DCA) confirmed favorable clinical utility. SHAP analysis identified alcohol consumption, circumferential involvement ≥50%, elevated neutrophil-to-lymphocyte ratio (NLR), C-reactive protein (CRP), and alanine aminotransferase (ALT) as the key contributors to pCR prediction.</p><p><strong>Conclusions: </strong>We established a clinically applicable, interpretable ML model that accurately predicts pCR to nICT in ESCC by integrating multimodal pretreatment data. This tool may optimize patient selection for nICT and advance precision therapy paradigms.</p>","PeriodicalId":12622,"journal":{"name":"Frontiers in Immunology","volume":"16 ","pages":"1660897"},"PeriodicalIF":5.9000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12479532/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Immunology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fimmu.2025.1660897","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Although neoadjuvant immunochemotherapy (nICT) has revolutionized the management of locally advanced esophageal squamous cell carcinoma (ESCC), the inability to accurately predict pathological complete response (pCR) remains a major barrier to treatment personalization. We aimed to develop and validate an interpretable machine learning (ML) model using pretreatment multimodal features to predict pCR prior to nICT initiation.

Methods: In this retrospective study, 114 ESCC patients receiving nICT were randomly allocated into training (n=81) and validation (n=33) cohorts (7:3 ratio). Predictors of pCR were identified from pretreatment clinical variables, endoscopic ultrasonography, and hematological biomarkers via least absolute shrinkage and selection operator (LASSO) regression. Eight machine learning algorithms were implemented to construct prediction models. Model performance was assessed by area under the receiver operating characteristic curve (AUC), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Shapley Additive Explanations (SHAP) provided feature importance and model interpretability.

Results: Following feature selection, 17 variables were incorporated into model construction. The Random Forest (RF) model demonstrated perfect discrimination in the training cohort (AUC = 1.000, sensitivity = 1.000, specificity = 1.000, PPV = 1.000, NPV = 1.000), while maintaining robust predictive ability in the independent validation cohort (AUC = 0.913, sensitivity = 0.733, specificity = 0.889, PPV = 0.846, NPV = 0.800). Decision curve analysis (DCA) confirmed favorable clinical utility. SHAP analysis identified alcohol consumption, circumferential involvement ≥50%, elevated neutrophil-to-lymphocyte ratio (NLR), C-reactive protein (CRP), and alanine aminotransferase (ALT) as the key contributors to pCR prediction.

Conclusions: We established a clinically applicable, interpretable ML model that accurately predicts pCR to nICT in ESCC by integrating multimodal pretreatment data. This tool may optimize patient selection for nICT and advance precision therapy paradigms.

使用多模态预处理特征的可解释机器学习模型预测食管鳞状细胞癌新辅助免疫化疗的病理完全反应。
背景:虽然新辅助免疫化疗(nICT)已经彻底改变了局部晚期食管鳞状细胞癌(ESCC)的治疗,但无法准确预测病理完全缓解(pCR)仍然是治疗个性化的主要障碍。我们的目标是开发和验证一个可解释的机器学习(ML)模型,使用预处理多模态特征来预测nICT开始前的pCR。方法:在本回顾性研究中,114例接受nICT治疗的ESCC患者随机分为训练组(n=81)和验证组(n=33),比例为7:3。通过最小绝对收缩和选择算子(LASSO)回归,从预处理临床变量、内窥镜超声检查和血液学生物标志物中确定pCR的预测因子。采用8种机器学习算法构建预测模型。通过受试者工作特征曲线下面积(AUC)、敏感性、特异性、阳性预测值(PPV)和阴性预测值(NPV)评估模型的性能。Shapley加性解释(SHAP)提供了特征重要性和模型可解释性。结果:经过特征选择,17个变量被纳入模型构建。随机森林(Random Forest, RF)模型在训练队列(AUC = 1.000,灵敏度= 1.000,特异性= 1.000,PPV = 1.000, NPV = 1.000)中表现出良好的识别能力,在独立验证队列(AUC = 0.913,灵敏度= 0.733,特异性= 0.889,PPV = 0.846, NPV = 0.800)中保持稳健的预测能力。决策曲线分析(DCA)证实了良好的临床应用。SHAP分析发现,饮酒、周向累及≥50%、中性粒细胞与淋巴细胞比值(NLR)、c反应蛋白(CRP)和丙氨酸转氨酶(ALT)升高是pCR预测的关键因素。结论:通过整合多模态预处理数据,我们建立了一个临床适用的、可解释的ML模型,该模型可以准确预测ESCC中pCR对nICT的影响。该工具可以优化nICT的患者选择,并推进精确治疗范例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.80
自引率
11.00%
发文量
7153
审稿时长
14 weeks
期刊介绍: Frontiers in Immunology is a leading journal in its field, publishing rigorously peer-reviewed research across basic, translational and clinical immunology. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide. Frontiers in Immunology is the official Journal of the International Union of Immunological Societies (IUIS). Encompassing the entire field of Immunology, this journal welcomes papers that investigate basic mechanisms of immune system development and function, with a particular emphasis given to the description of the clinical and immunological phenotype of human immune disorders, and on the definition of their molecular basis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信