Kaiyue Zhi, Yanmei Wang, Lei Yan, Feng Hou, Jie Wu, Shuo Zhang, He Zhu, Lianzi Zhao, Ning Wang, Xia Zhao, Xianjun Li, Yicong Wang, Chengcheng Chen, Nan Wang, Yuchao Xu, Guangjie Yang, Pei Nie
{"title":"The interpretable CT-based vision transformer model for preoperative prediction of clear cell renal cell carcinoma SSIGN score and outcome.","authors":"Kaiyue Zhi, Yanmei Wang, Lei Yan, Feng Hou, Jie Wu, Shuo Zhang, He Zhu, Lianzi Zhao, Ning Wang, Xia Zhao, Xianjun Li, Yicong Wang, Chengcheng Chen, Nan Wang, Yuchao Xu, Guangjie Yang, Pei Nie","doi":"10.1186/s13244-025-01972-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>To develop and validate an interpretable CT-based vision transformer (ViT) model for preoperative prediction of the stage, size, grade, and necrosis (SSIGN) and outcome in clear cell renal cell carcinoma (ccRCC) patients.</p><p><strong>Methods: </strong>Eight hundred forty-five ccRCC patients from multiple centers were retrospectively enrolled. For each patient, 768 ViT features were extracted in the cortical medullary phase (CMP) and renal parenchymal phase (RPP) images, respectively. The CMP ViT model (CVM), RPP ViT model (RVM), and CMP-RPP combined ViT model (CRVM) were constructed to predict the SSIGN in ccRCC patients. The area under the receiver operating characteristic curve (AUC) was used to evaluate the performance of each model. Decision curve analysis (DCA) was used to evaluate the net clinical benefit. The endpoint was the progression-free survival (PFS). Kaplan-Meier survival analysis was used to assess the association between model-predicted SSIGN and PFS. The SHAP approach was applied to determine the prediction process of the CRVM.</p><p><strong>Results: </strong>The CVM, RVM, and CRVM demonstrated good performance in predicting SSIGN, with a high AUC of 0.859, 0.883, and 0.895, respectively, in the test cohort. DCA demonstrated the CRVM performed best in clinical net benefit. In predicting PFS, CRVM achieved a higher Harrell's concordance index (C-index, 0.840) than the CVM (0.719) and RVM (0.773) in the test cohort. The SHAP helped us understand the impact of ViT features on CRVM's SSIGN prediction from a global and individual perspective.</p><p><strong>Conclusion: </strong>The interpretable CT-based CRVM may serve as a non-invasive biomarker in predicting the SSIGN and outcome of ccRCC.</p><p><strong>Critical relevance statement: </strong>Our findings outline the potential of an interpretable CT-based ViT biomarker for predicting the SSIGN score and outcome of ccRCC, which might facilitate patient counseling and assist clinicians in therapy decision-making for individual cases.</p><p><strong>Key points: </strong>Current first-line imaging lacks preoperative prediction of the SSIGN score for ccRCC patients. The ViT model could predict the SSIGN score and outcome of ccRCC patients. This study can facilitate the development of personalized treatment for ccRCC patients.</p>","PeriodicalId":13639,"journal":{"name":"Insights into Imaging","volume":"16 1","pages":"98"},"PeriodicalIF":4.1000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12064486/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Insights into Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13244-025-01972-0","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: To develop and validate an interpretable CT-based vision transformer (ViT) model for preoperative prediction of the stage, size, grade, and necrosis (SSIGN) and outcome in clear cell renal cell carcinoma (ccRCC) patients.
Methods: Eight hundred forty-five ccRCC patients from multiple centers were retrospectively enrolled. For each patient, 768 ViT features were extracted in the cortical medullary phase (CMP) and renal parenchymal phase (RPP) images, respectively. The CMP ViT model (CVM), RPP ViT model (RVM), and CMP-RPP combined ViT model (CRVM) were constructed to predict the SSIGN in ccRCC patients. The area under the receiver operating characteristic curve (AUC) was used to evaluate the performance of each model. Decision curve analysis (DCA) was used to evaluate the net clinical benefit. The endpoint was the progression-free survival (PFS). Kaplan-Meier survival analysis was used to assess the association between model-predicted SSIGN and PFS. The SHAP approach was applied to determine the prediction process of the CRVM.
Results: The CVM, RVM, and CRVM demonstrated good performance in predicting SSIGN, with a high AUC of 0.859, 0.883, and 0.895, respectively, in the test cohort. DCA demonstrated the CRVM performed best in clinical net benefit. In predicting PFS, CRVM achieved a higher Harrell's concordance index (C-index, 0.840) than the CVM (0.719) and RVM (0.773) in the test cohort. The SHAP helped us understand the impact of ViT features on CRVM's SSIGN prediction from a global and individual perspective.
Conclusion: The interpretable CT-based CRVM may serve as a non-invasive biomarker in predicting the SSIGN and outcome of ccRCC.
Critical relevance statement: Our findings outline the potential of an interpretable CT-based ViT biomarker for predicting the SSIGN score and outcome of ccRCC, which might facilitate patient counseling and assist clinicians in therapy decision-making for individual cases.
Key points: Current first-line imaging lacks preoperative prediction of the SSIGN score for ccRCC patients. The ViT model could predict the SSIGN score and outcome of ccRCC patients. This study can facilitate the development of personalized treatment for ccRCC patients.
期刊介绍:
Insights into Imaging (I³) is a peer-reviewed open access journal published under the brand SpringerOpen. All content published in the journal is freely available online to anyone, anywhere!
I³ continuously updates scientific knowledge and progress in best-practice standards in radiology through the publication of original articles and state-of-the-art reviews and opinions, along with recommendations and statements from the leading radiological societies in Europe.
Founded by the European Society of Radiology (ESR), I³ creates a platform for educational material, guidelines and recommendations, and a forum for topics of controversy.
A balanced combination of review articles, original papers, short communications from European radiological congresses and information on society matters makes I³ an indispensable source for current information in this field.
I³ is owned by the ESR, however authors retain copyright to their article according to the Creative Commons Attribution License (see Copyright and License Agreement). All articles can be read, redistributed and reused for free, as long as the author of the original work is cited properly.
The open access fees (article-processing charges) for this journal are kindly sponsored by ESR for all Members.
The journal went open access in 2012, which means that all articles published since then are freely available online.