{"title":"Exploring prognostic biomarkers in pathological images of colorectal cancer patients via deep learning","authors":"Binshen Wei, Linqing Li, Yenan Feng, Sihan Liu, Peng Fu, Lin Tian","doi":"10.1002/2056-4538.70003","DOIUrl":null,"url":null,"abstract":"<p>Hematoxylin and eosin (H&E) whole slide images provide valuable information for predicting prognostic outcomes in colorectal cancer (CRC) patients. However, extracting prognostic indicators from pathological images is challenging due to the subtle complexities of phenotypic information. We trained a weakly supervised deep learning model on data from 640 CRC patients in the prostate, lung, colorectal, and ovarian (PLCO) cancer screening trial dataset and validated it using data from 522 CRC patients in the cancer genome atlas (TCGA) dataset. We created the colorectal cancer risk score (CRCRS) to assess patient prognosis, visualized the pathological phenotype of the risk score using Grad-CAM, and employed multiomics data from the TCGA CRC cohort to investigate the potential biological mechanisms underlying the risk score. The overall survival analysis revealed that the CRCRS served as an independent prognostic indicator for both the PLCO cohort (<i>p</i> < 0.001) and the TCGA cohort (<i>p</i> < 0.001), with its predictive efficacy remaining unaffected by the clinical staging system. Additionally, satisfactory chemotherapeutic benefits were observed in stage II/III CRC patients with high CRCRS but not in those with low CRCRS. A pathomics nomogram constructed by integrating the CRCRS with the tumor-node-metastasis (TNM) staging system enhanced prognostic prediction accuracy compared with using the TNM staging system alone. Noteworthy features of the risk score were identified, such as immature tumor mesenchyme, disorganized gland structures, small clusters of cancer cells associated with unfavorable prognosis, and infiltrating inflammatory cells associated with favorable prognosis. The TCGA multiomics data revealed potential correlations between the CRCRS and the activation of energy production and metabolic pathways, the tumor immune microenvironment, and genetic mutations in <i>APC</i>, <i>SMAD2</i>, <i>EEF1AKMT4</i>, <i>EPG5</i>, and <i>TANC1</i>. In summary, our deep learning algorithm identified the CRCRS as a prognostic indicator in CRC, providing a significant approach for prognostic risk stratification and tailoring precise treatment strategies for individual patients.</p>","PeriodicalId":48612,"journal":{"name":"Journal of Pathology Clinical Research","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/2056-4538.70003","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pathology Clinical Research","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/2056-4538.70003","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PATHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Hematoxylin and eosin (H&E) whole slide images provide valuable information for predicting prognostic outcomes in colorectal cancer (CRC) patients. However, extracting prognostic indicators from pathological images is challenging due to the subtle complexities of phenotypic information. We trained a weakly supervised deep learning model on data from 640 CRC patients in the prostate, lung, colorectal, and ovarian (PLCO) cancer screening trial dataset and validated it using data from 522 CRC patients in the cancer genome atlas (TCGA) dataset. We created the colorectal cancer risk score (CRCRS) to assess patient prognosis, visualized the pathological phenotype of the risk score using Grad-CAM, and employed multiomics data from the TCGA CRC cohort to investigate the potential biological mechanisms underlying the risk score. The overall survival analysis revealed that the CRCRS served as an independent prognostic indicator for both the PLCO cohort (p < 0.001) and the TCGA cohort (p < 0.001), with its predictive efficacy remaining unaffected by the clinical staging system. Additionally, satisfactory chemotherapeutic benefits were observed in stage II/III CRC patients with high CRCRS but not in those with low CRCRS. A pathomics nomogram constructed by integrating the CRCRS with the tumor-node-metastasis (TNM) staging system enhanced prognostic prediction accuracy compared with using the TNM staging system alone. Noteworthy features of the risk score were identified, such as immature tumor mesenchyme, disorganized gland structures, small clusters of cancer cells associated with unfavorable prognosis, and infiltrating inflammatory cells associated with favorable prognosis. The TCGA multiomics data revealed potential correlations between the CRCRS and the activation of energy production and metabolic pathways, the tumor immune microenvironment, and genetic mutations in APC, SMAD2, EEF1AKMT4, EPG5, and TANC1. In summary, our deep learning algorithm identified the CRCRS as a prognostic indicator in CRC, providing a significant approach for prognostic risk stratification and tailoring precise treatment strategies for individual patients.
期刊介绍:
The Journal of Pathology: Clinical Research and The Journal of Pathology serve as translational bridges between basic biomedical science and clinical medicine with particular emphasis on, but not restricted to, tissue based studies.
The focus of The Journal of Pathology: Clinical Research is the publication of studies that illuminate the clinical relevance of research in the broad area of the study of disease. Appropriately powered and validated studies with novel diagnostic, prognostic and predictive significance, and biomarker discover and validation, will be welcomed. Studies with a predominantly mechanistic basis will be more appropriate for the companion Journal of Pathology.