Deep learning for fine-grained molecular-based colorectal cancer classification.

IF 1.5 4区 医学 Q4 ONCOLOGY
Translational cancer research Pub Date : 2025-05-30 Epub Date: 2025-05-08 DOI:10.21037/tcr-2024-2348
Junyu Bian, Yansong Li, Yamei Dang, Yonglin Chen
{"title":"Deep learning for fine-grained molecular-based colorectal cancer classification.","authors":"Junyu Bian, Yansong Li, Yamei Dang, Yonglin Chen","doi":"10.21037/tcr-2024-2348","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Colorectal cancer (CRC) is one of the most common malignancies globally and a major cause of cancer-related deaths. In the molecular diagnosis of CRC, microsatellite instability (MSI) status and mutations in genes such as <i>BRAF</i>, <i>KRAS</i>, and <i>NRAS</i> are important molecular markers. Traditional molecular detection methods are costly and time-consuming. Therefore, this study proposes a fine-grained classification method for CRC based on hematoxylin and eosin (H&E) stained tissue section images combined with deep learning (DL) technology, aiming to provide new insights into the molecular diagnosis of CRC.</p><p><strong>Methods: </strong>In this study, we first collected H&E-stained tissue section images of 383 CRC patients from The First Hospital of Lanzhou University (LZUFH) and constructed the LZUFH_CRC dataset. Then, we proposed a hybrid DL model combining Convolutional Neural Network (CNN) and Vision Transformer (ViT) for fine-grained classification tasks in CRC. The model consists of three parts: a feature extractor, an aggregator, and a classification head. A two-stage training strategy was adopted for model training. Finally, we evaluated the performance of the model on the LZUFH_CRC dataset and compared it with other methods.</p><p><strong>Results: </strong>The results showed that the proposed model achieved an overall accuracy (ACC) of 0.524 and area under the receiver operating characteristic curve (AUC) of 0.791 on the LZUFH_CRC dataset. Among them, the grouping names MSI and NRAS had better classification performance, with F1-scores of 0.724 and 0.514, respectively. Additionally, the study visualized the feature activation maps to show the regions of interest of the model for different input images, finding that the model paid more attention to the transitional areas between tumor and non-tumor regions and the mesenchymal areas of the tumor. Meanwhile, comparisons among different clinical characteristic groups showed that the model did not exhibit significant biases in terms of gender, age and tumor location.</p><p><strong>Conclusions: </strong>This study proposed a fine-grained classification method for CRC based on DL technology, which combines H&E-stained tissue section images with DL technologies such as CNN and ViT, providing new insights into the molecular diagnosis of CRC. Although the performance of the model needs further improvement, the results indicate that DL technology has potential in the molecular detection of CRC. In the future, the research team will continue to optimize the model to improve the ACC and efficiency of fine-grained classification in CRC.</p>","PeriodicalId":23216,"journal":{"name":"Translational cancer research","volume":"14 5","pages":"3035-3046"},"PeriodicalIF":1.5000,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12169992/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tcr-2024-2348","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/8 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Colorectal cancer (CRC) is one of the most common malignancies globally and a major cause of cancer-related deaths. In the molecular diagnosis of CRC, microsatellite instability (MSI) status and mutations in genes such as BRAF, KRAS, and NRAS are important molecular markers. Traditional molecular detection methods are costly and time-consuming. Therefore, this study proposes a fine-grained classification method for CRC based on hematoxylin and eosin (H&E) stained tissue section images combined with deep learning (DL) technology, aiming to provide new insights into the molecular diagnosis of CRC.

Methods: In this study, we first collected H&E-stained tissue section images of 383 CRC patients from The First Hospital of Lanzhou University (LZUFH) and constructed the LZUFH_CRC dataset. Then, we proposed a hybrid DL model combining Convolutional Neural Network (CNN) and Vision Transformer (ViT) for fine-grained classification tasks in CRC. The model consists of three parts: a feature extractor, an aggregator, and a classification head. A two-stage training strategy was adopted for model training. Finally, we evaluated the performance of the model on the LZUFH_CRC dataset and compared it with other methods.

Results: The results showed that the proposed model achieved an overall accuracy (ACC) of 0.524 and area under the receiver operating characteristic curve (AUC) of 0.791 on the LZUFH_CRC dataset. Among them, the grouping names MSI and NRAS had better classification performance, with F1-scores of 0.724 and 0.514, respectively. Additionally, the study visualized the feature activation maps to show the regions of interest of the model for different input images, finding that the model paid more attention to the transitional areas between tumor and non-tumor regions and the mesenchymal areas of the tumor. Meanwhile, comparisons among different clinical characteristic groups showed that the model did not exhibit significant biases in terms of gender, age and tumor location.

Conclusions: This study proposed a fine-grained classification method for CRC based on DL technology, which combines H&E-stained tissue section images with DL technologies such as CNN and ViT, providing new insights into the molecular diagnosis of CRC. Although the performance of the model needs further improvement, the results indicate that DL technology has potential in the molecular detection of CRC. In the future, the research team will continue to optimize the model to improve the ACC and efficiency of fine-grained classification in CRC.

基于细粒度分子的结直肠癌分类的深度学习。
背景:结直肠癌(CRC)是全球最常见的恶性肿瘤之一,也是癌症相关死亡的主要原因之一。在结直肠癌的分子诊断中,BRAF、KRAS、NRAS等基因的微卫星不稳定性(microsatellite instability, MSI)状态和突变是重要的分子标志物。传统的分子检测方法成本高,耗时长。因此,本研究提出了一种基于苏木精和伊红(H&E)染色组织切片图像结合深度学习(DL)技术的CRC细粒度分类方法,旨在为CRC的分子诊断提供新的见解。方法:本研究首先收集兰州大学第一医院383例结直肠癌患者的h&e染色组织切片图像,构建LZUFH_CRC数据集。然后,我们提出了一种结合卷积神经网络(CNN)和视觉变压器(ViT)的混合深度学习模型,用于CRC的细粒度分类任务。该模型由三部分组成:特征提取器、聚合器和分类头。模型训练采用两阶段训练策略。最后,我们在LZUFH_CRC数据集上评估了该模型的性能,并与其他方法进行了比较。结果:该模型在LZUFH_CRC数据集上的总体精度(ACC)为0.524,接收者工作特征曲线下面积(AUC)为0.791。其中,分组名称MSI和NRAS的分类性能较好,f1得分分别为0.724和0.514。此外,研究将特征激活图可视化,显示模型对不同输入图像感兴趣的区域,发现模型更加关注肿瘤与非肿瘤区域之间的过渡区域以及肿瘤的间质区域。同时,不同临床特征组间的比较表明,该模型在性别、年龄和肿瘤部位方面均未出现显著偏倚。结论:本研究提出了一种基于DL技术的CRC细粒度分类方法,将h&e染色组织切片图像与CNN、ViT等DL技术相结合,为CRC的分子诊断提供了新的思路。虽然该模型的性能有待进一步改进,但结果表明DL技术在CRC的分子检测中具有潜力。未来,研究团队将继续对模型进行优化,以提高CRC中细粒度分类的ACC和效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.10
自引率
0.00%
发文量
252
期刊介绍: Translational Cancer Research (Transl Cancer Res TCR; Print ISSN: 2218-676X; Online ISSN 2219-6803; http://tcr.amegroups.com/) is an Open Access, peer-reviewed journal, indexed in Science Citation Index Expanded (SCIE). TCR publishes laboratory studies of novel therapeutic interventions as well as clinical trials which evaluate new treatment paradigms for cancer; results of novel research investigations which bridge the laboratory and clinical settings including risk assessment, cellular and molecular characterization, prevention, detection, diagnosis and treatment of human cancers with the overall goal of improving the clinical care of cancer patients. The focus of TCR is original, peer-reviewed, science-based research that successfully advances clinical medicine toward the goal of improving patients'' quality of life. The editors and an international advisory group of scientists and clinician-scientists as well as other experts will hold TCR articles to the high-quality standards. We accept Original Articles as well as Review Articles, Editorials and Brief Articles.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信