Hybrid deep learning model for automated colorectal cancer detection using local and global feature extraction

IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Ishak Pacal , Omneya Attallah
{"title":"Hybrid deep learning model for automated colorectal cancer detection using local and global feature extraction","authors":"Ishak Pacal ,&nbsp;Omneya Attallah","doi":"10.1016/j.knosys.2025.113625","DOIUrl":null,"url":null,"abstract":"<div><div>Colorectal cancer (CRC) ranks among the most lethal malignancies globally, underscoring the importance of timely and precise diagnosis. Although histopathological examination remains the clinical gold standard, the intricate morphology of tissue samples and inter-observer variability drive the need for robust automated methods. To address these challenges, this paper presents a hybrid deep learning model that integrates InceptionNeXt blocks, enhanced Swin Transformer blocks, and a Residual Multi-Layer Perceptron (ResMLP). In the initial stages, InceptionNeXt blocks employ multi-branch convolutions to capture nuclear morphology, glandular structures, and stromal textures, particularly benefiting limited training data scenarios. Subsequent layers utilize enhanced Swin Transformer blocks with window-based self-attention and shifted windows, effectively modeling long-range dependencies. The ResMLP component further refines feature representation via residual learning. Comprehensive evaluations on two benchmark CRC datasets—NCT-CRC<img>HE-100K and Kather-5K—demonstrated accuracies of 99.96 % and 99.06 %, respectively, outperforming 10 state-of-the-art CNN and 10 ViT-based models. Additionally, Grad-CAM visualizations highlight the critical regions influencing classification decisions, enhancing model interpretability. These results establish the proposed method as a reliable, generalizable, and clinically viable solution for automated CRC detection.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"319 ","pages":"Article 113625"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125006719","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Colorectal cancer (CRC) ranks among the most lethal malignancies globally, underscoring the importance of timely and precise diagnosis. Although histopathological examination remains the clinical gold standard, the intricate morphology of tissue samples and inter-observer variability drive the need for robust automated methods. To address these challenges, this paper presents a hybrid deep learning model that integrates InceptionNeXt blocks, enhanced Swin Transformer blocks, and a Residual Multi-Layer Perceptron (ResMLP). In the initial stages, InceptionNeXt blocks employ multi-branch convolutions to capture nuclear morphology, glandular structures, and stromal textures, particularly benefiting limited training data scenarios. Subsequent layers utilize enhanced Swin Transformer blocks with window-based self-attention and shifted windows, effectively modeling long-range dependencies. The ResMLP component further refines feature representation via residual learning. Comprehensive evaluations on two benchmark CRC datasets—NCT-CRCHE-100K and Kather-5K—demonstrated accuracies of 99.96 % and 99.06 %, respectively, outperforming 10 state-of-the-art CNN and 10 ViT-based models. Additionally, Grad-CAM visualizations highlight the critical regions influencing classification decisions, enhancing model interpretability. These results establish the proposed method as a reliable, generalizable, and clinically viable solution for automated CRC detection.
基于局部和全局特征提取的结直肠癌自动检测混合深度学习模型
结直肠癌(CRC)是全球最致命的恶性肿瘤之一,这凸显了及时准确诊断的重要性。尽管组织病理学检查仍然是临床金标准,但组织样本的复杂形态和观察者之间的可变性推动了对强大的自动化方法的需求。为了解决这些挑战,本文提出了一种混合深度学习模型,该模型集成了InceptionNeXt块、增强型Swin Transformer块和残差多层感知器(ResMLP)。在初始阶段,InceptionNeXt块采用多分支卷积来捕获核形态、腺体结构和基质纹理,特别有利于有限的训练数据场景。后续层利用增强的Swin Transformer块和基于窗口的自关注和移动窗口,有效地建模远程依赖关系。ResMLP组件通过残差学习进一步细化特征表示。对两个基准CRC数据集(nct - crche - 100k和ther- 5k)的综合评估表明,准确率分别为99.96%和99.06%,优于10个最先进的CNN和10个基于vit的模型。此外,Grad-CAM可视化突出了影响分类决策的关键区域,增强了模型的可解释性。这些结果表明,该方法是一种可靠的、可推广的、临床可行的自动CRC检测方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Knowledge-Based Systems
Knowledge-Based Systems 工程技术-计算机:人工智能
CiteScore
14.80
自引率
12.50%
发文量
1245
审稿时长
7.8 months
期刊介绍: Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信