{"title":"Hybrid deep learning model for automated colorectal cancer detection using local and global feature extraction","authors":"Ishak Pacal , Omneya Attallah","doi":"10.1016/j.knosys.2025.113625","DOIUrl":null,"url":null,"abstract":"<div><div>Colorectal cancer (CRC) ranks among the most lethal malignancies globally, underscoring the importance of timely and precise diagnosis. Although histopathological examination remains the clinical gold standard, the intricate morphology of tissue samples and inter-observer variability drive the need for robust automated methods. To address these challenges, this paper presents a hybrid deep learning model that integrates InceptionNeXt blocks, enhanced Swin Transformer blocks, and a Residual Multi-Layer Perceptron (ResMLP). In the initial stages, InceptionNeXt blocks employ multi-branch convolutions to capture nuclear morphology, glandular structures, and stromal textures, particularly benefiting limited training data scenarios. Subsequent layers utilize enhanced Swin Transformer blocks with window-based self-attention and shifted windows, effectively modeling long-range dependencies. The ResMLP component further refines feature representation via residual learning. Comprehensive evaluations on two benchmark CRC datasets—NCT-CRC<img>HE-100K and Kather-5K—demonstrated accuracies of 99.96 % and 99.06 %, respectively, outperforming 10 state-of-the-art CNN and 10 ViT-based models. Additionally, Grad-CAM visualizations highlight the critical regions influencing classification decisions, enhancing model interpretability. These results establish the proposed method as a reliable, generalizable, and clinically viable solution for automated CRC detection.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"319 ","pages":"Article 113625"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125006719","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Colorectal cancer (CRC) ranks among the most lethal malignancies globally, underscoring the importance of timely and precise diagnosis. Although histopathological examination remains the clinical gold standard, the intricate morphology of tissue samples and inter-observer variability drive the need for robust automated methods. To address these challenges, this paper presents a hybrid deep learning model that integrates InceptionNeXt blocks, enhanced Swin Transformer blocks, and a Residual Multi-Layer Perceptron (ResMLP). In the initial stages, InceptionNeXt blocks employ multi-branch convolutions to capture nuclear morphology, glandular structures, and stromal textures, particularly benefiting limited training data scenarios. Subsequent layers utilize enhanced Swin Transformer blocks with window-based self-attention and shifted windows, effectively modeling long-range dependencies. The ResMLP component further refines feature representation via residual learning. Comprehensive evaluations on two benchmark CRC datasets—NCT-CRCHE-100K and Kather-5K—demonstrated accuracies of 99.96 % and 99.06 %, respectively, outperforming 10 state-of-the-art CNN and 10 ViT-based models. Additionally, Grad-CAM visualizations highlight the critical regions influencing classification decisions, enhancing model interpretability. These results establish the proposed method as a reliable, generalizable, and clinically viable solution for automated CRC detection.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.