SNet: A novel convolutional neural network architecture for advanced endoscopic image classification of gastrointestinal disorders

IF 3.7 4区医学 Q3 BIOCHEMICAL RESEARCH METHODS

SLAS Technology Pub Date : 2025-05-30 DOI:10.1016/j.slast.2025.100304

Samra Siddiqui , Junaid A. Khan , Tallha Akram , Meshal Alharbi , Jaehyuk Cha , Dina A. AlHammadi

{"title":"SNet: A novel convolutional neural network architecture for advanced endoscopic image classification of gastrointestinal disorders","authors":"Samra Siddiqui , Junaid A. Khan , Tallha Akram , Meshal Alharbi , Jaehyuk Cha , Dina A. AlHammadi","doi":"10.1016/j.slast.2025.100304","DOIUrl":null,"url":null,"abstract":"<div><div>With the intent of assisting gastroenterologists from all over the world, the proposed work aims to eliminate the effort required to achieve accurate diagnoses. Statistically, gastrointestinal diseases often result in fatal disorders, contributing to a significant number of fatalities. The upper gastrointestinal tract (GIT) includes the stomach, esophagus, and duodenum, while the lower one comprises a section of the small intestine, namely the ileum, as well as the large intestine, including the colon. The challenges associated with GIT tract issues are apparently complex. Therefore, multiple challenges exist regarding CAD (Computer-aided diagnosis) and endoscopy, including a lack of annotated images, a dark background, poor contrast, and an irregular pattern. The objective of this research is to develop a robust deep network, called SNet, that offers a solution to complex classification problems. Firstly, the endoscopic images undergo preprocessing before being subjected to feature extraction. This step involves image resizing along with the augmentation step. The proposed convolutional neural network (CNN) model is comprised of six blocks placed at different layers. To enable the exhaustive evaluation of proposed framework across different datasets, the model has undergone training on a very complex HyperKvasir dataset, and later tested on Kvasir v1 and v2 datasets. This facilitates cross-dataset system evaluation, resulting in an efficient system for an unseen image diagnosis. To avoid the problem of “<em>curse of dimensionality</em>”, the most discriminant feature information is selected based on proposed minimum redundancy maximum relevance (MRMR) algorithm. The proposed architecture has been evaluated using a range of performance metrics, such as accuracy, sensitivity, specificity, and Area under curve (AUC). With classification accuracy as the main metric, the achieved accuracy is 98.45% on the Kvasir v1, and 97.83% on the Kvasir v2 datasets.</div></div>","PeriodicalId":54248,"journal":{"name":"SLAS Technology","volume":"33 ","pages":"Article 100304"},"PeriodicalIF":3.7000,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SLAS Technology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2472630325000627","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

With the intent of assisting gastroenterologists from all over the world, the proposed work aims to eliminate the effort required to achieve accurate diagnoses. Statistically, gastrointestinal diseases often result in fatal disorders, contributing to a significant number of fatalities. The upper gastrointestinal tract (GIT) includes the stomach, esophagus, and duodenum, while the lower one comprises a section of the small intestine, namely the ileum, as well as the large intestine, including the colon. The challenges associated with GIT tract issues are apparently complex. Therefore, multiple challenges exist regarding CAD (Computer-aided diagnosis) and endoscopy, including a lack of annotated images, a dark background, poor contrast, and an irregular pattern. The objective of this research is to develop a robust deep network, called SNet, that offers a solution to complex classification problems. Firstly, the endoscopic images undergo preprocessing before being subjected to feature extraction. This step involves image resizing along with the augmentation step. The proposed convolutional neural network (CNN) model is comprised of six blocks placed at different layers. To enable the exhaustive evaluation of proposed framework across different datasets, the model has undergone training on a very complex HyperKvasir dataset, and later tested on Kvasir v1 and v2 datasets. This facilitates cross-dataset system evaluation, resulting in an efficient system for an unseen image diagnosis. To avoid the problem of “curse of dimensionality”, the most discriminant feature information is selected based on proposed minimum redundancy maximum relevance (MRMR) algorithm. The proposed architecture has been evaluated using a range of performance metrics, such as accuracy, sensitivity, specificity, and Area under curve (AUC). With classification accuracy as the main metric, the achieved accuracy is 98.45% on the Kvasir v1, and 97.83% on the Kvasir v2 datasets.

查看原文本刊更多论文

SNet：一种新的卷积神经网络架构，用于胃肠疾病的高级内镜图像分类。

为了帮助来自世界各地的胃肠病学家，建议的工作旨在消除实现准确诊断所需的努力。据统计，胃肠道疾病往往导致致命的疾病，造成大量死亡。上消化道包括胃、食道和十二指肠，下消化道包括小肠的一段，即回肠，以及大肠，包括结肠。与GIT通道问题相关的挑战显然是复杂的。因此，CAD（计算机辅助诊断）和内窥镜检查存在多重挑战，包括缺乏注释图像、暗背景、对比度差和不规则图案。这项研究的目的是开发一个强大的深度网络，称为SNet，为复杂的分类问题提供解决方案。首先对内窥镜图像进行预处理，然后进行特征提取。这一步包括图像大小调整和增强步骤。所提出的卷积神经网络（CNN）模型由放置在不同层的六个块组成。为了能够跨不同数据集对所提出的框架进行详尽的评估，该模型在非常复杂的HyperKvasir数据集上进行了训练，随后在Kvasir v1和v2数据集上进行了测试。这有助于跨数据集系统评估，从而形成一个有效的系统，用于未见过的图像诊断。为了避免“维数诅咒”的问题，基于所提出的最小冗余最大相关（MRMR）算法选择最具判别性的特征信息。所提出的体系结构已经使用一系列性能指标进行了评估，例如准确性、灵敏度、特异性和曲线下面积（AUC）。以分类准确率为主要指标，在Kvasir v1数据集上实现的准确率为98.45%，在Kvasir v2数据集上实现的准确率为97.83%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

SLAS Technology Computer Science-Computer Science Applications

CiteScore

6.30

自引率

7.40%

发文量

审稿时长

106 days

期刊介绍： SLAS Technology emphasizes scientific and technical advances that enable and improve life sciences research and development; drug-delivery; diagnostics; biomedical and molecular imaging; and personalized and precision medicine. This includes high-throughput and other laboratory automation technologies; micro/nanotechnologies; analytical, separation and quantitative techniques; synthetic chemistry and biology; informatics (data analysis, statistics, bio, genomic and chemoinformatics); and more.