{"title":"DeepPatchNet: A deep learning model for enhanced screening and diagnosis of oral cancer","authors":"Idriss Tafala , Fatima-Ezzahraa Ben-Bouazza , Aymane Edder , Oumaima Manchadi , Bassma Jioudi","doi":"10.1016/j.imu.2025.101658","DOIUrl":null,"url":null,"abstract":"<div><div>Oral cancer remains a serious global health challenge, significantly affecting patient survival and quality of life. While convolutional neural networks (CNNs) have historically dominated image classification tasks, recent advances suggest that transformer-based models may offer superior performance—albeit with high data and computational demands. In this study, we present <strong>DeepPatchNet</strong>, a novel deep learning architecture that integrates DeepLabV3+ and ConvMixer to address these limitations. Designed for histopathological image classification, DeepPatchNet provides a lightweight, interpretable, and high-performing solution. We evaluated the model on the NDB-UFES dataset (3763 images) and an independent H&E-stained OSCC dataset (1020 images), benchmarking its performance against state-of-the-art models including Vision Transformers (ViTs)<span><span>[1]</span></span>, <span><span>[2]</span></span>, InceptionResNetV2, VGG19, and ConvNeXt. DeepPatchNet achieved superior performance with 86.71% accuracy, 86.80% precision, 86.71% recall, and an F1 score of 86.75%, outperforming all comparison models. Furthermore, the integration of Gradient-weighted Class Activation Mapping (Grad-CAM) enhances interpretability by visually highlighting diagnostically relevant features, addressing a key barrier to clinical adoption. While our results are promising, further validation in real-world clinical settings is needed. DeepPatchNet shows strong potential as a reliable decision support tool for early oral cancer detection and diagnosis.</div></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"57 ","pages":"Article 101658"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352914825000462","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
Citations: 0
Abstract
Oral cancer remains a serious global health challenge, significantly affecting patient survival and quality of life. While convolutional neural networks (CNNs) have historically dominated image classification tasks, recent advances suggest that transformer-based models may offer superior performance—albeit with high data and computational demands. In this study, we present DeepPatchNet, a novel deep learning architecture that integrates DeepLabV3+ and ConvMixer to address these limitations. Designed for histopathological image classification, DeepPatchNet provides a lightweight, interpretable, and high-performing solution. We evaluated the model on the NDB-UFES dataset (3763 images) and an independent H&E-stained OSCC dataset (1020 images), benchmarking its performance against state-of-the-art models including Vision Transformers (ViTs)[1], [2], InceptionResNetV2, VGG19, and ConvNeXt. DeepPatchNet achieved superior performance with 86.71% accuracy, 86.80% precision, 86.71% recall, and an F1 score of 86.75%, outperforming all comparison models. Furthermore, the integration of Gradient-weighted Class Activation Mapping (Grad-CAM) enhances interpretability by visually highlighting diagnostically relevant features, addressing a key barrier to clinical adoption. While our results are promising, further validation in real-world clinical settings is needed. DeepPatchNet shows strong potential as a reliable decision support tool for early oral cancer detection and diagnosis.
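The abstract highlights Grad-CAM as the interpretability component. As a point of reference, the core Grad-CAM computation is compact: channel weights come from globally average-pooling the gradients of the class score with respect to a convolutional feature map, and the heatmap is the ReLU of the weighted sum of those feature maps. The sketch below is a minimal, framework-free illustration of that computation; the array shapes and the `grad_cam` function name are illustrative assumptions, not part of DeepPatchNet's actual implementation.

```python
def grad_cam(activations, gradients):
    """Minimal Grad-CAM sketch (illustrative, not DeepPatchNet internals).

    activations: list of K feature maps, each an H x W list of lists.
    gradients:   list of K gradient maps of the same shape
                 (d class-score / d activation).
    Returns an H x W heatmap: ReLU of the weighted sum of feature maps.
    """
    k = len(activations)
    h, w = len(activations[0]), len(activations[0][0])
    # Channel weights: global average pooling over each gradient map.
    weights = [sum(sum(row) for row in g) / (h * w) for g in gradients]
    # Weighted combination of feature maps, clamped at zero (ReLU).
    return [[max(0.0, sum(weights[c] * activations[c][i][j] for c in range(k)))
             for j in range(w)]
            for i in range(h)]
```

In practice the heatmap is upsampled to the input-image resolution and overlaid on the histopathological slide, which is how the visual highlighting of diagnostically relevant regions described above is produced.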
Journal overview:
Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmology, nursing, and translational medicine informatics. All papers published in the journal are freely accessible to anyone who visits the website.