{"title":"基于MaskShiftNet和transformer-augmented cnn的双重深度学习框架用于组织病理学图像的口腔癌诊断","authors":"R. Dharani , K. Danesh","doi":"10.1016/j.ibmed.2025.100301","DOIUrl":null,"url":null,"abstract":"<div><div>Oral squamous cell carcinoma (OSCC) is the most prevalent form of oral cancer and poses a significant health threat to the community due to its high death rate. The early detection of OSCC serves as a crucial element for both successful treatment and better patient survival outcomes. A biopsy represents the traditional method for OSCC detection which requires extensive manual processing and expert evaluation. This paper introduces two innovative deep learning architectures, MaskShiftNet and a combined Convolutional neural network with vision Transformer Network (CNN-TransNet), for the efficient segmentation and classification of OSCC from histopathology images. MaskShiftNet amalgamates color, texture, and shape attributes to precisely delineate malignant areas, enhancing localization while minimizing false positives and negatives. CNN-TransNet is a hybrid model that integrates CNN with transformer-based attention mechanisms for efficient gathering of local as well as global contextual data for the robust identification of early-stage OSCC. Comprehensive experimental assessments indicate that the suggested framework outperforms current methodologies, achieving a classification accuracy of 98.94 %, with precision, sensitivity, and specificity at 98.9 %, 98.96 %, and 97.18 %, respectively. Ablation experiments further emphasize the essential functions of segmentation and hybrid feature extraction in improving OSCC classification. These findings validate the capability of CNN-TransNet as a dependable and effective instrument for automated oral cancer detection.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"12 ","pages":"Article 100301"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A dual deep learning framework using MaskShiftNet and transformer-augmented CNNs for oral cancer diagnosis from histopathology images\",\"authors\":\"R. Dharani , K. Danesh\",\"doi\":\"10.1016/j.ibmed.2025.100301\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Oral squamous cell carcinoma (OSCC) is the most prevalent form of oral cancer and poses a significant health threat to the community due to its high death rate. The early detection of OSCC serves as a crucial element for both successful treatment and better patient survival outcomes. A biopsy represents the traditional method for OSCC detection which requires extensive manual processing and expert evaluation. This paper introduces two innovative deep learning architectures, MaskShiftNet and a combined Convolutional neural network with vision Transformer Network (CNN-TransNet), for the efficient segmentation and classification of OSCC from histopathology images. MaskShiftNet amalgamates color, texture, and shape attributes to precisely delineate malignant areas, enhancing localization while minimizing false positives and negatives. CNN-TransNet is a hybrid model that integrates CNN with transformer-based attention mechanisms for efficient gathering of local as well as global contextual data for the robust identification of early-stage OSCC. 
Comprehensive experimental assessments indicate that the suggested framework outperforms current methodologies, achieving a classification accuracy of 98.94 %, with precision, sensitivity, and specificity at 98.9 %, 98.96 %, and 97.18 %, respectively. Ablation experiments further emphasize the essential functions of segmentation and hybrid feature extraction in improving OSCC classification. These findings validate the capability of CNN-TransNet as a dependable and effective instrument for automated oral cancer detection.</div></div>\",\"PeriodicalId\":73399,\"journal\":{\"name\":\"Intelligence-based medicine\",\"volume\":\"12 \",\"pages\":\"Article 100301\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligence-based medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S266652122500105X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence-based medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266652122500105X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A dual deep learning framework using MaskShiftNet and transformer-augmented CNNs for oral cancer diagnosis from histopathology images
Oral squamous cell carcinoma (OSCC) is the most prevalent form of oral cancer and poses a significant public health burden due to its high mortality rate. Early detection of OSCC is crucial for successful treatment and improved patient survival. Biopsy, the traditional method for OSCC detection, requires extensive manual processing and expert evaluation. This paper introduces two deep learning architectures, MaskShiftNet and a hybrid convolutional neural network with a vision transformer (CNN-TransNet), for the efficient segmentation and classification of OSCC in histopathology images. MaskShiftNet combines color, texture, and shape attributes to precisely delineate malignant regions, improving localization while minimizing false positives and false negatives. CNN-TransNet integrates a CNN with transformer-based attention mechanisms to capture both local and global contextual information for robust identification of early-stage OSCC. Comprehensive experiments show that the proposed framework outperforms current methods, achieving a classification accuracy of 98.94%, with precision, sensitivity, and specificity of 98.9%, 98.96%, and 97.18%, respectively. Ablation experiments further highlight the essential roles of segmentation and hybrid feature extraction in improving OSCC classification. These findings confirm CNN-TransNet as a dependable and effective tool for automated oral cancer detection.
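To make the CNN-plus-transformer idea concrete, below is a minimal, illustrative PyTorch sketch of a hybrid classifier in which a small CNN stem extracts local features and a transformer encoder models global context before classification. This is not the authors' CNN-TransNet: the class name, layer sizes, number of heads, and pooling strategy are placeholder assumptions chosen only to show the general pattern described in the abstract.

```python
# Illustrative sketch of a CNN + transformer hybrid classifier (not the authors' CNN-TransNet).
# Layer sizes, heads, and pooling are assumptions for demonstration only.
import torch
import torch.nn as nn

class HybridCnnTransformer(nn.Module):
    def __init__(self, num_classes: int = 2, embed_dim: int = 128):
        super().__init__()
        # CNN stem: captures local texture/shape cues from histopathology patches.
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, embed_dim, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        # Transformer encoder: self-attention over CNN feature-map positions for global context.
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=embed_dim, nhead=4, dim_feedforward=256, batch_first=True
        )
        self.transformer = nn.TransformerEncoder(encoder_layer, num_layers=2)
        self.head = nn.Linear(embed_dim, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        feats = self.cnn(x)                        # (B, C, H, W) local feature map
        tokens = feats.flatten(2).transpose(1, 2)  # (B, H*W, C) token sequence
        tokens = self.transformer(tokens)          # global attention across tokens
        pooled = tokens.mean(dim=1)                # average-pool token embeddings
        return self.head(pooled)                   # class logits (e.g., OSCC vs. normal)

# Example: classify a batch of 224x224 RGB histopathology patches.
model = HybridCnnTransformer(num_classes=2)
logits = model(torch.randn(4, 3, 224, 224))
print(logits.shape)  # torch.Size([4, 2])
```

In this pattern, the convolutional layers supply the local inductive bias while the transformer layers attend across the whole feature map, which is the general motivation for hybrid designs such as the one the abstract describes.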