{"title":"An explainable AI for breast cancer classification using vision Transformer (ViT)","authors":"Marwa Naas , Hiba Mzoughi , Ines Njeh , Mohamed BenSlima","doi":"10.1016/j.bspc.2025.108011","DOIUrl":null,"url":null,"abstract":"<div><div>Manual classification of breast cancer (BC) through an optical microscope is regarded as an essential task throughout clinical routines, necessitating highly skilled pathologists. Computer-aided diagnosis (CAD) techniques based on deep learning (DL) are developed to assist the pathologists in making diagnostic decisions. Nevertheless, the black-box nature and the absence of interpretability and transparency of these DL-based models render their application highly difficult in sensitive and critical medical applications. In addition to providing explanations for the model predictions, explainable artificial intelligence (XAI) strategies help to gain the trust of clinicians. The current Convolutional Neural Network (CNN) architectures have limitations in capturing the global feature information details present in BC histopathological images. To overcome the challenge of long-range dependenciesin CNN-based models, Vision Transformer (ViT) architectures have recently been created.</div><div>These architectures have a self-attention mechanism that enables the analysis of images. As a result, the network is able to record the deep long-range dependence between pixels. The present work aims to develop an effective CAD tool for BC classification. In this study, we investigated a deep ViT architecture trained to perform binary lesions classification (malignant versus benign) using histopathology images. Various XAI techniques have been implemented: Gradient-Weighted Class Activation Mapping (Grad-CAM), Vanilla gradient, Integrated gradients, Saliency Maps, Local Interpretable Model Agnostic Explanation (LIME), and Attention Maps to highlight the most important features of the model prediction outcomes. The evaluation task was performed using the publicly accessible benchmark dataset BreakHis. Based on the research outcomes, our suggested ViT architecture demonstrates competitive performance, surpassing state-of-the-art CNN models in the analysis of histopathological images. Furthermore, the proposed models provide precise and accurate interpretations, reinforcing their reliability. Therefore, we can affirm that the proposed CAD system can be effectively integrated into clinical diagnostic routines, offering enhanced support for medical professionals.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"108 ","pages":"Article 108011"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425005221","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Manual classification of breast cancer (BC) through an optical microscope is regarded as an essential task throughout clinical routines, necessitating highly skilled pathologists. Computer-aided diagnosis (CAD) techniques based on deep learning (DL) are developed to assist the pathologists in making diagnostic decisions. Nevertheless, the black-box nature and the absence of interpretability and transparency of these DL-based models render their application highly difficult in sensitive and critical medical applications. In addition to providing explanations for the model predictions, explainable artificial intelligence (XAI) strategies help to gain the trust of clinicians. The current Convolutional Neural Network (CNN) architectures have limitations in capturing the global feature information details present in BC histopathological images. To overcome the challenge of long-range dependenciesin CNN-based models, Vision Transformer (ViT) architectures have recently been created.
These architectures have a self-attention mechanism that enables the analysis of images. As a result, the network is able to record the deep long-range dependence between pixels. The present work aims to develop an effective CAD tool for BC classification. In this study, we investigated a deep ViT architecture trained to perform binary lesions classification (malignant versus benign) using histopathology images. Various XAI techniques have been implemented: Gradient-Weighted Class Activation Mapping (Grad-CAM), Vanilla gradient, Integrated gradients, Saliency Maps, Local Interpretable Model Agnostic Explanation (LIME), and Attention Maps to highlight the most important features of the model prediction outcomes. The evaluation task was performed using the publicly accessible benchmark dataset BreakHis. Based on the research outcomes, our suggested ViT architecture demonstrates competitive performance, surpassing state-of-the-art CNN models in the analysis of histopathological images. Furthermore, the proposed models provide precise and accurate interpretations, reinforcing their reliability. Therefore, we can affirm that the proposed CAD system can be effectively integrated into clinical diagnostic routines, offering enhanced support for medical professionals.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.