Post Hoc Interpretability in Swin UNETR-Based Volumetric Segmentation Using Supervoxel Attributions

IF 2.5 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

International Journal of Imaging Systems and Technology Pub Date : 2026-02-25 DOI:10.1002/ima.70323

Ankit Srivastava, Sandipan Bhowmick, Munesh Chandra, Ashim Saha

{"title":"Post Hoc Interpretability in Swin UNETR-Based Volumetric Segmentation Using Supervoxel Attributions","authors":"Ankit Srivastava, Sandipan Bhowmick, Munesh Chandra, Ashim Saha","doi":"10.1002/ima.70323","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In 3D medical imaging, achieving accurate segmentation using a deep learning model is a vital task, but it is also important to understand how the models produce these results. In the deep learning model, they mostly get high performance, but their inner workings are difficult to understand. The healthcare sector is basically focused on accuracy, and something is missed, such as interpretability and model bias. Mostly, explanation models are designed for 2D data; when they are used in 3D data, they face hurdles in handling its complexity. This paper uses the voxel-level attribution frameworks to focus on which parts of a 3D image are most influential in the model's prediction, and this can be done by using a global binary mask to highlight the most relevant regions and filter out the less important ones. The proposed frameworks use the model-agnostic tool KernelSHAP, which makes the grouping in super-voxel, which reduces the computational load without compromising explanation quality. This combined approach makes it easier to understand how the model works in complex medical scenarios. It provides clear and localized insight into the model decision. This framework supports more transparent and clinically trustworthy applications of deep learning in 3D medical image analysis.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"36 2","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2026-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70323","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

In 3D medical imaging, achieving accurate segmentation using a deep learning model is a vital task, but it is also important to understand how the models produce these results. In the deep learning model, they mostly get high performance, but their inner workings are difficult to understand. The healthcare sector is basically focused on accuracy, and something is missed, such as interpretability and model bias. Mostly, explanation models are designed for 2D data; when they are used in 3D data, they face hurdles in handling its complexity. This paper uses the voxel-level attribution frameworks to focus on which parts of a 3D image are most influential in the model's prediction, and this can be done by using a global binary mask to highlight the most relevant regions and filter out the less important ones. The proposed frameworks use the model-agnostic tool KernelSHAP, which makes the grouping in super-voxel, which reduces the computational load without compromising explanation quality. This combined approach makes it easier to understand how the model works in complex medical scenarios. It provides clear and localized insight into the model decision. This framework supports more transparent and clinically trustworthy applications of deep learning in 3D medical image analysis.

Abstract Image

查看原文本刊更多论文

基于超体素属性的Swin unetrs体积分割的事后可解释性

在3D医学成像中，使用深度学习模型实现准确分割是一项至关重要的任务，但了解模型如何产生这些结果也很重要。在深度学习模型中，它们大多获得了很高的性能，但它们的内部工作原理很难理解。医疗保健行业基本上关注的是准确性，而忽略了一些东西，比如可解释性和模型偏差。大多数解释模型是针对二维数据设计的；当它们用于3D数据时，它们在处理其复杂性方面面临障碍。本文使用体素级归因框架来关注3D图像的哪些部分对模型的预测影响最大，这可以通过使用全局二值掩码来突出最相关的区域并过滤掉不太重要的区域来实现。所提出的框架使用模型不可知工具KernelSHAP，该工具以超体素进行分组，在不影响解释质量的情况下减少了计算负荷。这种结合的方法可以更容易地理解模型在复杂的医疗场景中的工作原理。它提供了对模型决策的清晰和本地化的洞察。该框架支持在3D医学图像分析中更透明和临床可信的深度学习应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术

CiteScore

6.90

自引率

6.10%

发文量

138

审稿时长

3 months

期刊介绍： The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.