Post Hoc Interpretability in Swin UNETR-Based Volumetric Segmentation Using Supervoxel Attributions

IF 2.5 4区 计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Ankit Srivastava, Sandipan Bhowmick, Munesh Chandra, Ashim Saha
{"title":"Post Hoc Interpretability in Swin UNETR-Based Volumetric Segmentation Using Supervoxel Attributions","authors":"Ankit Srivastava,&nbsp;Sandipan Bhowmick,&nbsp;Munesh Chandra,&nbsp;Ashim Saha","doi":"10.1002/ima.70323","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In 3D medical imaging, achieving accurate segmentation using a deep learning model is a vital task, but it is also important to understand how the models produce these results. In the deep learning model, they mostly get high performance, but their inner workings are difficult to understand. The healthcare sector is basically focused on accuracy, and something is missed, such as interpretability and model bias. Mostly, explanation models are designed for 2D data; when they are used in 3D data, they face hurdles in handling its complexity. This paper uses the voxel-level attribution frameworks to focus on which parts of a 3D image are most influential in the model's prediction, and this can be done by using a global binary mask to highlight the most relevant regions and filter out the less important ones. The proposed frameworks use the model-agnostic tool KernelSHAP, which makes the grouping in super-voxel, which reduces the computational load without compromising explanation quality. This combined approach makes it easier to understand how the model works in complex medical scenarios. It provides clear and localized insight into the model decision. This framework supports more transparent and clinically trustworthy applications of deep learning in 3D medical image analysis.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"36 2","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2026-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70323","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

In 3D medical imaging, achieving accurate segmentation using a deep learning model is a vital task, but it is also important to understand how the models produce these results. In the deep learning model, they mostly get high performance, but their inner workings are difficult to understand. The healthcare sector is basically focused on accuracy, and something is missed, such as interpretability and model bias. Mostly, explanation models are designed for 2D data; when they are used in 3D data, they face hurdles in handling its complexity. This paper uses the voxel-level attribution frameworks to focus on which parts of a 3D image are most influential in the model's prediction, and this can be done by using a global binary mask to highlight the most relevant regions and filter out the less important ones. The proposed frameworks use the model-agnostic tool KernelSHAP, which makes the grouping in super-voxel, which reduces the computational load without compromising explanation quality. This combined approach makes it easier to understand how the model works in complex medical scenarios. It provides clear and localized insight into the model decision. This framework supports more transparent and clinically trustworthy applications of deep learning in 3D medical image analysis.

Abstract Image

Abstract Image

基于超体素属性的Swin unetrs体积分割的事后可解释性
在3D医学成像中,使用深度学习模型实现准确分割是一项至关重要的任务,但了解模型如何产生这些结果也很重要。在深度学习模型中,它们大多获得了很高的性能,但它们的内部工作原理很难理解。医疗保健行业基本上关注的是准确性,而忽略了一些东西,比如可解释性和模型偏差。大多数解释模型是针对二维数据设计的;当它们用于3D数据时,它们在处理其复杂性方面面临障碍。本文使用体素级归因框架来关注3D图像的哪些部分对模型的预测影响最大,这可以通过使用全局二值掩码来突出最相关的区域并过滤掉不太重要的区域来实现。所提出的框架使用模型不可知工具KernelSHAP,该工具以超体素进行分组,在不影响解释质量的情况下减少了计算负荷。这种结合的方法可以更容易地理解模型在复杂的医疗场景中的工作原理。它提供了对模型决策的清晰和本地化的洞察。该框架支持在3D医学图像分析中更透明和临床可信的深度学习应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Imaging Systems and Technology
International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术
CiteScore
6.90
自引率
6.10%
发文量
138
审稿时长
3 months
期刊介绍: The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书