Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation.

McKell Woodland, Nihil Patel, Austin Castelo, Mais Al Taie, Mohamed Eltaher, Joshua P Yung, Tucker J Netherton, Tiffany L Calderone, Jessica I Sanchez, Darrel W Cleere, Ahmed Elsaiey, Nakul Gupta, David Victor, Laura Beretta, Ankit B Patel, Kristy K Brock
{"title":"Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation.","authors":"McKell Woodland, Nihil Patel, Austin Castelo, Mais Al Taie, Mohamed Eltaher, Joshua P Yung, Tucker J Netherton, Tiffany L Calderone, Jessica I Sanchez, Darrel W Cleere, Ahmed Elsaiey, Nakul Gupta, David Victor, Laura Beretta, Ankit B Patel, Kristy K Brock","doi":"10.59275/j.melba.2024-g93a","DOIUrl":null,"url":null,"abstract":"<p><p>Clinically deployed deep learning-based segmentation models are known to fail on data outside of their training distributions. While clinicians review the segmentations, these models tend to perform well in most instances, which could exacerbate automation bias. Therefore, detecting out-of-distribution images at inference is critical to warn the clinicians that the model likely failed. This work applied the Mahalanobis distance (MD) post hoc to the bottleneck features of four Swin UNETR and nnU-net models that segmented the liver on T1-weighted magnetic resonance imaging and computed tomography. By reducing the dimensions of the bottleneck features with either principal component analysis or uniform manifold approximation and projection, images the models failed on were detected with high performance and minimal computational load. In addition, this work explored a non-parametric alternative to the MD, a k-th nearest neighbors distance (KNN). KNN drastically improved scalability and performance over MD when both were applied to raw and average-pooled bottleneck features. Our code is available at https://github.com/mckellwoodland/dimen_reduce_mahal.</p>","PeriodicalId":75083,"journal":{"name":"The journal of machine learning for biomedical imaging","volume":"2 UNSURE2023 Spec Iss","pages":"2006-2052"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12123533/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The journal of machine learning for biomedical imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59275/j.melba.2024-g93a","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/23 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Clinically deployed deep learning-based segmentation models are known to fail on data outside of their training distributions. While clinicians review the segmentations, these models tend to perform well in most instances, which could exacerbate automation bias. Therefore, detecting out-of-distribution images at inference is critical to warn the clinicians that the model likely failed. This work applied the Mahalanobis distance (MD) post hoc to the bottleneck features of four Swin UNETR and nnU-net models that segmented the liver on T1-weighted magnetic resonance imaging and computed tomography. By reducing the dimensions of the bottleneck features with either principal component analysis or uniform manifold approximation and projection, images the models failed on were detected with high performance and minimal computational load. In addition, this work explored a non-parametric alternative to the MD, a k-th nearest neighbors distance (KNN). KNN drastically improved scalability and performance over MD when both were applied to raw and average-pooled bottleneck features. Our code is available at https://github.com/mckellwoodland/dimen_reduce_mahal.

降维和最近邻改进医学图像分割中的离分布检测。
众所周知,临床部署的基于深度学习的分割模型在训练分布之外的数据上是失败的。当临床医生审查分割时,这些模型在大多数情况下往往表现良好,这可能会加剧自动化偏见。因此,在推理中检测出分布外的图像对于警告临床医生模型可能失败至关重要。本研究将马氏距离(MD)应用于四种Swin UNETR和nnU-net模型的瓶颈特征,这些模型在t1加权磁共振成像和计算机断层扫描上对肝脏进行分割。通过主成分分析或均匀流形逼近和投影来降低瓶颈特征的维数,以高性能和最小的计算负荷检测出模型失败的图像。此外,本研究探索了MD的非参数替代方法,即第k近邻距离(KNN)。当KNN应用于原始和平均池瓶颈特征时,它大大提高了可伸缩性和性能。我们的代码可在https://github.com/mckellwoodland/dimen_reduce_mahal上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信