Enhancing reconstruction-based out-of-distribution detection in brain MRI with model and metric ensembles

IF 4.8 2区医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computer methods and programs in biomedicine Pub Date : 2025-09-01 DOI:10.1016/j.cmpb.2025.109045

Evi M.C. Huijben , Sina Amirrajab , Josien P.W. Pluim

{"title":"Enhancing reconstruction-based out-of-distribution detection in brain MRI with model and metric ensembles","authors":"Evi M.C. Huijben , Sina Amirrajab , Josien P.W. Pluim","doi":"10.1016/j.cmpb.2025.109045","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Objective:</h3><div>Out-of-distribution (OOD) detection is crucial for safely deploying automated medical image analysis systems, as abnormal patterns in images could hamper their performance. However, OOD detection in medical imaging remains an open challenge. In this study, we aim to optimize a reconstruction-based autoencoder specifically for OOD detection. We address three gaps: the underexplored potential of a simple OOD detection model, the lack of optimization of deep learning strategies specifically for OOD detection, and the selection of appropriate reconstruction metrics.</div></div><div><h3>Methods:</h3><div>We investigated the effectiveness of a reconstruction-based autoencoder for unsupervised detection of synthetic local and global artifacts in brain MRI. We evaluated the general reconstruction capability of the model, analyzed the impact of the selected training epoch and reconstruction metrics, assessed the potential of model and/or metric ensembles, and tested the model on a dataset containing a diverse range of artifacts.</div></div><div><h3>Results:</h3><div>Among the metrics assessed, the learned perceptual image patch similarity (LPIPS) and the contrast component of structural similarity index measure (SSIM) consistently outperformed others in detecting homogeneous circular anomalies. By combining two well-converged models and using LPIPS and contrast as reconstruction metrics, we achieved a pixel-level area under the Precision–Recall curve of 0.66. Furthermore, with the more realistic OOD dataset, we observed that the detection performance varied between artifact types; local artifacts were more difficult to detect, while global artifacts showed better detection results.</div></div><div><h3>Conclusions:</h3><div>Our study shows that a reconstruction-based autoencoder, when combined with appropriate metrics, enhances OOD detection in brain MRI. These findings emphasize the importance of carefully selecting metrics and model configurations and highlight the need for tailored approaches, as standard deep learning approaches do not always align with the unique challenges of OOD detection. Improving OOD detection can increase the reliability of automated medical image analysis.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"272 ","pages":"Article 109045"},"PeriodicalIF":4.8000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725004626","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Background and Objective:

Out-of-distribution (OOD) detection is crucial for safely deploying automated medical image analysis systems, as abnormal patterns in images could hamper their performance. However, OOD detection in medical imaging remains an open challenge. In this study, we aim to optimize a reconstruction-based autoencoder specifically for OOD detection. We address three gaps: the underexplored potential of a simple OOD detection model, the lack of optimization of deep learning strategies specifically for OOD detection, and the selection of appropriate reconstruction metrics.

Methods:

We investigated the effectiveness of a reconstruction-based autoencoder for unsupervised detection of synthetic local and global artifacts in brain MRI. We evaluated the general reconstruction capability of the model, analyzed the impact of the selected training epoch and reconstruction metrics, assessed the potential of model and/or metric ensembles, and tested the model on a dataset containing a diverse range of artifacts.

Results:

Among the metrics assessed, the learned perceptual image patch similarity (LPIPS) and the contrast component of structural similarity index measure (SSIM) consistently outperformed others in detecting homogeneous circular anomalies. By combining two well-converged models and using LPIPS and contrast as reconstruction metrics, we achieved a pixel-level area under the Precision–Recall curve of 0.66. Furthermore, with the more realistic OOD dataset, we observed that the detection performance varied between artifact types; local artifacts were more difficult to detect, while global artifacts showed better detection results.

Conclusions:

Our study shows that a reconstruction-based autoencoder, when combined with appropriate metrics, enhances OOD detection in brain MRI. These findings emphasize the importance of carefully selecting metrics and model configurations and highlight the need for tailored approaches, as standard deep learning approaches do not always align with the unique challenges of OOD detection. Improving OOD detection can increase the reliability of automated medical image analysis.

Abstract Image

查看原文本刊更多论文

利用模型和度量集合增强基于重建的脑MRI非分布检测

背景和目的：分布外（OOD）检测对于安全部署自动医学图像分析系统至关重要，因为图像中的异常模式可能会阻碍其性能。然而，医学成像中的OOD检测仍然是一个开放的挑战。在这项研究中，我们的目标是优化一个基于重建的自编码器，专门用于OOD检测。我们解决了三个差距：简单OOD检测模型的潜力未被充分开发，缺乏针对OOD检测的深度学习策略优化，以及选择适当的重建指标。方法：我们研究了基于重建的自编码器在脑MRI合成局部和全局伪影的无监督检测中的有效性。我们评估了模型的一般重建能力，分析了所选训练历元和重建指标的影响，评估了模型和/或度量集合的潜力，并在包含各种工件的数据集上测试了模型。结果：在评估的指标中，学习感知图像斑块相似性（LPIPS）和结构相似性指数度量（SSIM）的对比成分在检测均匀圆形异常方面始终优于其他指标。通过结合两个良好收敛的模型，并使用LPIPS和对比度作为重建指标，我们在精度-召回率曲线下获得了0.66的像素级区域。此外，对于更真实的OOD数据集，我们观察到不同工件类型的检测性能有所不同；局部伪影较难检测，而全局伪影的检测效果较好。结论：我们的研究表明，当与适当的指标相结合时，基于重建的自编码器可以增强脑MRI中OOD的检测。这些发现强调了仔细选择指标和模型配置的重要性，并强调了定制方法的必要性，因为标准的深度学习方法并不总是与OOD检测的独特挑战相一致。改进OOD检测可以提高医学图像自动化分析的可靠性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer methods and programs in biomedicine 工程技术-工程：生物医学

CiteScore

12.30

自引率

6.60%

发文量

601

审稿时长

135 days

期刊介绍： To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine. Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.