Evi M.C. Huijben, Sina Amirrajab, Josien P.W. Pluim
Enhancing reconstruction-based out-of-distribution detection in brain MRI with model and metric ensembles
Computer Methods and Programs in Biomedicine, volume 272, Article 109045. Published 2025-09-01.
DOI: 10.1016/j.cmpb.2025.109045
URL: https://www.sciencedirect.com/science/article/pii/S0169260725004626
Abstract
Background and Objective:
Out-of-distribution (OOD) detection is crucial for safely deploying automated medical image analysis systems, as abnormal patterns in images could hamper their performance. However, OOD detection in medical imaging remains an open challenge. In this study, we aim to optimize a reconstruction-based autoencoder specifically for OOD detection. We address three gaps: the underexplored potential of a simple OOD detection model, the lack of optimization of deep learning strategies specifically for OOD detection, and the selection of appropriate reconstruction metrics.
Methods:
We investigated the effectiveness of a reconstruction-based autoencoder for unsupervised detection of synthetic local and global artifacts in brain MRI. We evaluated the general reconstruction capability of the model, analyzed the impact of the selected training epoch and reconstruction metrics, assessed the potential of model and/or metric ensembles, and tested the model on a dataset containing a diverse range of artifacts.
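The abstract does not state how the model and metric ensembles are combined. The following is a minimal sketch under one plausible assumption: each (model, metric) pair yields a per-pixel error map, each map is min-max normalized, and the maps are averaged. The names `normalize_map` and `ensemble_anomaly_map` and the two stand-in metrics are hypothetical; the study itself evaluates metrics such as LPIPS and the SSIM contrast component.

```python
import numpy as np

def normalize_map(error_map):
    """Rescale an error map to [0, 1]; a constant map becomes all zeros."""
    error_map = np.asarray(error_map, dtype=float)
    span = error_map.max() - error_map.min()
    if span == 0:
        return np.zeros_like(error_map)
    return (error_map - error_map.min()) / span

def ensemble_anomaly_map(image, reconstructions, metrics):
    """Average normalized error maps over every (model, metric) pair.

    reconstructions: list of arrays, one reconstruction per model.
    metrics: list of callables (image, reconstruction) -> per-pixel error map.
    Averaging is an assumption of this sketch, not the paper's stated rule.
    """
    maps = [normalize_map(metric(image, recon))
            for recon in reconstructions
            for metric in metrics]
    return np.mean(maps, axis=0)

# Simple stand-in metrics, used here only to keep the sketch self-contained.
def abs_error(x, y):
    return np.abs(x - y)

def squared_error(x, y):
    return (x - y) ** 2

# Toy input: a flat background with a small synthetic local "artifact".
image = np.zeros((8, 8))
image[2:4, 2:4] = 1.0
recon_a = np.zeros((8, 8))      # hypothetical model A: reproduces the background only
recon_b = np.full((8, 8), 0.1)  # hypothetical model B: slightly biased background
anomaly = ensemble_anomaly_map(image, [recon_a, recon_b], [abs_error, squared_error])
```

Normalizing before averaging keeps a metric with a large numeric range from dominating the ensemble; other combination rules (maximum, weighted mean) would fit the same interface.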
Results:
Among the metrics assessed, the learned perceptual image patch similarity (LPIPS) and the contrast component of structural similarity index measure (SSIM) consistently outperformed others in detecting homogeneous circular anomalies. By combining two well-converged models and using LPIPS and contrast as reconstruction metrics, we achieved a pixel-level area under the Precision–Recall curve of 0.66. Furthermore, with the more realistic OOD dataset, we observed that the detection performance varied between artifact types; local artifacts were more difficult to detect, while global artifacts showed better detection results.
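To make the contrast metric and the pixel-level score concrete, here is a hedged sketch. It computes the standard SSIM contrast term, c(x, y) = (2·σx·σy + C2) / (σx² + σy² + C2), in a local window and scores the resulting error map with average precision, one common estimator of the area under the Precision–Recall curve. The window size, the constants, the toy image, and the function names are assumptions of this example; the paper's exact settings and AUPRC estimator are not given in the abstract, and LPIPS is omitted because it requires a pretrained network.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def local_std(img, win=5):
    """Standard deviation in a win x win neighbourhood of every pixel."""
    pad = win // 2
    padded = np.pad(np.asarray(img, dtype=float), pad, mode="reflect")
    return sliding_window_view(padded, (win, win)).std(axis=(-2, -1))

def contrast_error_map(x, y, win=5, data_range=1.0, k2=0.03):
    """1 - SSIM contrast term; 0 where local contrast matches, near 1 where it differs."""
    c2 = (k2 * data_range) ** 2
    sx, sy = local_std(x, win), local_std(y, win)
    contrast = (2 * sx * sy + c2) / (sx ** 2 + sy ** 2 + c2)
    return 1.0 - contrast

def pixel_average_precision(scores, labels):
    """Average precision over all pixels (ties are broken by pixel order)."""
    scores = np.ravel(scores)
    labels = np.ravel(labels).astype(bool)
    hits = labels[np.argsort(-scores, kind="stable")]
    precision = np.cumsum(hits) / np.arange(1, hits.size + 1)
    return float(precision[hits].sum() / hits.sum())

# Toy data: random texture with a homogeneous circular anomaly pasted in.
rng = np.random.default_rng(0)
texture = rng.random((32, 32))
yy, xx = np.mgrid[:32, :32]
mask = (yy - 16) ** 2 + (xx - 16) ** 2 <= 8 ** 2
image = texture.copy()
image[mask] = 0.5

# Stand-in for an autoencoder output: the anomaly-free texture.
error = contrast_error_map(image, texture)
ap = pixel_average_precision(error, mask)
```

A homogeneous region has near-zero local standard deviation, so the contrast term drops sharply wherever the reconstruction restores texture that the anomaly has erased. This is consistent with the reported strength of the contrast component on homogeneous circular anomalies, although the toy data here says nothing about the paper's actual numbers.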
Conclusions:
Our study shows that a reconstruction-based autoencoder, when combined with appropriate metrics, enhances OOD detection in brain MRI. These findings emphasize the importance of carefully selecting metrics and model configurations and highlight the need for tailored approaches, as standard deep learning approaches do not always align with the unique challenges of OOD detection. Improving OOD detection can increase the reliability of automated medical image analysis.
About the journal:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; to support the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.