Automatic quality control of brain 3D FLAIR MRIs for a clinical data warehouse

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-05-02 DOI:10.1016/j.media.2025.103617

Sophie Loizillon , Simona Bottani , Aurélien Maire , Sebastian Ströer , Lydia Chougar , Didier Dormont , Olivier Colliot , Ninon Burgos , APPRIMAGE Study Group

{"title":"Automatic quality control of brain 3D FLAIR MRIs for a clinical data warehouse","authors":"Sophie Loizillon , Simona Bottani , Aurélien Maire , Sebastian Ströer , Lydia Chougar , Didier Dormont , Olivier Colliot , Ninon Burgos , APPRIMAGE Study Group","doi":"10.1016/j.media.2025.103617","DOIUrl":null,"url":null,"abstract":"<div><div>Clinical data warehouses, which have arisen over the last decade, bring together the medical data of millions of patients and offer the potential to train and validate machine learning models in real-world scenarios. The quality of MRIs collected in clinical data warehouses differs significantly from that generally observed in research datasets, reflecting the variability inherent to clinical practice. Consequently, the use of clinical data requires the implementation of robust quality control tools.</div><div>By using a substantial number of pre-existing manually labelled T1-weighted MR images (5,500) alongside a smaller set of newly labelled FLAIR images (926), we present a novel semi-supervised adversarial domain adaptation architecture designed to exploit shared representations between MRI sequences thanks to a shared feature extractor, while taking into account the specificities of the FLAIR thanks to a specific classification head for each sequence. This architecture thus consists of a common invariant feature extractor, a domain classifier and two classification heads specific to the source and target, all designed to effectively deal with potential class distribution shifts between the source and target data classes. The primary objectives of this paper were: (1) to identify images which are not proper 3D FLAIR brain MRIs; (2) to rate the overall image quality.</div><div>For the first objective, our approach demonstrated excellent results, with a balanced accuracy of 89%, comparable to that of human raters. For the second objective, our approach achieved good performance, although lower than that of human raters. Nevertheless, the automatic approach accurately identified bad quality images (balanced accuracy >79%). In conclusion, our proposed approach overcomes the initial barrier of heterogeneous image quality in clinical data warehouses, thereby facilitating the development of new research using clinical routine 3D FLAIR brain images.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"103 ","pages":"Article 103617"},"PeriodicalIF":11.8000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525001641","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Clinical data warehouses, which have arisen over the last decade, bring together the medical data of millions of patients and offer the potential to train and validate machine learning models in real-world scenarios. The quality of MRIs collected in clinical data warehouses differs significantly from that generally observed in research datasets, reflecting the variability inherent to clinical practice. Consequently, the use of clinical data requires the implementation of robust quality control tools.

By using a substantial number of pre-existing manually labelled T1-weighted MR images (5,500) alongside a smaller set of newly labelled FLAIR images (926), we present a novel semi-supervised adversarial domain adaptation architecture designed to exploit shared representations between MRI sequences thanks to a shared feature extractor, while taking into account the specificities of the FLAIR thanks to a specific classification head for each sequence. This architecture thus consists of a common invariant feature extractor, a domain classifier and two classification heads specific to the source and target, all designed to effectively deal with potential class distribution shifts between the source and target data classes. The primary objectives of this paper were: (1) to identify images which are not proper 3D FLAIR brain MRIs; (2) to rate the overall image quality.

For the first objective, our approach demonstrated excellent results, with a balanced accuracy of 89%, comparable to that of human raters. For the second objective, our approach achieved good performance, although lower than that of human raters. Nevertheless, the automatic approach accurately identified bad quality images (balanced accuracy >79%). In conclusion, our proposed approach overcomes the initial barrier of heterogeneous image quality in clinical data warehouses, thereby facilitating the development of new research using clinical routine 3D FLAIR brain images.

查看原文本刊更多论文

用于临床数据仓库的脑3D FLAIR核磁共振成像自动质量控制

临床数据仓库在过去十年中兴起，汇集了数百万患者的医疗数据，并提供了在现实世界场景中训练和验证机器学习模型的潜力。临床数据仓库中收集的核磁共振成像的质量与研究数据集中通常观察到的质量有很大不同，这反映了临床实践固有的可变性。因此，临床数据的使用需要实施强有力的质量控制工具。通过使用大量预先存在的人工标记的t1加权MR图像（5,500）以及较小的新标记的FLAIR图像（926），我们提出了一种新的半监督对抗性域适应架构，该架构旨在利用共享特征提取器来利用MRI序列之间的共享表示，同时考虑到由于每个序列的特定分类头而导致的FLAIR的特异性。因此，该体系结构由一个通用不变特征提取器、一个域分类器和两个特定于源和目标的分类头组成，所有这些都旨在有效地处理源和目标数据类之间潜在的类分布转移。本文的主要目的是：(1)识别不正确的3D FLAIR脑mri图像；(2)评价整体图像质量。对于第一个目标，我们的方法显示了出色的结果，平衡准确率为89%，与人类评分者相当。对于第二个目标，我们的方法取得了良好的性能，尽管低于人类评分者。然而，自动方法准确地识别出质量差的图像（平衡精度>；79%）。总之，我们提出的方法克服了临床数据仓库中图像质量异构的最初障碍，从而促进了使用临床常规3D FLAIR脑图像进行新研究的发展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.