Agreement of Image Quality Metrics with Radiological Evaluation in the Presence of Motion Artifacts.

ArXiv Pub Date : 2024-12-24

Elisa Marchetto, Hannah Eichhorn, Daniel Gallichan, Julia A Schnabel, Melanie Ganz

{"title":"Agreement of Image Quality Metrics with Radiological Evaluation in the Presence of Motion Artifacts.","authors":"Elisa Marchetto, Hannah Eichhorn, Daniel Gallichan, Julia A Schnabel, Melanie Ganz","doi":"","DOIUrl":null,"url":null,"abstract":"Purpose: Reliable image quality assessment is crucial for evaluating new motion correction methods for magnetic resonance imaging. In this work, we compare the performance of commonly used reference-based and reference-free image quality metrics on a unique dataset with real motion artifacts. We further analyze the image quality metrics' robustness to typical pre-processing techniques.Methods: We compared five reference-based and five reference-free image quality metrics on data acquired with and without intentional motion (2D and 3D sequences). The metrics were recalculated seven times with varying pre-processing steps. The anonymized images were rated by radiologists and radiographers on a 1-5 Likert scale. Spearman correlation coefficients were computed to assess the relationship between image quality metrics and observer scores.Results: All reference-based image quality metrics showed strong correlation with observer assessments, with minor performance variations across sequences. Among reference-free metrics, Average Edge Strength offers the most promising results, as it consistently displayed stronger correlations across all sequences compared to the other reference-free metrics. Overall, the strongest correlation was achieved with percentile normalization and restricting the metric values to the skull-stripped brain region. In contrast, correlations were weaker when not applying any brain mask and using min-max or no normalization.Conclusion: Reference-based metrics reliably correlate with radiological evaluation across different sequences and datasets. Pre-processing steps, particularly normalization and brain masking, significantly influence the correlation values. Future research should focus on refining pre-processing techniques and exploring machine learning approaches for automated image quality evaluation.","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11703327/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ArXiv","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Reliable image quality assessment is crucial for evaluating new motion correction methods for magnetic resonance imaging. In this work, we compare the performance of commonly used reference-based and reference-free image quality metrics on a unique dataset with real motion artifacts. We further analyze the image quality metrics' robustness to typical pre-processing techniques.

Methods: We compared five reference-based and five reference-free image quality metrics on data acquired with and without intentional motion (2D and 3D sequences). The metrics were recalculated seven times with varying pre-processing steps. The anonymized images were rated by radiologists and radiographers on a 1-5 Likert scale. Spearman correlation coefficients were computed to assess the relationship between image quality metrics and observer scores.

Results: All reference-based image quality metrics showed strong correlation with observer assessments, with minor performance variations across sequences. Among reference-free metrics, Average Edge Strength offers the most promising results, as it consistently displayed stronger correlations across all sequences compared to the other reference-free metrics. Overall, the strongest correlation was achieved with percentile normalization and restricting the metric values to the skull-stripped brain region. In contrast, correlations were weaker when not applying any brain mask and using min-max or no normalization.

Conclusion: Reference-based metrics reliably correlate with radiological evaluation across different sequences and datasets. Pre-processing steps, particularly normalization and brain masking, significantly influence the correlation values. Future research should focus on refining pre-processing techniques and exploring machine learning approaches for automated image quality evaluation.

本刊更多论文

运动伪影存在下图像质量度量与放射学评价的一致性。

目的：可靠的图像质量评估是评估磁共振成像运动校正新方法的关键。在这项工作中，我们比较了常用的基于参考和无参考的图像质量指标在具有真实运动伪影的独特数据集上的性能。我们进一步分析了图像质量指标对典型预处理技术的鲁棒性。方法：我们比较了五种基于参考和五种无参考的图像质量指标在有和没有故意运动（2D和3D序列）的情况下获得的数据。使用不同的预处理步骤重新计算指标七次。放射科医生和放射技师按照1-5的李克特量表对这些匿名图像进行评分。计算Spearman相关系数来评估图像质量指标与观察者评分之间的关系。结果：所有基于参考的图像质量指标显示与观察者评估有很强的相关性，在序列之间有较小的性能变化。在无参考指标中，平均边缘强度提供了最有希望的结果，因为与其他无参考指标相比，它始终显示出所有序列之间更强的相关性。总体而言，最强的相关性是通过百分位数归一化实现的，并将度量值限制在去颅骨的大脑区域。相比之下，当不使用任何脑罩和使用最小最大值或不进行归一化时，相关性较弱。结论：基于参考的指标与不同序列和数据集的放射学评估可靠相关。预处理步骤，特别是归一化和脑掩蔽，显著影响相关值。未来的研究应该集中在改进预处理技术和探索自动图像质量评估的机器学习方法上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ArXiv

自引率

0.00%

发文量