Development of an initial training and evaluation programme for manual lower limb muscle MRI segmentation.

IF 3.7 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

European Radiology Experimental Pub Date : 2024-07-25 DOI:10.1186/s41747-024-00475-9

Jasper M Morrow, Sachit Shah, Lara Cristiano, Matthew R B Evans, Carolynne M Doherty, Talal Alnaemi, Abeer Saab, Ahmed Emira, Uros Klickovic, Ahmed Hammam, Afnan Altuwaijri, Stephen Wastling, Mary M Reilly, Michael G Hanna, Tarek A Yousry, John S Thornton

{"title":"Development of an initial training and evaluation programme for manual lower limb muscle MRI segmentation.","authors":"Jasper M Morrow, Sachit Shah, Lara Cristiano, Matthew R B Evans, Carolynne M Doherty, Talal Alnaemi, Abeer Saab, Ahmed Emira, Uros Klickovic, Ahmed Hammam, Afnan Altuwaijri, Stephen Wastling, Mary M Reilly, Michael G Hanna, Tarek A Yousry, John S Thornton","doi":"10.1186/s41747-024-00475-9","DOIUrl":null,"url":null,"abstract":"Background: Magnetic resonance imaging (MRI) quantification of intramuscular fat accumulation is a responsive biomarker in neuromuscular diseases. Despite emergence of automated methods, manual muscle segmentation remains an essential foundation. We aimed to develop a training programme for new observers to demonstrate competence in lower limb muscle segmentation and establish reliability benchmarks for future human observers and machine learning segmentation packages.Methods: The learning phase of the training programme comprised a training manual, direct instruction, and eight lower limb MRI scans with reference standard large and small regions of interest (ROIs). The assessment phase used test-retest scans from two patients and two healthy controls. Interscan and interobserver reliability metrics were calculated to identify underperforming outliers and to determine competency benchmarks.Results: Three experienced observers undertook the assessment phase, whilst eight new observers completed the full training programme. Two of the new observers were identified as underperforming outliers, relating to variation in size or consistency of segmentations; six had interscan and interobserver reliability equivalent to those of experienced observers. The calculated benchmark for the Sørensen-Dice similarity coefficient between observers was greater than 0.87 and 0.92 for individual thigh and calf muscles, respectively. Interscan and interobserver reliability were significantly higher for large than small ROIs (all p < 0.001).Conclusions: We developed, implemented, and analysed the first formal training programme for manual lower limb muscle segmentation. Large ROI showed superior reliability to small ROI for fat fraction assessment.Relevance statement: Observers competent in lower limb muscle segmentation are critical to application of quantitative muscle MRI biomarkers in neuromuscular diseases. This study has established competency benchmarks for future human observers or automated segmentation methods.Key points: • Observers competent in muscle segmentation are critical for quantitative muscle MRI biomarkers. • A training programme for muscle segmentation was undertaken by eight new observers. • We established competency benchmarks for future human observers or automated segmentation methods.","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"8 1","pages":"85"},"PeriodicalIF":3.7000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11282017/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Radiology Experimental","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s41747-024-00475-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Magnetic resonance imaging (MRI) quantification of intramuscular fat accumulation is a responsive biomarker in neuromuscular diseases. Despite emergence of automated methods, manual muscle segmentation remains an essential foundation. We aimed to develop a training programme for new observers to demonstrate competence in lower limb muscle segmentation and establish reliability benchmarks for future human observers and machine learning segmentation packages.

Methods: The learning phase of the training programme comprised a training manual, direct instruction, and eight lower limb MRI scans with reference standard large and small regions of interest (ROIs). The assessment phase used test-retest scans from two patients and two healthy controls. Interscan and interobserver reliability metrics were calculated to identify underperforming outliers and to determine competency benchmarks.

Results: Three experienced observers undertook the assessment phase, whilst eight new observers completed the full training programme. Two of the new observers were identified as underperforming outliers, relating to variation in size or consistency of segmentations; six had interscan and interobserver reliability equivalent to those of experienced observers. The calculated benchmark for the Sørensen-Dice similarity coefficient between observers was greater than 0.87 and 0.92 for individual thigh and calf muscles, respectively. Interscan and interobserver reliability were significantly higher for large than small ROIs (all p < 0.001).

Conclusions: We developed, implemented, and analysed the first formal training programme for manual lower limb muscle segmentation. Large ROI showed superior reliability to small ROI for fat fraction assessment.

Relevance statement: Observers competent in lower limb muscle segmentation are critical to application of quantitative muscle MRI biomarkers in neuromuscular diseases. This study has established competency benchmarks for future human observers or automated segmentation methods.

Key points: • Observers competent in muscle segmentation are critical for quantitative muscle MRI biomarkers. • A training programme for muscle segmentation was undertaken by eight new observers. • We established competency benchmarks for future human observers or automated segmentation methods.

Abstract Image

查看原文本刊更多论文

为手动下肢肌肉核磁共振成像分段制定初步培训和评估计划。

背景：磁共振成像（MRI）对肌肉内脂肪堆积的量化是神经肌肉疾病的一种反应性生物标志物。尽管出现了自动化方法，但人工肌肉分割仍然是一项重要的基础工作。我们的目标是为新观察者制定一项培训计划，以展示下肢肌肉分割的能力，并为未来的人类观察者和机器学习分割软件包建立可靠性基准：培训计划的学习阶段包括培训手册、直接指导和八次下肢核磁共振成像扫描，并参考标准的大感兴趣区和小感兴趣区（ROI）。评估阶段使用两名患者和两名健康对照者的扫描结果进行重复测试。计算了扫描间和观察者间的可靠性指标，以识别表现不佳的异常值，并确定能力基准：三名经验丰富的观察者完成了评估阶段，八名新观察者完成了全部培训计划。其中两名新观察者被确定为表现不佳的异常值，与分割大小或一致性的变化有关；六名新观察者的扫描间和观察者间可靠性与经验丰富的观察者相当。计算得出的观察者之间的索伦森-戴斯相似系数基准分别大于 0.87 和 0.92（大腿肌肉和小腿肌肉）。大的 ROI 的扫描间可靠性和观察者间可靠性明显高于小的 ROI（所有 p 均为结论）：我们开发、实施并分析了首个手动下肢肌肉分割的正式培训计划。在脂肪分数评估方面，大ROI的可靠性优于小ROI：具备下肢肌肉分割能力的观察者对于定量肌肉磁共振成像生物标记在神经肌肉疾病中的应用至关重要。这项研究为未来的人类观察者或自动分割方法建立了能力基准：- 要点：具备肌肉分割能力的观察者对于肌肉磁共振成像定量生物标志物至关重要。- 八名新观察者参加了肌肉分割培训计划。- 我们为未来的人类观察者或自动分割方法制定了能力基准。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊