MIDRC mRALE Mastermind Grand Challenge: AI to predict COVID severity on chest radiographs.

IF 1.7 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Journal of Medical Imaging Pub Date : 2025-03-01 Epub Date: 2025-04-18 DOI:10.1117/1.JMI.12.2.024505

Samuel G Armato, Karen Drukker, Lubomir Hadjiiski, Carol C Wu, Jayashree Kalpathy-Cramer, George Shih, Maryellen L Giger, Natalie Baughan, Benjamin Bearce, Adam E Flanders, Robyn L Ball, Kyle J Myers, Heather M Whitney, The Midrc Grand Challenge Working Group

{"title":"MIDRC mRALE Mastermind Grand Challenge: AI to predict COVID severity on chest radiographs.","authors":"Samuel G Armato, Karen Drukker, Lubomir Hadjiiski, Carol C Wu, Jayashree Kalpathy-Cramer, George Shih, Maryellen L Giger, Natalie Baughan, Benjamin Bearce, Adam E Flanders, Robyn L Ball, Kyle J Myers, Heather M Whitney, The Midrc Grand Challenge Working Group","doi":"10.1117/1.JMI.12.2.024505","DOIUrl":null,"url":null,"abstract":"Purpose: The Medical Imaging and Data Resource Center (MIDRC) mRALE Mastermind Grand Challenge fostered the development of artificial intelligence (AI) techniques for the automated assignment of mRALE (modified radiographic assessment of lung edema) scores to portable chest radiographs from patients known to have COVID-19.Approach: The challenge utilized 2079 training cases obtained from the publicly available MIDRC data commons, with validation and test cases sampled from not-yet-public MIDRC cases that were inaccessible to challenge participants. The reference standard mRALE scores for the challenge cases were established by a pool of 22 radiologist annotators. Using the MedICI challenge platform, participants submitted their trained algorithms encapsulated in Docker containers. Algorithms were evaluated by the challenge organizers on 814 test cases through two performance assessment metrics: quadratic-weighted kappa and prediction probability concordance.Results: Nine AI algorithms were submitted to the challenge for assessment against the test set cases. The algorithm that demonstrated the highest agreement with the reference standard had a quadratic-weighted kappa of 0.885 and a prediction probability concordance of 0.875. Substantial variability in mRALE scores assigned by the annotators and output by the AI algorithms was observed.Conclusions: The MIDRC mRALE Mastermind Grand Challenge revealed the potential of AI to assess COVID-19 severity from portable CXRs, demonstrating promising performance against the reference standard. The observed variability in mRALE scores highlights the challenges in standardizing severity assessment. These findings contribute to ongoing efforts to develop AI technologies for potential use in clinical practice and offer insights for the enhancement of COVID-19 severity assessment.","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 2","pages":"024505"},"PeriodicalIF":1.7000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12014941/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JMI.12.2.024505","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/18 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: The Medical Imaging and Data Resource Center (MIDRC) mRALE Mastermind Grand Challenge fostered the development of artificial intelligence (AI) techniques for the automated assignment of mRALE (modified radiographic assessment of lung edema) scores to portable chest radiographs from patients known to have COVID-19.

Approach: The challenge utilized 2079 training cases obtained from the publicly available MIDRC data commons, with validation and test cases sampled from not-yet-public MIDRC cases that were inaccessible to challenge participants. The reference standard mRALE scores for the challenge cases were established by a pool of 22 radiologist annotators. Using the MedICI challenge platform, participants submitted their trained algorithms encapsulated in Docker containers. Algorithms were evaluated by the challenge organizers on 814 test cases through two performance assessment metrics: quadratic-weighted kappa and prediction probability concordance.

Results: Nine AI algorithms were submitted to the challenge for assessment against the test set cases. The algorithm that demonstrated the highest agreement with the reference standard had a quadratic-weighted kappa of 0.885 and a prediction probability concordance of 0.875. Substantial variability in mRALE scores assigned by the annotators and output by the AI algorithms was observed.

Conclusions: The MIDRC mRALE Mastermind Grand Challenge revealed the potential of AI to assess COVID-19 severity from portable CXRs, demonstrating promising performance against the reference standard. The observed variability in mRALE scores highlights the challenges in standardizing severity assessment. These findings contribute to ongoing efforts to develop AI technologies for potential use in clinical practice and offer insights for the enhancement of COVID-19 severity assessment.

查看原文本刊更多论文

MIDRC mRALE大师大挑战：人工智能预测胸片上的COVID严重程度。

目的：医学成像和数据资源中心（MIDRC） mRALE Mastermind大挑战促进了人工智能（AI）技术的发展，用于将mRALE（改进的肺水肿放射评估）评分自动分配给已知患有COVID-19的患者的便携式胸片。方法：挑战利用了从公开可用的MIDRC数据共享中获得的2079个培训案例，并从尚未公开的MIDRC案例中抽样验证和测试案例，这些案例对挑战参与者来说是不可访问的。挑战病例的参考标准mRALE分数由22名放射科医师注释者建立。使用MedICI挑战平台，参与者提交了封装在Docker容器中的训练算法。挑战赛组织者通过二次加权kappa和预测概率一致性两种性能评估指标对814个测试用例的算法进行了评估。结果：9个AI算法被提交到挑战中，针对测试集案例进行评估。与参考标准一致性最高的算法，二次加权kappa为0.885，预测概率一致性为0.875。观察到注释者分配的mRALE分数和AI算法输出的mRALE分数存在实质性差异。结论：MIDRC mRALE Mastermind大挑战揭示了人工智能在便携式cxr中评估COVID-19严重程度的潜力，在参考标准下显示出良好的性能。观察到的mRALE评分的可变性突出了标准化严重性评估的挑战。这些发现有助于持续努力开发人工智能技术，以用于临床实践，并为加强COVID-19严重程度评估提供见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.10

自引率

4.20%

发文量

期刊介绍： JMI covers fundamental and translational research, as well as applications, focused on medical imaging, which continue to yield physical and biomedical advancements in the early detection, diagnostics, and therapy of disease as well as in the understanding of normal. The scope of JMI includes: Imaging physics, Tomographic reconstruction algorithms (such as those in CT and MRI), Image processing and deep learning, Computer-aided diagnosis and quantitative image analysis, Visualization and modeling, Picture archiving and communications systems (PACS), Image perception and observer performance, Technology assessment, Ultrasonic imaging, Image-guided procedures, Digital pathology, Biomedical applications of biomedical imaging. JMI allows for the peer-reviewed communication and archiving of scientific developments, translational and clinical applications, reviews, and recommendations for the field.