AI-Assisted Lung Sliding Detection in Point-of-Care Ultrasound by Marine Corps Corpsmen: A Multi-Reader Study.

Q3 Medicine

Journal of special operations medicine : a peer reviewed journal for SOF medical professionals Pub Date : 2026-04-03 DOI:10.55460/J.Spec.Oper.Med.2026.1SDN-NWTW

Melissa Cote, Ross Prager, Khoa Tran, Nicolas Orozco, Delaney Smith, Zoe Holliday, Robert Arntfield

{"title":"AI-Assisted Lung Sliding Detection in Point-of-Care Ultrasound by Marine Corps Corpsmen: A Multi-Reader Study.","authors":"Melissa Cote, Ross Prager, Khoa Tran, Nicolas Orozco, Delaney Smith, Zoe Holliday, Robert Arntfield","doi":"10.55460/J.Spec.Oper.Med.2026.1SDN-NWTW","DOIUrl":null,"url":null,"abstract":"Background: Artificial intelligence (AI) has the potential to address training limitations and inter-operator variability that constrain the use of lung ultrasound (LUS) in austere and prehospital settings. This pilot study evaluated whether AI-based decision support could improve the diagnostic accuracy and confidence of United States Marine Corps Corpsmen in identifying absent lung sliding, a key indicator of pneumothorax, during LUS interpretation.Methods: This pilot-prospective multi-reader, multi-case study involved five military medics, all novices in point-of-care ultrasound, each interpreting 50 de-identified LUS video clips twice, once without AI assistance (control) and once with AI assistance (ATLAS, Deep Breathe Inc., London, Canada), in randomized order with at least a 2-hour washout between sessions. Expert consensus served as a reference standard. Diagnostic performance was assessed using area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, and accuracy. Differences were analyzed using the Random-Reader Random-Case method. Per-clip reader confidence ratings were compared using the Stuart-Maxwell test.Results: AI assistance significantly improved diagnostic performance across all measured outcomes. The mean AUROC increased from 0.72 (SD 0.16) without AI to 0.93 (SD 0.04) with AI (P=.03). Sensitivity rose from 0.63 (SD 0.14) to 0.90 (SD 0.09), specificity from 0.70 (SD 0.15) to 0.86 (SD 0.10), and overall accuracy from 0.67 (SD 0.10) to 0.88 (0.06) (McNemar's test, P<.001). Reader confidence also improved, with high-confidence ratings nearly doubling from 20% to 37%, and low-confidence ratings decreasing from 38% to 33%. These distributional changes were statistically significant (Stuart-Maxwell χ², P<.001).Conclusion: AI support markedly improved the diagnostic accuracy and confidence of novice LUS interpretation for detecting absent lung sliding. These findings suggest that real-time AI-based decision support may help improve access to high-quality LUS in military and other resource-limited care settings.","PeriodicalId":53630,"journal":{"name":"Journal of special operations medicine : a peer reviewed journal for SOF medical professionals","volume":" ","pages":"57-0"},"PeriodicalIF":0.0000,"publicationDate":"2026-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of special operations medicine : a peer reviewed journal for SOF medical professionals","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55460/J.Spec.Oper.Med.2026.1SDN-NWTW","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Artificial intelligence (AI) has the potential to address training limitations and inter-operator variability that constrain the use of lung ultrasound (LUS) in austere and prehospital settings. This pilot study evaluated whether AI-based decision support could improve the diagnostic accuracy and confidence of United States Marine Corps Corpsmen in identifying absent lung sliding, a key indicator of pneumothorax, during LUS interpretation.

Methods: This pilot-prospective multi-reader, multi-case study involved five military medics, all novices in point-of-care ultrasound, each interpreting 50 de-identified LUS video clips twice, once without AI assistance (control) and once with AI assistance (ATLAS, Deep Breathe Inc., London, Canada), in randomized order with at least a 2-hour washout between sessions. Expert consensus served as a reference standard. Diagnostic performance was assessed using area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, and accuracy. Differences were analyzed using the Random-Reader Random-Case method. Per-clip reader confidence ratings were compared using the Stuart-Maxwell test.

Results: AI assistance significantly improved diagnostic performance across all measured outcomes. The mean AUROC increased from 0.72 (SD 0.16) without AI to 0.93 (SD 0.04) with AI (P=.03). Sensitivity rose from 0.63 (SD 0.14) to 0.90 (SD 0.09), specificity from 0.70 (SD 0.15) to 0.86 (SD 0.10), and overall accuracy from 0.67 (SD 0.10) to 0.88 (0.06) (McNemar's test, P<.001). Reader confidence also improved, with high-confidence ratings nearly doubling from 20% to 37%, and low-confidence ratings decreasing from 38% to 33%. These distributional changes were statistically significant (Stuart-Maxwell χ², P<.001).

Conclusion: AI support markedly improved the diagnostic accuracy and confidence of novice LUS interpretation for detecting absent lung sliding. These findings suggest that real-time AI-based decision support may help improve access to high-quality LUS in military and other resource-limited care settings.

查看原文本刊更多论文

海军陆战队士兵在护理点超声中人工智能辅助肺滑动检测：一项多阅读器研究。

背景：人工智能（AI）有潜力解决训练限制和操作员之间的差异，这些差异限制了在严峻和院前环境中使用肺部超声（LUS）。这项初步研究评估了基于人工智能的决策支持是否可以提高美国海军陆战队士兵在LUS解释期间识别无肺滑动（气胸的一个关键指标）的诊断准确性和信心。方法：这项前瞻性多读者、多病例研究涉及5名军事医务人员，他们都是急诊超声的新手，每人两次解读50个去识别的LUS视频片段，一次没有人工智能辅助（控制），一次有人工智能辅助（ATLAS, Deep Breathe Inc.，伦敦，加拿大），随机顺序，两次之间至少有2小时的洗脱。专家共识作为参考标准。采用受试者工作特征曲线下面积（AUROC）、敏感性、特异性和准确性评估诊断效果。使用Random-Reader Random-Case方法分析差异。使用斯图尔特-麦克斯韦测试比较每个片段的读者信心评级。结果：人工智能辅助显着提高了所有测量结果的诊断性能。平均AUROC由未加人工智能的0.72 （SD 0.16）增加到加人工智能的0.93 (SD 0.04) （P= 0.03）。灵敏度从0.63 （SD 0.14）上升到0.90 (SD 0.09)，特异度从0.70 （SD 0.15）上升到0.86 (SD 0.10)，总体准确度从0.67 （SD 0.10）上升到0.88 (0.06)（McNemar’s test, p）。结论：人工智能支持显著提高了新手LUS诊断肺缺失滑动的准确性和置信度。这些发现表明，基于实时人工智能的决策支持可能有助于在军事和其他资源有限的护理环境中改善获得高质量LUS的机会。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of special operations medicine : a peer reviewed journal for SOF medical professionals Medicine-Medicine (all)

CiteScore

1.30

自引率

0.00%

发文量