Eren Çamur , Turay Cesur , Yasin Celal Güneş , Yusuf Öztürk , Yunus Şerefettin , Ersin Doğanözü , Ayşegül Akçebe , Ahmet Kürşad Güneş , İbrahim Ethem Cakcak
{"title":"Evaluating Large Language Models for imaging modality selection: Potential to reduce unnecessary contrast agent use and radiation exposure","authors":"Eren Çamur , Turay Cesur , Yasin Celal Güneş , Yusuf Öztürk , Yunus Şerefettin , Ersin Doğanözü , Ayşegül Akçebe , Ahmet Kürşad Güneş , İbrahim Ethem Cakcak","doi":"10.1016/j.clinimag.2025.110573","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>Large Language Models (LLMs) represent a transformative leap in artificial intelligence with the potential to revolutionize radiologic decision-making. This study uniquely evaluates the performance of various LLMs from different vendors in selecting appropriate imaging modalities and comparing their responses with those of clinicians across different specialties and radiologists with different experience levels.</div></div><div><h3>Methods</h3><div>In a cross-sectional experimental design, 120 clinical scenarios derived from ACR AC and 120 “Multifaceted practice-oriented clinical scenarios” (including breast, cardiac, gastrointestinal, musculoskeletal, neuro, thoracic, genitourinary, vascular sections) were assessed using three different prompts. The performance of four LLMs from different vendors were evaluated and compared with four clinicians (emergency physician, cardiologist, internist and general surgeon) and four radiologists with different experience level. Also, the performances of LLMs contrast agent use and X-ray-containing imaging modality selection were evaluated. The responses were categorized according to ACR AC. Short and long-term reproducibility were assessed in the same clinical scenarios.</div></div><div><h3>Results</h3><div>All LLMs yielded identical modality recommendations across the three distinct prompts (κ = 1). In ACR clinical scenarios, DeepSeek-R1 identified the appropriate imaging modality in 98.3 % of cases, achieving superior accuracy without inter-model differences (p > 0.006). In realistic scenarios, DeepSeek-R1 again led, matching board-certified junior radiologist performance and exceeding clinician and resident performance. The short-term reproducibility ranged from κ = 0.773 to 0.886, with long-term reproducibility spanning κ = 0.507 to 0.787.</div></div><div><h3>Discussion</h3><div>This study underscores that LLMs have remarkable potential for selecting appropriate imaging modalities for different clinical scenarios related to various sections and their valuable contributions as supportive tools in clinical practice in this field.</div></div>","PeriodicalId":50680,"journal":{"name":"Clinical Imaging","volume":"125 ","pages":"Article 110573"},"PeriodicalIF":1.5000,"publicationDate":"2025-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Imaging","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0899707125001731","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction
Large Language Models (LLMs) represent a transformative leap in artificial intelligence with the potential to revolutionize radiologic decision-making. This study uniquely evaluates the performance of various LLMs from different vendors in selecting appropriate imaging modalities and comparing their responses with those of clinicians across different specialties and radiologists with different experience levels.
Methods
In a cross-sectional experimental design, 120 clinical scenarios derived from ACR AC and 120 “Multifaceted practice-oriented clinical scenarios” (including breast, cardiac, gastrointestinal, musculoskeletal, neuro, thoracic, genitourinary, vascular sections) were assessed using three different prompts. The performance of four LLMs from different vendors were evaluated and compared with four clinicians (emergency physician, cardiologist, internist and general surgeon) and four radiologists with different experience level. Also, the performances of LLMs contrast agent use and X-ray-containing imaging modality selection were evaluated. The responses were categorized according to ACR AC. Short and long-term reproducibility were assessed in the same clinical scenarios.
Results
All LLMs yielded identical modality recommendations across the three distinct prompts (κ = 1). In ACR clinical scenarios, DeepSeek-R1 identified the appropriate imaging modality in 98.3 % of cases, achieving superior accuracy without inter-model differences (p > 0.006). In realistic scenarios, DeepSeek-R1 again led, matching board-certified junior radiologist performance and exceeding clinician and resident performance. The short-term reproducibility ranged from κ = 0.773 to 0.886, with long-term reproducibility spanning κ = 0.507 to 0.787.
Discussion
This study underscores that LLMs have remarkable potential for selecting appropriate imaging modalities for different clinical scenarios related to various sections and their valuable contributions as supportive tools in clinical practice in this field.
期刊介绍:
The mission of Clinical Imaging is to publish, in a timely manner, the very best radiology research from the United States and around the world with special attention to the impact of medical imaging on patient care. The journal''s publications cover all imaging modalities, radiology issues related to patients, policy and practice improvements, and clinically-oriented imaging physics and informatics. The journal is a valuable resource for practicing radiologists, radiologists-in-training and other clinicians with an interest in imaging. Papers are carefully peer-reviewed and selected by our experienced subject editors who are leading experts spanning the range of imaging sub-specialties, which include:
-Body Imaging-
Breast Imaging-
Cardiothoracic Imaging-
Imaging Physics and Informatics-
Molecular Imaging and Nuclear Medicine-
Musculoskeletal and Emergency Imaging-
Neuroradiology-
Practice, Policy & Education-
Pediatric Imaging-
Vascular and Interventional Radiology