Zeynep Bendella , Barbara Daria Wichtmann , Ralf Clauberg , Vera C. Keil , Nils C. Lehnen , Robert Haase , Laura C. Sáez , Isabella C. Wiest , Jakob Nikolas Kather , Christoph Endler , Alexander Radbruch , Daniel Paech , Katerina Deike
{"title":"与委员会认证的神经放射学家相比,Chat GPT-4在MRI方案选择方面显示出高度的一致性。","authors":"Zeynep Bendella , Barbara Daria Wichtmann , Ralf Clauberg , Vera C. Keil , Nils C. Lehnen , Robert Haase , Laura C. Sáez , Isabella C. Wiest , Jakob Nikolas Kather , Christoph Endler , Alexander Radbruch , Daniel Paech , Katerina Deike","doi":"10.1016/j.ejrad.2025.112416","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>The aim of this study was to determine whether ChatGPT-4 can correctly suggest MRI protocols and additional MRI sequences based on real-world Radiology Request Forms (RRFs) as well as to investigate the ability of ChatGPT-4 to suggest time saving protocols.</div></div><div><h3>Material & methods</h3><div>Retrospectively, 1,001 RRFs of our Department of Neuroradiology (in-house dataset), 200 RRFs of an independent Department of General Radiology (independent dataset) and 300 RRFs from an external, foreign Department of Neuroradiology (external dataset) were included. Patients’ age, sex, and clinical information were extracted from the RRFs and used to prompt ChatGPT- 4 to choose an adequate MRI protocol from predefined institutional lists. Four independent raters then assessed its performance. Additionally, ChatGPT-4 was tasked with creating case-specific protocols aimed at saving time.</div></div><div><h3>Results</h3><div>Two and 7 of 1,001 protocol suggestions of ChatGPT-4 were rated “unacceptable” in the in-house dataset for reader 1 and 2, respectively. No protocol suggestions were rated “unacceptable” in both the independent and external dataset. When assessing the inter-reader agreement, Coheńs weighted ĸ ranged from 0.88 to 0.98 (each p < 0.001).</div><div>ChatGPT-4′s freely composed protocols were approved in 766/1,001 (76.5 %) and 140/300 (46.67 %) cases of the in-house and external dataset with mean time savings (standard deviation) of 3:51 (minutes:seconds) (±2:40) minutes and 2:59 (±3:42) minutes per adopted in-house and external MRI protocol.</div></div><div><h3>Conclusion</h3><div>ChatGPT-4 demonstrated a very high agreement with board-certified (neuro-)radiologists in selecting MRI protocols and was able to suggest approved time saving protocols from the set of available sequences.</div></div>","PeriodicalId":12063,"journal":{"name":"European Journal of Radiology","volume":"193 ","pages":"Article 112416"},"PeriodicalIF":3.3000,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Chat GPT-4 shows high agreement in MRI protocol selection compared to board-certified neuroradiologists\",\"authors\":\"Zeynep Bendella , Barbara Daria Wichtmann , Ralf Clauberg , Vera C. Keil , Nils C. Lehnen , Robert Haase , Laura C. Sáez , Isabella C. Wiest , Jakob Nikolas Kather , Christoph Endler , Alexander Radbruch , Daniel Paech , Katerina Deike\",\"doi\":\"10.1016/j.ejrad.2025.112416\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objectives</h3><div>The aim of this study was to determine whether ChatGPT-4 can correctly suggest MRI protocols and additional MRI sequences based on real-world Radiology Request Forms (RRFs) as well as to investigate the ability of ChatGPT-4 to suggest time saving protocols.</div></div><div><h3>Material & methods</h3><div>Retrospectively, 1,001 RRFs of our Department of Neuroradiology (in-house dataset), 200 RRFs of an independent Department of General Radiology (independent dataset) and 300 RRFs from an external, foreign Department of Neuroradiology (external dataset) were included. Patients’ age, sex, and clinical information were extracted from the RRFs and used to prompt ChatGPT- 4 to choose an adequate MRI protocol from predefined institutional lists. Four independent raters then assessed its performance. Additionally, ChatGPT-4 was tasked with creating case-specific protocols aimed at saving time.</div></div><div><h3>Results</h3><div>Two and 7 of 1,001 protocol suggestions of ChatGPT-4 were rated “unacceptable” in the in-house dataset for reader 1 and 2, respectively. No protocol suggestions were rated “unacceptable” in both the independent and external dataset. When assessing the inter-reader agreement, Coheńs weighted ĸ ranged from 0.88 to 0.98 (each p < 0.001).</div><div>ChatGPT-4′s freely composed protocols were approved in 766/1,001 (76.5 %) and 140/300 (46.67 %) cases of the in-house and external dataset with mean time savings (standard deviation) of 3:51 (minutes:seconds) (±2:40) minutes and 2:59 (±3:42) minutes per adopted in-house and external MRI protocol.</div></div><div><h3>Conclusion</h3><div>ChatGPT-4 demonstrated a very high agreement with board-certified (neuro-)radiologists in selecting MRI protocols and was able to suggest approved time saving protocols from the set of available sequences.</div></div>\",\"PeriodicalId\":12063,\"journal\":{\"name\":\"European Journal of Radiology\",\"volume\":\"193 \",\"pages\":\"Article 112416\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Radiology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0720048X25005029\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0720048X25005029","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Chat GPT-4 shows high agreement in MRI protocol selection compared to board-certified neuroradiologists
Objectives
The aim of this study was to determine whether ChatGPT-4 can correctly suggest MRI protocols and additional MRI sequences based on real-world Radiology Request Forms (RRFs) as well as to investigate the ability of ChatGPT-4 to suggest time saving protocols.
Material & methods
Retrospectively, 1,001 RRFs of our Department of Neuroradiology (in-house dataset), 200 RRFs of an independent Department of General Radiology (independent dataset) and 300 RRFs from an external, foreign Department of Neuroradiology (external dataset) were included. Patients’ age, sex, and clinical information were extracted from the RRFs and used to prompt ChatGPT- 4 to choose an adequate MRI protocol from predefined institutional lists. Four independent raters then assessed its performance. Additionally, ChatGPT-4 was tasked with creating case-specific protocols aimed at saving time.
Results
Two and 7 of 1,001 protocol suggestions of ChatGPT-4 were rated “unacceptable” in the in-house dataset for reader 1 and 2, respectively. No protocol suggestions were rated “unacceptable” in both the independent and external dataset. When assessing the inter-reader agreement, Coheńs weighted ĸ ranged from 0.88 to 0.98 (each p < 0.001).
ChatGPT-4′s freely composed protocols were approved in 766/1,001 (76.5 %) and 140/300 (46.67 %) cases of the in-house and external dataset with mean time savings (standard deviation) of 3:51 (minutes:seconds) (±2:40) minutes and 2:59 (±3:42) minutes per adopted in-house and external MRI protocol.
Conclusion
ChatGPT-4 demonstrated a very high agreement with board-certified (neuro-)radiologists in selecting MRI protocols and was able to suggest approved time saving protocols from the set of available sequences.
期刊介绍:
European Journal of Radiology is an international journal which aims to communicate to its readers, state-of-the-art information on imaging developments in the form of high quality original research articles and timely reviews on current developments in the field.
Its audience includes clinicians at all levels of training including radiology trainees, newly qualified imaging specialists and the experienced radiologist. Its aim is to inform efficient, appropriate and evidence-based imaging practice to the benefit of patients worldwide.