Michal A. Mankowski, Ian S. Jaffe, Jingzhi Xu, Sunjae Bae, Eric K. Oermann, Yindalon Aphinyanaphongs, Mara A. McAdams-DeMarco, Bonnie E. Lonze, Babak J. Orandi, Darren Stewart, Macey Levan, Allan Massie, Sommer Gentry, Dorry L. Segev
{"title":"ChatGPT 解决复杂的肾移植案例:与人类受访者的比较研究","authors":"Michal A. Mankowski, Ian S. Jaffe, Jingzhi Xu, Sunjae Bae, Eric K. Oermann, Yindalon Aphinyanaphongs, Mara A. McAdams-DeMarco, Bonnie E. Lonze, Babak J. Orandi, Darren Stewart, Macey Levan, Allan Massie, Sommer Gentry, Dorry L. Segev","doi":"10.1111/ctr.15466","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Introduction</h3>\n \n <p>ChatGPT has shown the ability to answer clinical questions in general medicine but may be constrained by the specialized nature of kidney transplantation. Thus, it is important to explore how ChatGPT can be used in kidney transplantation and how its knowledge compares to human respondents.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We prompted ChatGPT versions 3.5, 4, and 4 Visual (4 V) with 12 multiple-choice questions related to six kidney transplant cases from 2013 to 2015 American Society of Nephrology (ASN) fellowship program quizzes. We compared the performance of ChatGPT with US nephrology fellowship program directors, nephrology fellows, and the audience of the ASN's annual Kidney Week meeting.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Overall, ChatGPT 4 V correctly answered 10 out of 12 questions, showing a performance level comparable to nephrology fellows (group majority correctly answered 9 of 12 questions) and training program directors (11 of 12). This surpassed ChatGPT 4 (7 of 12 correct) and 3.5 (5 of 12). All three ChatGPT versions failed to correctly answer questions where the consensus among human respondents was low.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>Each iterative version of ChatGPT performed better than the prior version, with version 4 V achieving performance on par with nephrology fellows and training program directors. While it shows promise in understanding and answering kidney transplantation questions, ChatGPT should be seen as a complementary tool to human expertise rather than a replacement.</p>\n </section>\n </div>","PeriodicalId":10467,"journal":{"name":"Clinical Transplantation","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ChatGPT Solving Complex Kidney Transplant Cases: A Comparative Study With Human Respondents\",\"authors\":\"Michal A. Mankowski, Ian S. Jaffe, Jingzhi Xu, Sunjae Bae, Eric K. Oermann, Yindalon Aphinyanaphongs, Mara A. McAdams-DeMarco, Bonnie E. Lonze, Babak J. Orandi, Darren Stewart, Macey Levan, Allan Massie, Sommer Gentry, Dorry L. Segev\",\"doi\":\"10.1111/ctr.15466\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Introduction</h3>\\n \\n <p>ChatGPT has shown the ability to answer clinical questions in general medicine but may be constrained by the specialized nature of kidney transplantation. Thus, it is important to explore how ChatGPT can be used in kidney transplantation and how its knowledge compares to human respondents.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>We prompted ChatGPT versions 3.5, 4, and 4 Visual (4 V) with 12 multiple-choice questions related to six kidney transplant cases from 2013 to 2015 American Society of Nephrology (ASN) fellowship program quizzes. We compared the performance of ChatGPT with US nephrology fellowship program directors, nephrology fellows, and the audience of the ASN's annual Kidney Week meeting.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>Overall, ChatGPT 4 V correctly answered 10 out of 12 questions, showing a performance level comparable to nephrology fellows (group majority correctly answered 9 of 12 questions) and training program directors (11 of 12). This surpassed ChatGPT 4 (7 of 12 correct) and 3.5 (5 of 12). All three ChatGPT versions failed to correctly answer questions where the consensus among human respondents was low.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusion</h3>\\n \\n <p>Each iterative version of ChatGPT performed better than the prior version, with version 4 V achieving performance on par with nephrology fellows and training program directors. While it shows promise in understanding and answering kidney transplantation questions, ChatGPT should be seen as a complementary tool to human expertise rather than a replacement.</p>\\n </section>\\n </div>\",\"PeriodicalId\":10467,\"journal\":{\"name\":\"Clinical Transplantation\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical Transplantation\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/ctr.15466\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Transplantation","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/ctr.15466","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SURGERY","Score":null,"Total":0}
ChatGPT Solving Complex Kidney Transplant Cases: A Comparative Study With Human Respondents
Introduction
ChatGPT has shown the ability to answer clinical questions in general medicine but may be constrained by the specialized nature of kidney transplantation. Thus, it is important to explore how ChatGPT can be used in kidney transplantation and how its knowledge compares to human respondents.
Methods
We prompted ChatGPT versions 3.5, 4, and 4 Visual (4 V) with 12 multiple-choice questions related to six kidney transplant cases from 2013 to 2015 American Society of Nephrology (ASN) fellowship program quizzes. We compared the performance of ChatGPT with US nephrology fellowship program directors, nephrology fellows, and the audience of the ASN's annual Kidney Week meeting.
Results
Overall, ChatGPT 4 V correctly answered 10 out of 12 questions, showing a performance level comparable to nephrology fellows (group majority correctly answered 9 of 12 questions) and training program directors (11 of 12). This surpassed ChatGPT 4 (7 of 12 correct) and 3.5 (5 of 12). All three ChatGPT versions failed to correctly answer questions where the consensus among human respondents was low.
Conclusion
Each iterative version of ChatGPT performed better than the prior version, with version 4 V achieving performance on par with nephrology fellows and training program directors. While it shows promise in understanding and answering kidney transplantation questions, ChatGPT should be seen as a complementary tool to human expertise rather than a replacement.
期刊介绍:
Clinical Transplantation: The Journal of Clinical and Translational Research aims to serve as a channel of rapid communication for all those involved in the care of patients who require, or have had, organ or tissue transplants, including: kidney, intestine, liver, pancreas, islets, heart, heart valves, lung, bone marrow, cornea, skin, bone, and cartilage, viable or stored.
Published monthly, Clinical Transplantation’s scope is focused on the complete spectrum of present transplant therapies, as well as also those that are experimental or may become possible in future. Topics include:
Immunology and immunosuppression;
Patient preparation;
Social, ethical, and psychological issues;
Complications, short- and long-term results;
Artificial organs;
Donation and preservation of organ and tissue;
Translational studies;
Advances in tissue typing;
Updates on transplant pathology;.
Clinical and translational studies are particularly welcome, as well as focused reviews. Full-length papers and short communications are invited. Clinical reviews are encouraged, as well as seminal papers in basic science which might lead to immediate clinical application. Prominence is regularly given to the results of cooperative surveys conducted by the organ and tissue transplant registries.
Clinical Transplantation: The Journal of Clinical and Translational Research is essential reading for clinicians and researchers in the diverse field of transplantation: surgeons; clinical immunologists; cryobiologists; hematologists; gastroenterologists; hepatologists; pulmonologists; nephrologists; cardiologists; and endocrinologists. It will also be of interest to sociologists, psychologists, research workers, and to all health professionals whose combined efforts will improve the prognosis of transplant recipients.