Alpay Duran, Anıl Demiröz, Oguz Çörtük, Bora Ok, Mustafa Özten, Sinem Eroğlu
{"title":"Human vs Machine: The Future of Decision-making in Plastic and Reconstructive Surgery.","authors":"Alpay Duran, Anıl Demiröz, Oguz Çörtük, Bora Ok, Mustafa Özten, Sinem Eroğlu","doi":"10.1093/asj/sjaf015","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI)-driven technologies offer transformative potential in plastic surgery, spanning pre-operative planning, surgical procedures, and post-operative care, with the promise of improved patient outcomes.</p><p><strong>Objectives: </strong>To compare the web-based ChatGPT-4o (omni; OpenAI, San Francisco, CA) and Gemini Advanced (Alphabet Inc., Mountain View, CA), focusing on their data upload feature and examining outcomes before and after exposure to CME articles, particularly regarding their efficacy relative to human participants.</p><p><strong>Methods: </strong>Participants and LLMs completed 22 multiple-choice questions to assess baseline knowledge of CME topics. Initially, both LLMs and participants answered without article access. In incognito mode, the LLMs repeated the tests over 6 days. After accessing the articles, responses from both LLMs and participants were extracted and analyzed.</p><p><strong>Results: </strong>There was a significant increase in mean scores after the article was read in the resident group, indicating a significant rise. In the LLM groups, the ChatGPT-4.o (omni) group showed no significant difference between pre- and post-article scores, but the Gemini Advanced group demonstrated a significant increase. It can be stated that the ChatGPT-4.o and Gemini Advanced groups have higher accuracy means compared to the resident group in both pre and post-article periods.</p><p><strong>Conclusions: </strong>The analysis between human participants and LLMs indicates promising implications for the incorporation of LLMs in medical education. As these models increase in sophistication, they offer the potential to serve as supplementary tools within traditional learning environments. This could aid in bridging the gap between theoretical knowledge and practical implementation.</p>","PeriodicalId":7728,"journal":{"name":"Aesthetic Surgery Journal","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aesthetic Surgery Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/asj/sjaf015","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Artificial intelligence (AI)-driven technologies offer transformative potential in plastic surgery, spanning pre-operative planning, surgical procedures, and post-operative care, with the promise of improved patient outcomes.
Objectives: To compare the web-based ChatGPT-4o (omni; OpenAI, San Francisco, CA) and Gemini Advanced (Alphabet Inc., Mountain View, CA), focusing on their data upload feature and examining outcomes before and after exposure to CME articles, particularly regarding their efficacy relative to human participants.
Methods: Participants and LLMs completed 22 multiple-choice questions to assess baseline knowledge of CME topics. Initially, both LLMs and participants answered without article access. In incognito mode, the LLMs repeated the tests over 6 days. After accessing the articles, responses from both LLMs and participants were extracted and analyzed.
Results: There was a significant increase in mean scores after the article was read in the resident group, indicating a significant rise. In the LLM groups, the ChatGPT-4.o (omni) group showed no significant difference between pre- and post-article scores, but the Gemini Advanced group demonstrated a significant increase. It can be stated that the ChatGPT-4.o and Gemini Advanced groups have higher accuracy means compared to the resident group in both pre and post-article periods.
Conclusions: The analysis between human participants and LLMs indicates promising implications for the incorporation of LLMs in medical education. As these models increase in sophistication, they offer the potential to serve as supplementary tools within traditional learning environments. This could aid in bridging the gap between theoretical knowledge and practical implementation.
期刊介绍:
Aesthetic Surgery Journal is a peer-reviewed international journal focusing on scientific developments and clinical techniques in aesthetic surgery. The official publication of The Aesthetic Society, ASJ is also the official English-language journal of many major international societies of plastic, aesthetic and reconstructive surgery representing South America, Central America, Europe, Asia, and the Middle East. It is also the official journal of the British Association of Aesthetic Plastic Surgeons, the Canadian Society for Aesthetic Plastic Surgery and The Rhinoplasty Society.