评估人工智能在减肥手术中的能力：ChatGPT-4和DALL·e3的识别和说明准确性研究

IF 2.9 3区医学 Q1 SURGERY

Obesity Surgery Pub Date : 2025-02-01 Epub Date: 2024-12-29 DOI:10.1007/s11695-024-07653-z

Mohammad Mahjoubi, Shahab Shahabi, Saba Sheikhbahaei, Amir Hossein Davarpanah Jazi

{"title":"评估人工智能在减肥手术中的能力：ChatGPT-4和DALL·e3的识别和说明准确性研究","authors":"Mohammad Mahjoubi, Shahab Shahabi, Saba Sheikhbahaei, Amir Hossein Davarpanah Jazi","doi":"10.1007/s11695-024-07653-z","DOIUrl":null,"url":null,"abstract":"Background: With the rise of artificial intelligence (AI) in medical education, tools like OpenAI's ChatGPT-4 and DALL·E 3 have potential applications in enhancing learning materials. This study aims to evaluate ChatGPT-4o's proficiency in recognizing bariatric surgical procedures from illustrations and assess DALL·E 3's effectiveness in generating accurate surgical illustrations.Methods: Illustrations of six bariatric surgical procedures (One Anastomosis Gastric Bypass, Roux-en-Y Gastric Bypass, Single Anastomosis Duodeno-Ileal Bypass with Sleeve Gastrectomy, Sleeve Gastrectomy, Biliopancreatic Diversion, and Adjustable Gastric Banding) were sourced from the IFSO Atlas of Metabolic and Bariatric Surgery. ChatGPT-4 was tasked with identifying each procedure based on these illustrations to evaluate its classification accuracy. Simultaneously, DALL·E 3 was prompted with the specific names of each procedure to generate corresponding medical illustrations.Results: ChatGPT-4 correctly identified only the Adjustable Gastric Banding illustration, misclassifying the other five procedures. DALL·E 3 failed to produce accurate illustrations for all six procedures.Conclusion: The study underscores the need for further evaluation of AI in bariatric surgery. Both ChatGPT-4 and DALL·E 3, while promising, have significant limitations in recognizing and generating accurate illustrations of bariatric surgical procedures. These findings call for continued research and development to make AI models suitable for medical education applications in bariatric surgery.","PeriodicalId":19460,"journal":{"name":"Obesity Surgery","volume":" ","pages":"638-641"},"PeriodicalIF":2.9000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating AI Capabilities in Bariatric Surgery: A Study on ChatGPT-4 and DALL·E 3's Recognition and Illustration Accuracy.\",\"authors\":\"Mohammad Mahjoubi, Shahab Shahabi, Saba Sheikhbahaei, Amir Hossein Davarpanah Jazi\",\"doi\":\"10.1007/s11695-024-07653-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: With the rise of artificial intelligence (AI) in medical education, tools like OpenAI's ChatGPT-4 and DALL·E 3 have potential applications in enhancing learning materials. This study aims to evaluate ChatGPT-4o's proficiency in recognizing bariatric surgical procedures from illustrations and assess DALL·E 3's effectiveness in generating accurate surgical illustrations.Methods: Illustrations of six bariatric surgical procedures (One Anastomosis Gastric Bypass, Roux-en-Y Gastric Bypass, Single Anastomosis Duodeno-Ileal Bypass with Sleeve Gastrectomy, Sleeve Gastrectomy, Biliopancreatic Diversion, and Adjustable Gastric Banding) were sourced from the IFSO Atlas of Metabolic and Bariatric Surgery. ChatGPT-4 was tasked with identifying each procedure based on these illustrations to evaluate its classification accuracy. Simultaneously, DALL·E 3 was prompted with the specific names of each procedure to generate corresponding medical illustrations.Results: ChatGPT-4 correctly identified only the Adjustable Gastric Banding illustration, misclassifying the other five procedures. DALL·E 3 failed to produce accurate illustrations for all six procedures.Conclusion: The study underscores the need for further evaluation of AI in bariatric surgery. Both ChatGPT-4 and DALL·E 3, while promising, have significant limitations in recognizing and generating accurate illustrations of bariatric surgical procedures. These findings call for continued research and development to make AI models suitable for medical education applications in bariatric surgery.\",\"PeriodicalId\":19460,\"journal\":{\"name\":\"Obesity Surgery\",\"volume\":\" \",\"pages\":\"638-641\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Obesity Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s11695-024-07653-z\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/29 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Obesity Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11695-024-07653-z","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/29 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"SURGERY","Score":null,"Total":0}

引用次数: 0

摘要

背景：随着人工智能（AI）在医学教育中的兴起，OpenAI的ChatGPT-4和DALL·E 3等工具在增强学习材料方面具有潜在的应用前景。本研究旨在评估chatgpt - 40在从插图中识别减肥手术过程中的熟练程度，并评估DALL·e3在生成准确手术插图方面的有效性。方法：来自IFSO代谢与减肥手术图谱的六种减肥手术（单吻合式胃分流术、Roux-en-Y胃分流术、单吻合式十二指肠回肠分流术联合袖胃切除术、袖胃切除术、胆道胰分流术和可调节胃束带）的图片。ChatGPT-4的任务是根据这些插图识别每个程序，以评估其分类准确性。同时，在DALL·e3中提示每个手术的具体名称，生成相应的医学插图。结果：ChatGPT-4仅正确识别了可调节胃束带图，对其他五种手术进行了错误分类。DALL·e3未能为所有六个程序生成准确的插图。结论：该研究强调了进一步评估人工智能在减肥手术中的必要性。ChatGPT-4和DALL·e3虽然很有前景，但在识别和生成减肥手术过程的准确插图方面存在重大局限性。这些发现需要继续研究和开发，使人工智能模型适用于减肥手术的医学教育应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Evaluating AI Capabilities in Bariatric Surgery: A Study on ChatGPT-4 and DALL·E 3's Recognition and Illustration Accuracy.

Background: With the rise of artificial intelligence (AI) in medical education, tools like OpenAI's ChatGPT-4 and DALL·E 3 have potential applications in enhancing learning materials. This study aims to evaluate ChatGPT-4o's proficiency in recognizing bariatric surgical procedures from illustrations and assess DALL·E 3's effectiveness in generating accurate surgical illustrations.

Methods: Illustrations of six bariatric surgical procedures (One Anastomosis Gastric Bypass, Roux-en-Y Gastric Bypass, Single Anastomosis Duodeno-Ileal Bypass with Sleeve Gastrectomy, Sleeve Gastrectomy, Biliopancreatic Diversion, and Adjustable Gastric Banding) were sourced from the IFSO Atlas of Metabolic and Bariatric Surgery. ChatGPT-4 was tasked with identifying each procedure based on these illustrations to evaluate its classification accuracy. Simultaneously, DALL·E 3 was prompted with the specific names of each procedure to generate corresponding medical illustrations.

Results: ChatGPT-4 correctly identified only the Adjustable Gastric Banding illustration, misclassifying the other five procedures. DALL·E 3 failed to produce accurate illustrations for all six procedures.

Conclusion: The study underscores the need for further evaluation of AI in bariatric surgery. Both ChatGPT-4 and DALL·E 3, while promising, have significant limitations in recognizing and generating accurate illustrations of bariatric surgical procedures. These findings call for continued research and development to make AI models suitable for medical education applications in bariatric surgery.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Obesity Surgery 医学-外科

CiteScore

5.80

自引率

24.10%

发文量

567

审稿时长

3-6 weeks

期刊介绍： Obesity Surgery is the official journal of the International Federation for the Surgery of Obesity and metabolic disorders (IFSO). A journal for bariatric/metabolic surgeons, Obesity Surgery provides an international, interdisciplinary forum for communicating the latest research, surgical and laparoscopic techniques, for treatment of massive obesity and metabolic disorders. Topics covered include original research, clinical reports, current status, guidelines, historical notes, invited commentaries, letters to the editor, medicolegal issues, meeting abstracts, modern surgery/technical innovations, new concepts, reviews, scholarly presentations and opinions. Obesity Surgery benefits surgeons performing obesity/metabolic surgery, general surgeons and surgical residents, endoscopists, anesthetists, support staff, nurses, dietitians, psychiatrists, psychologists, plastic surgeons, internists including endocrinologists and diabetologists, nutritional scientists, and those dealing with eating disorders.