Mohammad Mahjoubi, Shahab Shahabi, Saba Sheikhbahaei, Amir Hossein Davarpanah Jazi
{"title":"评估人工智能在减肥手术中的能力:ChatGPT-4和DALL·e3的识别和说明准确性研究","authors":"Mohammad Mahjoubi, Shahab Shahabi, Saba Sheikhbahaei, Amir Hossein Davarpanah Jazi","doi":"10.1007/s11695-024-07653-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>With the rise of artificial intelligence (AI) in medical education, tools like OpenAI's ChatGPT-4 and DALL·E 3 have potential applications in enhancing learning materials. This study aims to evaluate ChatGPT-4o's proficiency in recognizing bariatric surgical procedures from illustrations and assess DALL·E 3's effectiveness in generating accurate surgical illustrations.</p><p><strong>Methods: </strong>Illustrations of six bariatric surgical procedures (One Anastomosis Gastric Bypass, Roux-en-Y Gastric Bypass, Single Anastomosis Duodeno-Ileal Bypass with Sleeve Gastrectomy, Sleeve Gastrectomy, Biliopancreatic Diversion, and Adjustable Gastric Banding) were sourced from the IFSO Atlas of Metabolic and Bariatric Surgery. ChatGPT-4 was tasked with identifying each procedure based on these illustrations to evaluate its classification accuracy. Simultaneously, DALL·E 3 was prompted with the specific names of each procedure to generate corresponding medical illustrations.</p><p><strong>Results: </strong>ChatGPT-4 correctly identified only the Adjustable Gastric Banding illustration, misclassifying the other five procedures. DALL·E 3 failed to produce accurate illustrations for all six procedures.</p><p><strong>Conclusion: </strong>The study underscores the need for further evaluation of AI in bariatric surgery. Both ChatGPT-4 and DALL·E 3, while promising, have significant limitations in recognizing and generating accurate illustrations of bariatric surgical procedures. These findings call for continued research and development to make AI models suitable for medical education applications in bariatric surgery.</p>","PeriodicalId":19460,"journal":{"name":"Obesity Surgery","volume":" ","pages":"638-641"},"PeriodicalIF":2.9000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating AI Capabilities in Bariatric Surgery: A Study on ChatGPT-4 and DALL·E 3's Recognition and Illustration Accuracy.\",\"authors\":\"Mohammad Mahjoubi, Shahab Shahabi, Saba Sheikhbahaei, Amir Hossein Davarpanah Jazi\",\"doi\":\"10.1007/s11695-024-07653-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>With the rise of artificial intelligence (AI) in medical education, tools like OpenAI's ChatGPT-4 and DALL·E 3 have potential applications in enhancing learning materials. This study aims to evaluate ChatGPT-4o's proficiency in recognizing bariatric surgical procedures from illustrations and assess DALL·E 3's effectiveness in generating accurate surgical illustrations.</p><p><strong>Methods: </strong>Illustrations of six bariatric surgical procedures (One Anastomosis Gastric Bypass, Roux-en-Y Gastric Bypass, Single Anastomosis Duodeno-Ileal Bypass with Sleeve Gastrectomy, Sleeve Gastrectomy, Biliopancreatic Diversion, and Adjustable Gastric Banding) were sourced from the IFSO Atlas of Metabolic and Bariatric Surgery. ChatGPT-4 was tasked with identifying each procedure based on these illustrations to evaluate its classification accuracy. Simultaneously, DALL·E 3 was prompted with the specific names of each procedure to generate corresponding medical illustrations.</p><p><strong>Results: </strong>ChatGPT-4 correctly identified only the Adjustable Gastric Banding illustration, misclassifying the other five procedures. DALL·E 3 failed to produce accurate illustrations for all six procedures.</p><p><strong>Conclusion: </strong>The study underscores the need for further evaluation of AI in bariatric surgery. Both ChatGPT-4 and DALL·E 3, while promising, have significant limitations in recognizing and generating accurate illustrations of bariatric surgical procedures. These findings call for continued research and development to make AI models suitable for medical education applications in bariatric surgery.</p>\",\"PeriodicalId\":19460,\"journal\":{\"name\":\"Obesity Surgery\",\"volume\":\" \",\"pages\":\"638-641\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Obesity Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s11695-024-07653-z\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/29 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Obesity Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11695-024-07653-z","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/29 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"SURGERY","Score":null,"Total":0}
Evaluating AI Capabilities in Bariatric Surgery: A Study on ChatGPT-4 and DALL·E 3's Recognition and Illustration Accuracy.
Background: With the rise of artificial intelligence (AI) in medical education, tools like OpenAI's ChatGPT-4 and DALL·E 3 have potential applications in enhancing learning materials. This study aims to evaluate ChatGPT-4o's proficiency in recognizing bariatric surgical procedures from illustrations and assess DALL·E 3's effectiveness in generating accurate surgical illustrations.
Methods: Illustrations of six bariatric surgical procedures (One Anastomosis Gastric Bypass, Roux-en-Y Gastric Bypass, Single Anastomosis Duodeno-Ileal Bypass with Sleeve Gastrectomy, Sleeve Gastrectomy, Biliopancreatic Diversion, and Adjustable Gastric Banding) were sourced from the IFSO Atlas of Metabolic and Bariatric Surgery. ChatGPT-4 was tasked with identifying each procedure based on these illustrations to evaluate its classification accuracy. Simultaneously, DALL·E 3 was prompted with the specific names of each procedure to generate corresponding medical illustrations.
Results: ChatGPT-4 correctly identified only the Adjustable Gastric Banding illustration, misclassifying the other five procedures. DALL·E 3 failed to produce accurate illustrations for all six procedures.
Conclusion: The study underscores the need for further evaluation of AI in bariatric surgery. Both ChatGPT-4 and DALL·E 3, while promising, have significant limitations in recognizing and generating accurate illustrations of bariatric surgical procedures. These findings call for continued research and development to make AI models suitable for medical education applications in bariatric surgery.
期刊介绍:
Obesity Surgery is the official journal of the International Federation for the Surgery of Obesity and metabolic disorders (IFSO). A journal for bariatric/metabolic surgeons, Obesity Surgery provides an international, interdisciplinary forum for communicating the latest research, surgical and laparoscopic techniques, for treatment of massive obesity and metabolic disorders. Topics covered include original research, clinical reports, current status, guidelines, historical notes, invited commentaries, letters to the editor, medicolegal issues, meeting abstracts, modern surgery/technical innovations, new concepts, reviews, scholarly presentations and opinions.
Obesity Surgery benefits surgeons performing obesity/metabolic surgery, general surgeons and surgical residents, endoscopists, anesthetists, support staff, nurses, dietitians, psychiatrists, psychologists, plastic surgeons, internists including endocrinologists and diabetologists, nutritional scientists, and those dealing with eating disorders.