ChatGPT-4o outperforms gemini advanced in assisting multidisciplinary decision-making for advanced gastric cancer

IF 3.5 2区医学 Q2 ONCOLOGY

Ejso Pub Date : 2025-04-24 DOI:10.1016/j.ejso.2025.110096

Huizi Li , Jiaobao Huang , Kuntang Liu , Jibiao Liu , Queling Liu , Zhiyong Zhou , Zhen Zong , Shengxun Mao

{"title":"ChatGPT-4o outperforms gemini advanced in assisting multidisciplinary decision-making for advanced gastric cancer","authors":"Huizi Li , Jiaobao Huang , Kuntang Liu , Jibiao Liu , Queling Liu , Zhiyong Zhou , Zhen Zong , Shengxun Mao","doi":"10.1016/j.ejso.2025.110096","DOIUrl":null,"url":null,"abstract":"<div><h3>Background & aims</h3><div>The treatment of advanced gastric cancer (GC) requires precise and comprehensive clinical decision-making. Artificial intelligence (AI) chatbots offer potential tools to enhance multidisciplinary team (MDT) discussions. This study aims to compare the performances of ChatGPT-4o and Gemini Advanced in generating treatment recommendations for advanced GC.</div></div><div><h3>Methods</h3><div>The study involved three steps: (1) evaluating responses to ten critical clinical questions, (2) analyzing clinical cases from MDT meetings at our institution, and (3) reviewing rare GC cases from PubMed. It included 95 advanced GC patients discussed between November 2022 and July 2024, and 14 rare cases from PubMed. Prompts designed from advanced GC cases were submitted to ChatGPT-4o and Gemini Advanced using a standardized format. Outputs were evaluated for accuracy and completeness using a structured 4-point Likert scale. Interrater reliability was calculated to ensure consistency among evaluators.</div></div><div><h3>Results</h3><div>For the ten clinical questions, ChatGPT-4o achieved better performances compared to Gemini Advanced. In MDT cases, ChatGPT-4o provided more valuable recommendations in surgical suggestion, chemotherapy recommendation, and chemotherapy regimens. Subgroup analysis confirmed these findings in both routine and complex cases with high interrater reliability. ChatGPT-4o also outperformed Gemini Advanced in the analysis of rare GC cases from PubMed, showing superior accuracy with high interrater reliability.</div></div><div><h3>Conclusions</h3><div>While our findings suggest that AI chatbots can generate clinically relevant and guideline-based treatment recommendations, their use in MDT decision-making should be viewed as supportive rather than autonomous. We emphasize that while AI chatbots have potential as decision-support tools, but they should be integrated only under expert supervision in a real-world clinical context.</div></div>","PeriodicalId":11522,"journal":{"name":"Ejso","volume":"51 8","pages":"Article 110096"},"PeriodicalIF":3.5000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ejso","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0748798325005244","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background & aims

The treatment of advanced gastric cancer (GC) requires precise and comprehensive clinical decision-making. Artificial intelligence (AI) chatbots offer potential tools to enhance multidisciplinary team (MDT) discussions. This study aims to compare the performances of ChatGPT-4o and Gemini Advanced in generating treatment recommendations for advanced GC.

Methods

The study involved three steps: (1) evaluating responses to ten critical clinical questions, (2) analyzing clinical cases from MDT meetings at our institution, and (3) reviewing rare GC cases from PubMed. It included 95 advanced GC patients discussed between November 2022 and July 2024, and 14 rare cases from PubMed. Prompts designed from advanced GC cases were submitted to ChatGPT-4o and Gemini Advanced using a standardized format. Outputs were evaluated for accuracy and completeness using a structured 4-point Likert scale. Interrater reliability was calculated to ensure consistency among evaluators.

Results

For the ten clinical questions, ChatGPT-4o achieved better performances compared to Gemini Advanced. In MDT cases, ChatGPT-4o provided more valuable recommendations in surgical suggestion, chemotherapy recommendation, and chemotherapy regimens. Subgroup analysis confirmed these findings in both routine and complex cases with high interrater reliability. ChatGPT-4o also outperformed Gemini Advanced in the analysis of rare GC cases from PubMed, showing superior accuracy with high interrater reliability.

Conclusions

While our findings suggest that AI chatbots can generate clinically relevant and guideline-based treatment recommendations, their use in MDT decision-making should be viewed as supportive rather than autonomous. We emphasize that while AI chatbots have potential as decision-support tools, but they should be integrated only under expert supervision in a real-world clinical context.

查看原文本刊更多论文

chatgpt - 40在协助晚期胃癌的多学科决策方面优于gemini advanced

背景,目的晚期胃癌（GC）的治疗需要精确、全面的临床决策。人工智能（AI）聊天机器人为加强多学科团队（MDT）讨论提供了潜在的工具。本研究旨在比较chatgpt - 40和Gemini Advanced在为晚期GC提供治疗建议方面的性能。方法本研究分为三个步骤：(1)评估对10个关键临床问题的回答；(2)分析我院MDT会议的临床病例；(3)回顾PubMed上罕见的GC病例。该研究包括了2022年11月至2024年7月期间讨论的95例晚期胃癌患者，以及来自PubMed的14例罕见病例。根据高级GC案例设计的提示以标准化格式提交给chatgpt - 40和Gemini advanced。使用结构化的4点李克特量表评估输出的准确性和完整性。计算了评估者之间的信度以确保评估者之间的一致性。结果chatgpt - 40在10个临床问题上均优于Gemini Advanced。在MDT病例中，chatgpt - 40在手术建议、化疗建议、化疗方案等方面提供了更有价值的建议。亚组分析在常规病例和复杂病例中证实了这些发现，具有较高的相互可靠性。chatggt - 40在PubMed的罕见GC病例分析中也优于Gemini Advanced，显示出卓越的准确性和高的相互可靠性。虽然我们的研究结果表明，AI聊天机器人可以产生临床相关的和基于指南的治疗建议，但它们在MDT决策中的使用应被视为支持性的，而不是自主的。我们强调，虽然人工智能聊天机器人有潜力成为决策支持工具，但它们只应该在现实世界的临床环境中在专家监督下进行整合。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Ejso 医学-外科

CiteScore

6.40

自引率

2.60%

发文量

1148

审稿时长

41 days

期刊介绍： JSO - European Journal of Surgical Oncology ("the Journal of Cancer Surgery") is the Official Journal of the European Society of Surgical Oncology and BASO ~ the Association for Cancer Surgery. The EJSO aims to advance surgical oncology research and practice through the publication of original research articles, review articles, editorials, debates and correspondence.