Evaluation of AI-Based Chatbots in Liver Cancer Information Dissemination: A Comparative Analysis of GPT, DeepSeek, Copilot, and Gemini.

IF 2.5 3区医学 Q3 ONCOLOGY

Oncology Pub Date : 2025-06-10 DOI:10.1159/000546726

Mustafa Karaagac, Sedat Carkit

{"title":"Evaluation of AI-Based Chatbots in Liver Cancer Information Dissemination: A Comparative Analysis of GPT, DeepSeek, Copilot, and Gemini.","authors":"Mustafa Karaagac, Sedat Carkit","doi":"10.1159/000546726","DOIUrl":null,"url":null,"abstract":"Background/objectives: This study aimed to evaluate AI-based chatbots (GPT, DeepSeek, Copilot, Gemini) in disseminating information on liver cancer, emphasizing content quality, adherence to established guidelines, and ease of comprehension.Methods: Between January and February 2025, four chatbots were examined us-ing publicly accessible free versions lacking independent reasoning capabilities. Three frequently searched Google Trends questions (\"What is liver cancer awareness?\", \"What are the symptoms of liver cancer?\", and \"Is liver cancer treatable?\") were posed. Their responses were assessed via the DISCERN instrument, Cole-man-Liau Index, Patient Education Materials Assessment Tool for Print, and alignment with American Asso-ciation for the Study of Liver Diseases, National Comprehensive Cancer Network, and European Society for Medical Oncology recommendations. Statistical analysis was performed using SPSS 22.Results: All chatbots largely provided relevant and impartial information. GPT and DeepSeek scored lower on specifying infor-mation sources and update timelines, whereas Copilot omitted local therapies (e.g., Radiofrequency Ablation, Transarterial Chemoembolization, Transarterial Radioembolization), resulting in reduced scientific accuracy. Gemini and Copilot performed better in \"Understandability,\" while GPT and DeepSeek excelled in \"Actiona-bility.\" Although GPT demonstrated consistency across multiple treatment options, it did not explicitly refer-ence international guidelines. Study limitations included language constraints, variations in chatbot updates, and reliance on a single inquiry round.Conclusions: AI chatbots show potential as initial informational tools for liver cancer but cannot replace professional medical consultation. In complex diseases requiring multidis-ciplinary management, frequent guideline-based updates, expert validation, and diverse data sources are critical to enhancing clinical relevance and patient outcomes.","PeriodicalId":19497,"journal":{"name":"Oncology","volume":" ","pages":"1-15"},"PeriodicalIF":2.5000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1159/000546726","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background/objectives: This study aimed to evaluate AI-based chatbots (GPT, DeepSeek, Copilot, Gemini) in disseminating information on liver cancer, emphasizing content quality, adherence to established guidelines, and ease of comprehension.

Methods: Between January and February 2025, four chatbots were examined us-ing publicly accessible free versions lacking independent reasoning capabilities. Three frequently searched Google Trends questions ("What is liver cancer awareness?", "What are the symptoms of liver cancer?", and "Is liver cancer treatable?") were posed. Their responses were assessed via the DISCERN instrument, Cole-man-Liau Index, Patient Education Materials Assessment Tool for Print, and alignment with American Asso-ciation for the Study of Liver Diseases, National Comprehensive Cancer Network, and European Society for Medical Oncology recommendations. Statistical analysis was performed using SPSS 22.

Results: All chatbots largely provided relevant and impartial information. GPT and DeepSeek scored lower on specifying infor-mation sources and update timelines, whereas Copilot omitted local therapies (e.g., Radiofrequency Ablation, Transarterial Chemoembolization, Transarterial Radioembolization), resulting in reduced scientific accuracy. Gemini and Copilot performed better in "Understandability," while GPT and DeepSeek excelled in "Actiona-bility." Although GPT demonstrated consistency across multiple treatment options, it did not explicitly refer-ence international guidelines. Study limitations included language constraints, variations in chatbot updates, and reliance on a single inquiry round.

Conclusions: AI chatbots show potential as initial informational tools for liver cancer but cannot replace professional medical consultation. In complex diseases requiring multidis-ciplinary management, frequent guideline-based updates, expert validation, and diverse data sources are critical to enhancing clinical relevance and patient outcomes.

查看原文本刊更多论文

基于人工智能的肝癌信息传播聊天机器人评价：GPT、DeepSeek、Copilot和Gemini的比较分析

背景/目的：本研究旨在评估基于人工智能的聊天机器人（GPT、DeepSeek、Copilot、Gemini）在传播肝癌信息、强调内容质量、遵守既定指南和易于理解方面的能力。方法：在2025年1月至2月期间，使用缺乏独立推理能力的公开免费版本对四个聊天机器人进行了测试。谷歌Trends提出了三个搜索频率最高的问题（“什么是肝癌意识？”、“肝癌的症状是什么？”和“肝癌可以治疗吗？”）。他们的反应通过DISCERN仪器、Cole-man-Liau指数、患者教育材料评估工具进行评估，并与美国肝病研究协会、国家综合癌症网络和欧洲医学肿瘤学会的建议保持一致。采用SPSS 22进行统计学分析。结果：所有聊天机器人基本上都提供了相关和公正的信息。GPT和DeepSeek在指定信息源和更新时间表方面得分较低，而Copilot省略了局部治疗（例如，射频消融、经动脉化疗栓塞、经动脉放射栓塞），导致科学准确性降低。Gemini和Copilot在“理解能力”方面表现较好，而GPT和DeepSeek在“行动能力”方面表现较好。虽然GPT在多种治疗方案中表现出一致性，但它没有明确参考国际指南。研究的局限性包括语言限制、聊天机器人更新的变化以及对单一查询轮的依赖。结论：人工智能聊天机器人有潜力作为肝癌的初步信息工具，但不能取代专业的医疗咨询。在需要多学科管理的复杂疾病中，频繁的基于指南的更新、专家验证和多样化的数据来源对于提高临床相关性和患者预后至关重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Oncology 医学-肿瘤学

CiteScore

6.00

自引率

2.90%

发文量

审稿时长

6-12 weeks

期刊介绍： Although laboratory and clinical cancer research need to be closely linked, observations at the basic level often remain removed from medical applications. This journal works to accelerate the translation of experimental results into the clinic, and back again into the laboratory for further investigation. The fundamental purpose of this effort is to advance clinically-relevant knowledge of cancer, and improve the outcome of prevention, diagnosis and treatment of malignant disease. The journal publishes significant clinical studies from cancer programs around the world, along with important translational laboratory findings, mini-reviews (invited and submitted) and in-depth discussions of evolving and controversial topics in the oncology arena. A unique feature of the journal is a new section which focuses on rapid peer-review and subsequent publication of short reports of phase 1 and phase 2 clinical cancer trials, with a goal of insuring that high-quality clinical cancer research quickly enters the public domain, regardless of the trial’s ultimate conclusions regarding efficacy or toxicity.