Title: Comparing the performances of a fifty-four-year-old computer-based consultation to ChatGPT-4o
Authors: Elvan Burak Verdi, Oguz Akbilgic
Journal: Applied Clinical Informatics (Journal Article, published 2025-06-06)
DOI: 10.1055/a-2628-8408 (https://doi.org/10.1055/a-2628-8408)
Abstract
Objective: To evaluate and compare the diagnostic responses generated by two artificial intelligence models developed 54 years apart and to encourage physicians to explore the use of large language models (LLMs) like GPT-4o in clinical practice.
Methods: A clinical case of metabolic acidosis was presented to GPT-4o, and the model's diagnostic reasoning, data interpretation, and management recommendations were recorded. These outputs were then compared to the responses from Schwartz's 1970 AI model built with a decision-tree algorithm using Conversational Algebraic Language (CAL). Both models were given the same patient data to ensure a fair comparison.
Results: GPT-4o generated an advanced analysis of the patient's acid-base disturbance, correctly identifying likely causes and suggesting relevant diagnostic tests and treatments. It provided a detailed, narrative explanation of the metabolic acidosis. The 1970 CAL model, while correctly recognizing the metabolic acidosis and flagging implausible inputs, was constrained by its rule-based design. CAL offered only basic stepwise guidance and required sequential prompts for each data point, reflecting a limited capacity to handle complex or unanticipated information. GPT-4o, by contrast, integrated the data more holistically, although it occasionally ventured beyond the provided information.
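The stepwise, rule-based behavior attributed to the CAL system can be illustrated with a minimal decision-tree sketch. This is a hypothetical reconstruction for illustration only, not Schwartz's actual rules: the function name, thresholds, and plausibility bounds are assumptions, though the anion gap formula (Na − [Cl + HCO3]) and the pH/bicarbonate cutoffs are standard clinical conventions.

```python
def classify_acid_base(ph: float, hco3: float, na: float, cl: float) -> str:
    """Toy rule-based acid-base classifier in the spirit of a 1970s
    decision-tree consultation system. Illustrative only."""
    # Plausibility check, analogous to CAL flagging implausible inputs.
    if not 6.8 <= ph <= 7.8:
        return "implausible pH"
    # Standard anion gap: Na - (Cl + HCO3); normal roughly 8-12 mEq/L.
    anion_gap = na - (cl + hco3)
    # Metabolic acidosis: acidemic pH with a low bicarbonate.
    if ph < 7.35 and hco3 < 22:
        if anion_gap > 12:
            return "high anion gap metabolic acidosis"
        return "normal anion gap metabolic acidosis"
    return "no metabolic acidosis detected"
```

Each branch fires only on the specific values it was written to expect, which mirrors the limitation noted above: such a system must be fed each data point in sequence and cannot reason about inputs its rules never anticipated.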
Conclusion: This comparison illustrates substantial advances in AI capabilities over five decades. GPT-4o's performance demonstrates the transformative potential of modern LLMs in clinical decision-making, showcasing an ability to synthesize complex data and assist diagnosis without specialized training, while still requiring further validation, rigorous clinical trials, and adaptation to clinical contexts. Although innovative for its era and offering certain advantages over GPT-4o, the rule-based CAL system had technical limitations. Rather than declaring one system simply "better," this study offers perspective on how far AI in medicine has progressed while acknowledging that current AI tools remain supplements to, not replacements for, physician judgment.
About the journal:
ACI is the third Schattauer journal dealing with biomedical and health informatics. It complements our other journals, Methods of Information in Medicine and the Yearbook of Medical Informatics. With the Yearbook of Medical Informatics serving as the "Milestone" or state-of-the-art journal and Methods of Information in Medicine as the "Science and Research" journal of IMIA, ACI intends to be the "Practical" journal of IMIA.