Performance of a novel medical artificial intelligence large language model on supporting decision-making for emergency patients with suspected sepsis.
Sen Jiang, Xiandong Liu, Tong Liu, Yi Gu, Bo An, Chunxue Wang, Dongyang Zhao, Haitao Zhang, Lunxian Tang
{"title":"Performance of a novel medical artificial intelligence large language model on supporting decision-making for emergency patients with suspected sepsis.","authors":"Sen Jiang, Xiandong Liu, Tong Liu, Yi Gu, Bo An, Chunxue Wang, Dongyang Zhao, Haitao Zhang, Lunxian Tang","doi":"10.5847/wjem.j.1920-8642.2025.095","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Large language models (LLMs) are being explored for disease prediction and diagnosis; however, their efficacy for early sepsis identification in emergency departments (EDs) remains unexplored. This study aims to evaluate MedGo, a novel medical LLM, as a decision-support tool for clinicians managing patients with suspected sepsis.</p><p><strong>Methods: </strong>This retrospective study included anonymized medical records of 203 patients (mean age 79.9±10.2 years) with confirmed sepsis from a tertiary hospital ED between January 2023 and January 2024. MedGo performance across nine sepsis-related assessment tasks was compared with that of two junior (<3 years of experience) and two senior (>10 years of experience) ED physicians. Assessments were scored on a 5-point Likert scale for accuracy, comprehensiveness, readability, and case-analysis skills.</p><p><strong>Results: </strong>MedGo demonstrated diagnostic performance comparable to that of senior physicians across most metrics, achieving a median Likert score of 4 in accuracy, comprehensiveness, and readability. MedGo significantly outperformed junior physicians (<i>P</i><0.001 for accuracy and case-analysis skills). MedGo assistance significantly enhanced both junior (<i>P</i><0.001) and senior (<i>P</i><0.05) physicians' diagnostic accuracy. Notably, MedGo-assisted junior physicians achieved accuracy levels comparable to those of unassisted senior physicians. MedGo maintained consistent performance across varying sepsis severities.</p><p><strong>Conclusion: </strong>MedGo shows significant diagnostic efficacy for sepsis and effectively supports clinicians in the ED, particularly enhancing junior physicians' performance. Our study highlights the potential of MedGo as a valuable decision-support tool for sepsis management, paving the way for specialized sepsis AI models.</p>","PeriodicalId":23685,"journal":{"name":"World journal of emergency medicine","volume":"16 5","pages":"447-455"},"PeriodicalIF":3.2000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12444244/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World journal of emergency medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.5847/wjem.j.1920-8642.2025.095","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EMERGENCY MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Large language models (LLMs) are being explored for disease prediction and diagnosis; however, their efficacy for early sepsis identification in emergency departments (EDs) remains unexplored. This study aims to evaluate MedGo, a novel medical LLM, as a decision-support tool for clinicians managing patients with suspected sepsis.
Methods: This retrospective study included anonymized medical records of 203 patients (mean age 79.9±10.2 years) with confirmed sepsis from a tertiary hospital ED between January 2023 and January 2024. MedGo performance across nine sepsis-related assessment tasks was compared with that of two junior (<3 years of experience) and two senior (>10 years of experience) ED physicians. Assessments were scored on a 5-point Likert scale for accuracy, comprehensiveness, readability, and case-analysis skills.
Results: MedGo demonstrated diagnostic performance comparable to that of senior physicians across most metrics, achieving a median Likert score of 4 in accuracy, comprehensiveness, and readability. MedGo significantly outperformed junior physicians (P<0.001 for accuracy and case-analysis skills). MedGo assistance significantly enhanced both junior (P<0.001) and senior (P<0.05) physicians' diagnostic accuracy. Notably, MedGo-assisted junior physicians achieved accuracy levels comparable to those of unassisted senior physicians. MedGo maintained consistent performance across varying sepsis severities.
Conclusion: MedGo shows significant diagnostic efficacy for sepsis and effectively supports clinicians in the ED, particularly enhancing junior physicians' performance. Our study highlights the potential of MedGo as a valuable decision-support tool for sepsis management, paving the way for specialized sepsis AI models.
期刊介绍:
The journal will cover technical, clinical and bioengineering studies related to multidisciplinary specialties of emergency medicine, such as cardiopulmonary resuscitation, acute injury, out-of-hospital emergency medical service, intensive care, injury and disease prevention, disaster management, healthy policy and ethics, toxicology, and sudden illness, including cardiology, internal medicine, anesthesiology, orthopedics, and trauma care, and more. The journal also features basic science, special reports, case reports, board review questions, and more. Editorials and communications to the editor explore controversial issues and encourage further discussion by physicians dealing with emergency medicine.