NeuroGPT, evaluando ChatGPT: Diagnóstico y tratamiento de 72 pacientes neurológicos

Q4 Medicine

Neurologia Argentina Pub Date : 2024-07-01 DOI:10.1016/j.neuarg.2024.08.002

Alejandro Fernández Cabrera , Jesús García de Soto , Paula Santamaría Montero , Héctor Chinea García , Robustiano Pego Reigosa

{"title":"NeuroGPT, evaluando ChatGPT: Diagnóstico y tratamiento de 72 pacientes neurológicos","authors":"Alejandro Fernández Cabrera , Jesús García de Soto , Paula Santamaría Montero , Héctor Chinea García , Robustiano Pego Reigosa","doi":"10.1016/j.neuarg.2024.08.002","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><p>There has been a significant boom in the field of artificial intelligence in recent years, especially in terms of accessibility and its use in different areas. This study attempts to determine if an AI can diagnose neurology patients.</p></div><div><h3>Objective</h3><p>To evaluate the utility and accuracy of ChatGPT 3.5 as a tool for conducting patient history, diagnosis, and treatment in cases of neurological pathology.</p></div><div><h3>Materials and methods</h3><p>A descriptive qualitative observational study was conducted, without intervention in patients, focused on evaluating the utility and accuracy of ChatGPT 3.5 for taking patient history, diagnosis, and treatment in patients with neurological pathology. The information provided to the neurologist was entered into the language model. Subsequently, the questions determined by ChatGPT were asked, and the complete neurological examination was provided. ChatGPT's diagnosis was compared with that of two different neurologists. Recruitment took place from May 2022 to June 2023 in a neurology consultation at a medium-sized hospital in Spain.</p></div><div><h3>Results</h3><p>A total of 72 patients (median age 58.71 years and 55.6% female) were enrolled in this study. Complementary tests suggested by the AI were considered correct in 33.3% of cases. The accuracy of the AI's diagnosis was 44.4%, and treatment recommendations were correct in 37.5%. The diagnosis was checked by two different neurologists following the latest national and international Neurology guidelines. In most cases, the diagnosis between the two neurologists agreed, with a kappa coefficient of 0.94.</p></div><div><h3>Conclusions</h3><p>Although we are in an unprecedented era of advancement in the field of artificial intelligence, it does not seem that ChatGPT can currently replace the evaluation of a neurology specialist.</p></div>","PeriodicalId":39051,"journal":{"name":"Neurologia Argentina","volume":"16 3","pages":"Pages 136-141"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurologia Argentina","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S185300282400034X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction

There has been a significant boom in the field of artificial intelligence in recent years, especially in terms of accessibility and its use in different areas. This study attempts to determine if an AI can diagnose neurology patients.

Objective

To evaluate the utility and accuracy of ChatGPT 3.5 as a tool for conducting patient history, diagnosis, and treatment in cases of neurological pathology.

Materials and methods

A descriptive qualitative observational study was conducted, without intervention in patients, focused on evaluating the utility and accuracy of ChatGPT 3.5 for taking patient history, diagnosis, and treatment in patients with neurological pathology. The information provided to the neurologist was entered into the language model. Subsequently, the questions determined by ChatGPT were asked, and the complete neurological examination was provided. ChatGPT's diagnosis was compared with that of two different neurologists. Recruitment took place from May 2022 to June 2023 in a neurology consultation at a medium-sized hospital in Spain.

Results

A total of 72 patients (median age 58.71 years and 55.6% female) were enrolled in this study. Complementary tests suggested by the AI were considered correct in 33.3% of cases. The accuracy of the AI's diagnosis was 44.4%, and treatment recommendations were correct in 37.5%. The diagnosis was checked by two different neurologists following the latest national and international Neurology guidelines. In most cases, the diagnosis between the two neurologists agreed, with a kappa coefficient of 0.94.

Conclusions

Although we are in an unprecedented era of advancement in the field of artificial intelligence, it does not seem that ChatGPT can currently replace the evaluation of a neurology specialist.

查看原文本刊更多论文

NeuroGPT, 评估 ChatGPT：72 名神经病患者的诊断和治疗

导言近年来，人工智能领域蓬勃发展，尤其是在可访问性及其在不同领域的应用方面。本研究试图确定人工智能是否能诊断神经病学患者。目的评估 ChatGPT 3.5 作为神经病学病例的病史采集、诊断和治疗工具的实用性和准确性。材料和方法在不对患者进行干预的情况下，进行了一项描述性定性观察研究，重点评估 ChatGPT 3.5 在神经病学病例的病史采集、诊断和治疗方面的实用性和准确性。向神经科医生提供的信息被输入到语言模型中。随后，询问 ChatGPT 确定的问题，并提供完整的神经系统检查。ChatGPT 的诊断结果与两位不同神经科医生的诊断结果进行了比较。本研究于 2022 年 5 月至 2023 年 6 月在西班牙一家中型医院的神经科会诊中招募了 72 名患者（中位年龄 58.71 岁，55.6% 为女性）。33.3%的病例认为人工智能建议的辅助检查是正确的。人工智能诊断的准确率为 44.4%，治疗建议的正确率为 37.5%。诊断由两名不同的神经病学专家根据最新的国内和国际神经病学指南进行核对。在大多数情况下，两位神经科医生的诊断结果一致，卡帕系数为 0.94。结论虽然我们正处于人工智能领域前所未有的进步时代，但目前看来 ChatGPT 还不能取代神经科专家的评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neurologia Argentina Medicine-Neurology (clinical)

CiteScore

0.50

自引率

0.00%

发文量

期刊介绍： Neurología Argentina es la publicación oficial de la Sociedad Neurológica Argentina. Todos los artículos, publicados en español, son sometidos a un proceso de revisión sobre ciego por pares con la finalidad de ofrecer información original, relevante y de alta calidad que abarca todos los aspectos de la Neurología y la Neurociencia.