Performance of ChatGPT 3.5 and 4 as a tool for patient support before and after DBS surgery for Parkinson's disease.

IF 2.7 · Zone 4 (Medicine) · Q2 Clinical Neurology

Neurological Sciences Pub Date : 2024-12-01 Epub Date: 2024-08-29 DOI:10.1007/s10072-024-07732-0

Ana Lúcia Oliveira, Miguel Coelho, Leonor Correia Guedes, Maria Begoña Cattoni, Herculano Carvalho, Pedro Duarte-Batista

Pages: 5757-5764. Publication type: Journal Article. Full-text PDF (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11554841/pdf/

Citations: 0

Abstract

Deep brain stimulation (DBS) is a neurosurgical procedure in which electrodes are implanted into specific areas of the brain to treat a variety of medical conditions, including Parkinson's disease. Patients' doubts and questions before or after surgery should be addressed in line with the most recent scientific evidence and clinical practice. ChatGPT is one example of how artificial intelligence can be used for this purpose, given its ability to comprehend medical questions and answer them in an understandable way that is accessible to everyone. However, the risks of these resources are not yet fully understood. Responses from ChatGPT models 3.5 and 4 to 40 questions, posed in English and Portuguese, were independently graded by two experienced specialists in functional neurosurgery and neurological movement disorders, with disagreements resolved by a third reviewer. ChatGPT 3.5 and 4 demonstrated a good level of accuracy across the 80 prompts (40 questions in each of the two languages) related to DBS surgery for Parkinson's disease. The proportion of responses graded as correct was 57.5% for GPT 3.5 and 83.8% for GPT 4. GPT 3.5 provided potentially harmful answers in 6.3% (5/80) of its responses. No responses from GPT 4 were graded as harmful. In general, ChatGPT 3.5 and 4 demonstrated good performance in terms of quality and reliability across the two languages. Nonetheless, harmful responses should not be dismissed, and it is crucial to consider this risk when counseling patients who use these resources. Given the current safety concerns, it is not advisable for patients to use such models for DBS surgery guidance.
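The reported proportions can be cross-checked with simple arithmetic. The sketch below is illustrative only: the denominator of 80 responses per model is taken from the "5/80" figure in the abstract, while the whole-number counts of correct responses (46 and 67) are not stated in the abstract and are inferred here from the published percentages. The function names are hypothetical helpers.

```python
def implied_count(percent: float, total: int) -> int:
    """Whole-number count closest to `percent` percent of `total`."""
    return round(percent / 100 * total)


def as_percent(count: int, total: int) -> float:
    """Share of `count` in `total`, expressed as a percentage."""
    return 100 * count / total


# Responses graded per model; taken from the "5/80" figure in the abstract.
TOTAL = 80

# Harmful responses for GPT 3.5: 5 of 80 is 6.25%, reported as 6.3%.
harmful_gpt35 = as_percent(5, TOTAL)

# Counts of correct responses are NOT given in the abstract; these are
# inferred from the reported 57.5% and 83.8%.
correct_gpt35 = implied_count(57.5, TOTAL)
correct_gpt4 = implied_count(83.8, TOTAL)

if __name__ == "__main__":
    print(f"GPT 3.5 harmful: {harmful_gpt35}%")
    print(f"GPT 3.5 correct (inferred): {correct_gpt35}/{TOTAL}")
    print(f"GPT 4 correct (inferred): {correct_gpt4}/{TOTAL} "
          f"= {as_percent(correct_gpt4, TOTAL)}%")
```

Under these assumptions the figures are mutually consistent: 46/80 is exactly 57.5%, and 67/80 is 83.75%, which matches the reported 83.8% after rounding.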




Source journal

Neurological Sciences (Medicine: Clinical Neurology)

CiteScore: 6.10

Self-citation rate: 3.00%

Articles published: 743

Review time: 4 months

Journal description: Neurological Sciences is intended to provide a medium for the communication of results and ideas in the field of neuroscience. The journal welcomes contributions in both the basic and clinical aspects of the neurosciences. The official language of the journal is English. Reports are published in the form of original articles, short communications, editorials, reviews and letters to the editor. Original articles present the results of experimental or clinical studies in the neurosciences, while short communications are succinct reports permitting the rapid publication of novel results. Original contributions may be submitted for the special sections History of Neurology, Health Care and Neurological Digressions, a forum for cultural topics related to the neurosciences. The journal also publishes correspondence, book reviews, meeting reports and announcements.