Generative artificial intelligence-mediated counselling on first aid for seizures: The performance of publicly available chatbot versus its customised version

IF 2.3 | Zone 3, Medicine | JCR Q2, Behavioral Sciences

Epilepsy & Behavior Pub Date : 2025-08-27 DOI:10.1016/j.yebeh.2025.110680

Alexei A. Birkun , Yekaterina Kosova , Anton Rudenko

Epilepsy & Behavior, Volume 171, Article 110680. Journal Article. https://www.sciencedirect.com/science/article/pii/S1525505025004202


Abstract

Background

The potential application of cutting-edge generative artificial intelligence chatbots as emergency consultants is attracting growing attention. This study aimed to analyse the quality of advice on first aid for seizures generated by a commercially developed chatbot in comparison with its customised version.

Methods

The baseline version of ChatGPT (model GPT-4o) and the same chatbot customised using a specialised knowledge base and prompt engineering were tested in four scenarios mimicking bystander requests for instructions on how to help a victim with seizures. The scenarios included ongoing seizures and postictal states, with or without consciousness and breathing. A checklist-based evaluation was conducted.

Results

In total, 120 user-to-chatbot dialogues were generated (2 chatbots × 15 dialogues × 4 scenarios). The baseline chatbot always failed to consider the victim’s state, including whether the seizures were continuing and whether a victim in the postictal period was conscious and breathing normally. Its advice was non-selective and inaccurate, with frequent omissions of key first aid recommendations and suggestions of inadequate measures. The customised chatbot-generated guidance was consistently tailored to the victim’s condition, significantly more precise and completely safe. Depending on the scenario, the mean percentage of chatbot responses that fulfilled the checklist items was 14–49% for the baseline chatbot and 77–92% for the customised version (p ≤ 0.039).

Conclusions

Whereas the publicly available version of the chatbot is not acceptable for first aid counselling, its expert-informed customisation ensures high accuracy and safety of generated advice. Further research in this field is advisable.
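The study design and the headline metric reported in the Results can be illustrated with a minimal sketch. It is not the authors' analysis code: the scenario labels, the function names, and the toy checklist data are hypothetical, and the metric is assumed to be the per-item percentage of dialogues fulfilling a checklist item, averaged across items.

```python
from itertools import product
from statistics import mean

# Design as stated in the abstract: 2 chatbots x 4 scenarios x 15 dialogues.
CHATBOTS = ["baseline", "customised"]
SCENARIOS = [1, 2, 3, 4]          # hypothetical labels; the abstract does not name them
DIALOGUES_PER_CELL = 15


def design():
    """Enumerate every (chatbot, scenario, run) dialogue in the design."""
    runs = range(1, DIALOGUES_PER_CELL + 1)
    return list(product(CHATBOTS, SCENARIOS, runs))


def mean_fulfilment(responses):
    """Mean percentage of responses fulfilling the checklist items.

    `responses` holds one list of booleans per dialogue, one flag per
    checklist item. Assumed interpretation: compute the percentage of
    dialogues that fulfil each item, then average over the items.
    """
    n_items = len(responses[0])
    per_item = [
        100 * sum(r[i] for r in responses) / len(responses)
        for i in range(n_items)
    ]
    return mean(per_item)


# Toy data only (4 dialogues, 3 checklist items); not taken from the study.
toy = [
    [True, False, True],
    [True, False, False],
    [True, True, False],
    [True, False, False],
]

print(len(design()))         # total number of dialogues in the design
print(mean_fulfilment(toy))  # mean fulfilment percentage for the toy data
```

Under this reading, a score would be computed separately for each chatbot and scenario, which matches the per-scenario ranges quoted in the abstract.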


Source journal

Epilepsy & Behavior (Medicine: Behavioral Sciences)

CiteScore: 5.40

Self-citation rate: 15.40%

Articles published: 385

Review time: 43 days

Journal description: Epilepsy & Behavior is the fastest-growing international journal uniquely devoted to the rapid dissemination of the most current information available on the behavioral aspects of seizures and epilepsy. Epilepsy & Behavior presents original peer-reviewed articles based on laboratory and clinical research. Topics are drawn from a variety of fields, including clinical neurology, neurosurgery, neuropsychiatry, neuropsychology, neurophysiology, neuropharmacology, and neuroimaging. In September 2012, Epilepsy & Behavior stopped accepting Case Reports for publication in the journal. Since that date, authors who submit Case Reports to Epilepsy & Behavior are offered a transfer or asked to resubmit them to its sister journal, Epilepsy & Behavior Case Reports.