通过检索增强型生成增强大规模语言模型改进急诊病人分诊。

IF 2.1 3区 医学 Q2 EMERGENCY MEDICINE
Megumi Yazaki, Satoshi Maki, Takeo Furuya, Ken Inoue, Ko Nagai, Yuki Nagashima, Juntaro Maruyama, Yasunori Toki, Kyota Kitagawa, Shuhei Iwata, Takaki Kitamura, Sho Gushiken, Yuji Noguchi, Masahiro Inoue, Yasuhiro Shiga, Kazuhide Inage, Sumihisa Orita, Takaaki Nakada, Seiji Ohtori
{"title":"通过检索增强型生成增强大规模语言模型改进急诊病人分诊。","authors":"Megumi Yazaki, Satoshi Maki, Takeo Furuya, Ken Inoue, Ko Nagai, Yuki Nagashima, Juntaro Maruyama, Yasunori Toki, Kyota Kitagawa, Shuhei Iwata, Takaki Kitamura, Sho Gushiken, Yuji Noguchi, Masahiro Inoue, Yasuhiro Shiga, Kazuhide Inage, Sumihisa Orita, Takaaki Nakada, Seiji Ohtori","doi":"10.1080/10903127.2024.2374400","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Emergency medical triage is crucial for prioritizing patient care in emergency situations, yet its effectiveness can vary significantly based on the experience and training of the personnel involved. This study aims to evaluate the efficacy of integrating Retrieval Augmented Generation (RAG) with Large Language Models (LLMs), specifically OpenAI's GPT models, to standardize triage procedures and reduce variability in emergency care.</p><p><strong>Methods: </strong>We created 100 simulated triage scenarios based on modified cases from the Japanese National Examination for Emergency Medical Technicians. These scenarios were processed by the RAG-enhanced LLMs, and the models were given patient vital signs, symptoms, and observations from emergency medical services (EMS) teams as inputs. The primary outcome was the accuracy of triage classifications, which was used to compare the performance of the RAG-enhanced LLMs with that of emergency medical technicians and emergency physicians. Secondary outcomes included the rates of under-triage and over-triage.</p><p><strong>Results: </strong>The Generative Pre-trained Transformer 3.5 (GPT-3.5) with RAG model achieved a correct triage rate of 70%, significantly outperforming Emergency Medical Technicians (EMTs) with 35% and 38% correct rates, and emergency physicians with 50% and 47% correct rates (<i>p</i> < 0.05). Additionally, this model demonstrated a substantial reduction in under-triage rates to 8%, compared with 33% for GPT-3.5 without RAG, and 39% for GPT-4 without RAG.</p><p><strong>Conclusions: </strong>The integration of RAG with LLMs shows promise in improving the accuracy and consistency of medical assessments in emergency settings. Further validation in diverse medical settings with broader datasets is necessary to confirm the effectiveness and adaptability of these technologies in live environments.</p>","PeriodicalId":20336,"journal":{"name":"Prehospital Emergency Care","volume":" ","pages":"1-7"},"PeriodicalIF":2.1000,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Emergency Patient Triage Improvement through a Retrieval-Augmented Generation Enhanced Large-Scale Language Model.\",\"authors\":\"Megumi Yazaki, Satoshi Maki, Takeo Furuya, Ken Inoue, Ko Nagai, Yuki Nagashima, Juntaro Maruyama, Yasunori Toki, Kyota Kitagawa, Shuhei Iwata, Takaki Kitamura, Sho Gushiken, Yuji Noguchi, Masahiro Inoue, Yasuhiro Shiga, Kazuhide Inage, Sumihisa Orita, Takaaki Nakada, Seiji Ohtori\",\"doi\":\"10.1080/10903127.2024.2374400\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>Emergency medical triage is crucial for prioritizing patient care in emergency situations, yet its effectiveness can vary significantly based on the experience and training of the personnel involved. This study aims to evaluate the efficacy of integrating Retrieval Augmented Generation (RAG) with Large Language Models (LLMs), specifically OpenAI's GPT models, to standardize triage procedures and reduce variability in emergency care.</p><p><strong>Methods: </strong>We created 100 simulated triage scenarios based on modified cases from the Japanese National Examination for Emergency Medical Technicians. These scenarios were processed by the RAG-enhanced LLMs, and the models were given patient vital signs, symptoms, and observations from emergency medical services (EMS) teams as inputs. The primary outcome was the accuracy of triage classifications, which was used to compare the performance of the RAG-enhanced LLMs with that of emergency medical technicians and emergency physicians. Secondary outcomes included the rates of under-triage and over-triage.</p><p><strong>Results: </strong>The Generative Pre-trained Transformer 3.5 (GPT-3.5) with RAG model achieved a correct triage rate of 70%, significantly outperforming Emergency Medical Technicians (EMTs) with 35% and 38% correct rates, and emergency physicians with 50% and 47% correct rates (<i>p</i> < 0.05). Additionally, this model demonstrated a substantial reduction in under-triage rates to 8%, compared with 33% for GPT-3.5 without RAG, and 39% for GPT-4 without RAG.</p><p><strong>Conclusions: </strong>The integration of RAG with LLMs shows promise in improving the accuracy and consistency of medical assessments in emergency settings. Further validation in diverse medical settings with broader datasets is necessary to confirm the effectiveness and adaptability of these technologies in live environments.</p>\",\"PeriodicalId\":20336,\"journal\":{\"name\":\"Prehospital Emergency Care\",\"volume\":\" \",\"pages\":\"1-7\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Prehospital Emergency Care\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1080/10903127.2024.2374400\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"EMERGENCY MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Prehospital Emergency Care","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/10903127.2024.2374400","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"EMERGENCY MEDICINE","Score":null,"Total":0}
引用次数: 0

摘要

目的:紧急医疗分诊对于在紧急情况下优先考虑病人护理至关重要,但其效果会因相关人员的经验和培训而有很大差异。本研究旨在评估将检索增强生成(RAG)与大型语言模型(LLM)(特别是 OpenAI 的 GPT 模型)相结合,以规范分诊程序并减少急诊护理中的差异的效果:我们根据日本国家紧急医疗技术人员考试的修改案例创建了 100 个模拟分诊场景。这些情景由 RAG 增强型 LLMs 处理,模型以患者生命体征、症状和紧急医疗服务 (EMS) 团队的观察结果作为输入。主要结果是分诊分类的准确性,用于比较 RAG 增强型 LLM 与急诊医疗技术人员和急诊医生的表现。次要结果包括分诊不足率和分诊过度率:带有 RAG 的生成预训练转换器 3.5 (GPT-3.5) 模型的正确分诊率达到 70%,明显优于急救医疗技术人员 (EMT) 35% 和 38% 的正确率,以及急诊医生 50% 和 47% 的正确率(P < 0.05)。此外,该模型还大大降低了误诊率,误诊率仅为 8%,而没有 RAG 的 GPT-3.5 误诊率为 33%,没有 RAG 的 GPT-4 误诊率为 39%:结论:将 RAG 与 LLMs 相结合,有望提高急诊医疗评估的准确性和一致性。有必要在不同的医疗环境中使用更广泛的数据集进行进一步验证,以确认这些技术在现场环境中的有效性和适应性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Emergency Patient Triage Improvement through a Retrieval-Augmented Generation Enhanced Large-Scale Language Model.

Objectives: Emergency medical triage is crucial for prioritizing patient care in emergency situations, yet its effectiveness can vary significantly based on the experience and training of the personnel involved. This study aims to evaluate the efficacy of integrating Retrieval Augmented Generation (RAG) with Large Language Models (LLMs), specifically OpenAI's GPT models, to standardize triage procedures and reduce variability in emergency care.

Methods: We created 100 simulated triage scenarios based on modified cases from the Japanese National Examination for Emergency Medical Technicians. These scenarios were processed by the RAG-enhanced LLMs, and the models were given patient vital signs, symptoms, and observations from emergency medical services (EMS) teams as inputs. The primary outcome was the accuracy of triage classifications, which was used to compare the performance of the RAG-enhanced LLMs with that of emergency medical technicians and emergency physicians. Secondary outcomes included the rates of under-triage and over-triage.

Results: The Generative Pre-trained Transformer 3.5 (GPT-3.5) with RAG model achieved a correct triage rate of 70%, significantly outperforming Emergency Medical Technicians (EMTs) with 35% and 38% correct rates, and emergency physicians with 50% and 47% correct rates (p < 0.05). Additionally, this model demonstrated a substantial reduction in under-triage rates to 8%, compared with 33% for GPT-3.5 without RAG, and 39% for GPT-4 without RAG.

Conclusions: The integration of RAG with LLMs shows promise in improving the accuracy and consistency of medical assessments in emergency settings. Further validation in diverse medical settings with broader datasets is necessary to confirm the effectiveness and adaptability of these technologies in live environments.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Prehospital Emergency Care
Prehospital Emergency Care 医学-公共卫生、环境卫生与职业卫生
CiteScore
4.30
自引率
12.50%
发文量
137
审稿时长
1 months
期刊介绍: Prehospital Emergency Care publishes peer-reviewed information relevant to the practice, educational advancement, and investigation of prehospital emergency care, including the following types of articles: Special Contributions - Original Articles - Education and Practice - Preliminary Reports - Case Conferences - Position Papers - Collective Reviews - Editorials - Letters to the Editor - Media Reviews.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信