用于儿科中风研究的 GPT:试点研究

IF 3.2 3区 医学 Q2 CLINICAL NEUROLOGY
Anna K. Fiedler BS , Kai Zhang PhD , Tia S. Lal BS , Xiaoqian Jiang PhD , Stuart M. Fraser MD
{"title":"用于儿科中风研究的 GPT:试点研究","authors":"Anna K. Fiedler BS ,&nbsp;Kai Zhang PhD ,&nbsp;Tia S. Lal BS ,&nbsp;Xiaoqian Jiang PhD ,&nbsp;Stuart M. Fraser MD","doi":"10.1016/j.pediatrneurol.2024.07.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Pediatric stroke is an important cause of morbidity in children. Although research can be challenging, large amounts of data have been captured through collaborative efforts in the International Pediatric Stroke Study (IPSS). This study explores the use of an advanced artificial intelligence program, the Generative Pre-trained Transformer (GPT), to enter pediatric stroke data into the IPSS.</p></div><div><h3>Methods</h3><p>The most recent 50 clinical notes of patients with ischemic stroke or cerebral venous sinus thrombosis at the UTHealth Pediatric Stroke Clinic were deidentified. Domain-specific prompts were engineered for an offline artificial intelligence program (GPT) to answer IPSS questions. Responses from GPT were compared with the human rater. Percent agreement was assessed across 50 patients for each of the 114 queries developed from the IPSS database outcome questionnaire.</p></div><div><h3>Results</h3><p>GPT demonstrated strong performance on several questions but showed variability overall. In its early iterations it was able to match human judgment occasionally with an accuracy score of 1.00 (n = 20, 17.5%), but it scored as low as 0.26 in some patients. Prompts were adjusted in four subsequent iterations to increase accuracy. In its fourth iteration, agreement was 93.6%, with a maximum agreement of 100% and minimum of 62%. Of 2400 individual items assessed, our model entered 2247 (93.6%) correctly and 153 (6.4%) incorrectly.</p></div><div><h3>Conclusions</h3><p>Although our tailored generative model with domain-specific prompt engineering and ontological guidance shows promise for research applications, further refinement is needed to enhance its accuracy. It cannot enter data entirely independently, but it can be employed in tandem with human oversight contributing to a collaborative approach that reduces overall effort.</p></div>","PeriodicalId":19956,"journal":{"name":"Pediatric neurology","volume":"160 ","pages":"Pages 54-59"},"PeriodicalIF":3.2000,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generative Pre-trained Transformer for Pediatric Stroke Research: A Pilot Study\",\"authors\":\"Anna K. Fiedler BS ,&nbsp;Kai Zhang PhD ,&nbsp;Tia S. Lal BS ,&nbsp;Xiaoqian Jiang PhD ,&nbsp;Stuart M. Fraser MD\",\"doi\":\"10.1016/j.pediatrneurol.2024.07.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>Pediatric stroke is an important cause of morbidity in children. Although research can be challenging, large amounts of data have been captured through collaborative efforts in the International Pediatric Stroke Study (IPSS). This study explores the use of an advanced artificial intelligence program, the Generative Pre-trained Transformer (GPT), to enter pediatric stroke data into the IPSS.</p></div><div><h3>Methods</h3><p>The most recent 50 clinical notes of patients with ischemic stroke or cerebral venous sinus thrombosis at the UTHealth Pediatric Stroke Clinic were deidentified. Domain-specific prompts were engineered for an offline artificial intelligence program (GPT) to answer IPSS questions. Responses from GPT were compared with the human rater. Percent agreement was assessed across 50 patients for each of the 114 queries developed from the IPSS database outcome questionnaire.</p></div><div><h3>Results</h3><p>GPT demonstrated strong performance on several questions but showed variability overall. In its early iterations it was able to match human judgment occasionally with an accuracy score of 1.00 (n = 20, 17.5%), but it scored as low as 0.26 in some patients. Prompts were adjusted in four subsequent iterations to increase accuracy. In its fourth iteration, agreement was 93.6%, with a maximum agreement of 100% and minimum of 62%. Of 2400 individual items assessed, our model entered 2247 (93.6%) correctly and 153 (6.4%) incorrectly.</p></div><div><h3>Conclusions</h3><p>Although our tailored generative model with domain-specific prompt engineering and ontological guidance shows promise for research applications, further refinement is needed to enhance its accuracy. It cannot enter data entirely independently, but it can be employed in tandem with human oversight contributing to a collaborative approach that reduces overall effort.</p></div>\",\"PeriodicalId\":19956,\"journal\":{\"name\":\"Pediatric neurology\",\"volume\":\"160 \",\"pages\":\"Pages 54-59\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pediatric neurology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0887899424002522\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pediatric neurology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0887899424002522","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景小儿卒中是导致儿童发病的一个重要原因。虽然研究具有挑战性,但通过国际儿科卒中研究(IPSS)的共同努力,已经收集了大量数据。本研究探讨了如何使用先进的人工智能程序生成预训练转换器(GPT)将儿科卒中数据输入 IPSS。为离线人工智能程序(GPT)设计了特定领域的提示来回答 IPSS 问题。将 GPT 的回答与人类评分者的回答进行比较。结果 GPT 在几个问题上表现出很强的性能,但总体上表现出不稳定性。在早期的迭代中,它偶尔能与人工判断相匹配,准确率达到 1.00(n = 20,17.5%),但在一些患者身上,准确率低至 0.26。在随后的四次迭代中,对提示进行了调整,以提高准确率。在第四次迭代中,吻合率为 93.6%,最大吻合率为 100%,最小吻合率为 62%。在评估的 2400 个单项中,我们的模型正确输入了 2247 项(93.6%),错误输入了 153 项(6.4%)。结论虽然我们的定制生成模型具有特定领域的提示工程和本体论指导,显示了研究应用的前景,但仍需进一步完善以提高其准确性。它不能完全独立地输入数据,但可以与人工监督协同使用,从而减少整体工作量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Generative Pre-trained Transformer for Pediatric Stroke Research: A Pilot Study

Background

Pediatric stroke is an important cause of morbidity in children. Although research can be challenging, large amounts of data have been captured through collaborative efforts in the International Pediatric Stroke Study (IPSS). This study explores the use of an advanced artificial intelligence program, the Generative Pre-trained Transformer (GPT), to enter pediatric stroke data into the IPSS.

Methods

The most recent 50 clinical notes of patients with ischemic stroke or cerebral venous sinus thrombosis at the UTHealth Pediatric Stroke Clinic were deidentified. Domain-specific prompts were engineered for an offline artificial intelligence program (GPT) to answer IPSS questions. Responses from GPT were compared with the human rater. Percent agreement was assessed across 50 patients for each of the 114 queries developed from the IPSS database outcome questionnaire.

Results

GPT demonstrated strong performance on several questions but showed variability overall. In its early iterations it was able to match human judgment occasionally with an accuracy score of 1.00 (n = 20, 17.5%), but it scored as low as 0.26 in some patients. Prompts were adjusted in four subsequent iterations to increase accuracy. In its fourth iteration, agreement was 93.6%, with a maximum agreement of 100% and minimum of 62%. Of 2400 individual items assessed, our model entered 2247 (93.6%) correctly and 153 (6.4%) incorrectly.

Conclusions

Although our tailored generative model with domain-specific prompt engineering and ontological guidance shows promise for research applications, further refinement is needed to enhance its accuracy. It cannot enter data entirely independently, but it can be employed in tandem with human oversight contributing to a collaborative approach that reduces overall effort.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Pediatric neurology
Pediatric neurology 医学-临床神经学
CiteScore
4.80
自引率
2.60%
发文量
176
审稿时长
78 days
期刊介绍: Pediatric Neurology publishes timely peer-reviewed clinical and research articles covering all aspects of the developing nervous system. Pediatric Neurology features up-to-the-minute publication of the latest advances in the diagnosis, management, and treatment of pediatric neurologic disorders. The journal''s editor, E. Steve Roach, in conjunction with the team of Associate Editors, heads an internationally recognized editorial board, ensuring the most authoritative and extensive coverage of the field. Among the topics covered are: epilepsy, mitochondrial diseases, congenital malformations, chromosomopathies, peripheral neuropathies, perinatal and childhood stroke, cerebral palsy, as well as other diseases affecting the developing nervous system.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信