Shane Poole, Nikki Sisodia, Kanishka Koshal, Kyra Henderson, Jaeleene Wijangco, Danelvis Paredes, Chelsea Chen, William Rowles, Amit Akula, Jens Wuerfel, Vishakha Sharma, Andreas M Rauschecker, Roland G Henry, Riley Bove
{"title":"使用大型语言模型检测新病变:在真实世界多发性硬化症数据集中的应用。","authors":"Shane Poole, Nikki Sisodia, Kanishka Koshal, Kyra Henderson, Jaeleene Wijangco, Danelvis Paredes, Chelsea Chen, William Rowles, Amit Akula, Jens Wuerfel, Vishakha Sharma, Andreas M Rauschecker, Roland G Henry, Riley Bove","doi":"10.1002/ana.27251","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Neuroimaging is routinely utilized to identify new inflammatory activity in multiple sclerosis (MS). A large language model to classify narrative magnetic resonance imaging reports in the electronic health record (EHR) as discrete data could provide significant benefits for MS research. The objectives of the current study were to develop such a prompt and to illustrate its research applications through a common clinical scenario: monitoring response to B-cell depleting therapy (BCDT).</p><p><strong>Methods: </strong>An institutional ecosystem that securely connects healthcare data with ChatGPT4 was applied to clinical MS magnetic resonance imaging reports in a single institutional EHR (2000-2022). A prompt (msLesionprompt) was developed and iteratively refined to classify the presence or absence of new T2-weighted lesions (newT2w) and contrast-enhancing lesions (CEL). The multistep validation included evaluating efficiency (time and cost), comparison with manually annotated reports using standard confusion matrix, and application to identifying predictors of newT2w/CEL after BCDT start.</p><p><strong>Results: </strong>Accuracy of msLesionprompt was high for detection of newT2w (97%) and CEL (96.8%). All 14,888 available reports were categorized in 4.13 hours ($28); 79% showed no newT2w or CEL. Data extracted showed expected suppression of new activity by BCDT (>97% monitoring magnetic resonance images after an initial \"rebaseline\" scan). Neighborhood poverty (Area Deprivation Index) was identified as a predictor of inflammatory activity (newT2w: OR 1.69, 95% CI 1.10-2.59, p = 0.017; CEL: OR 1.54, 95% CI 1.01-2.34, p = 0.046).</p><p><strong>Interpretation: </strong>Extracting discrete information from narrative imaging reports using an large language model is feasible and efficient. This approach could augment many real-world analyses of MS disease evolution and treatment response. ANN NEUROL 2025.</p>","PeriodicalId":127,"journal":{"name":"Annals of Neurology","volume":" ","pages":""},"PeriodicalIF":8.1000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detecting New Lesions Using a Large Language Model: Applications in Real-World Multiple Sclerosis Datasets.\",\"authors\":\"Shane Poole, Nikki Sisodia, Kanishka Koshal, Kyra Henderson, Jaeleene Wijangco, Danelvis Paredes, Chelsea Chen, William Rowles, Amit Akula, Jens Wuerfel, Vishakha Sharma, Andreas M Rauschecker, Roland G Henry, Riley Bove\",\"doi\":\"10.1002/ana.27251\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>Neuroimaging is routinely utilized to identify new inflammatory activity in multiple sclerosis (MS). A large language model to classify narrative magnetic resonance imaging reports in the electronic health record (EHR) as discrete data could provide significant benefits for MS research. The objectives of the current study were to develop such a prompt and to illustrate its research applications through a common clinical scenario: monitoring response to B-cell depleting therapy (BCDT).</p><p><strong>Methods: </strong>An institutional ecosystem that securely connects healthcare data with ChatGPT4 was applied to clinical MS magnetic resonance imaging reports in a single institutional EHR (2000-2022). A prompt (msLesionprompt) was developed and iteratively refined to classify the presence or absence of new T2-weighted lesions (newT2w) and contrast-enhancing lesions (CEL). The multistep validation included evaluating efficiency (time and cost), comparison with manually annotated reports using standard confusion matrix, and application to identifying predictors of newT2w/CEL after BCDT start.</p><p><strong>Results: </strong>Accuracy of msLesionprompt was high for detection of newT2w (97%) and CEL (96.8%). All 14,888 available reports were categorized in 4.13 hours ($28); 79% showed no newT2w or CEL. Data extracted showed expected suppression of new activity by BCDT (>97% monitoring magnetic resonance images after an initial \\\"rebaseline\\\" scan). Neighborhood poverty (Area Deprivation Index) was identified as a predictor of inflammatory activity (newT2w: OR 1.69, 95% CI 1.10-2.59, p = 0.017; CEL: OR 1.54, 95% CI 1.01-2.34, p = 0.046).</p><p><strong>Interpretation: </strong>Extracting discrete information from narrative imaging reports using an large language model is feasible and efficient. This approach could augment many real-world analyses of MS disease evolution and treatment response. ANN NEUROL 2025.</p>\",\"PeriodicalId\":127,\"journal\":{\"name\":\"Annals of Neurology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2025-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of Neurology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/ana.27251\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Neurology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/ana.27251","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
摘要
目的:神经影像学常规用于识别多发性硬化症(MS)的新炎症活动。将电子健康记录(EHR)中的叙述性磁共振成像报告分类为离散数据的大型语言模型可以为MS研究提供显着的好处。当前研究的目的是开发这样一个提示,并通过一个常见的临床场景来说明其研究应用:监测对b细胞消耗疗法(BCDT)的反应。方法:将医疗保健数据与ChatGPT4安全连接的机构生态系统应用于单一机构EHR(2000-2022)的临床MS磁共振成像报告。我们开发了一个提示(mslessionprompt),并不断改进,以区分是否存在新的t2加权病变(newT2w)和对比增强病变(CEL)。多步骤验证包括评估效率(时间和成本),使用标准混淆矩阵与手动注释的报告进行比较,以及应用于识别BCDT开始后newT2w/CEL的预测因子。结果:msLesionprompt对newT2w和CEL的检测准确率较高,分别为97%和96.8%。所有14,888份可用报告在4.13小时(28美元)内分类;79%未见新t2w或CEL。提取的数据显示,BCDT对新活动有预期的抑制作用(在初始“重新基线”扫描后,bb0 97%监测磁共振图像)。社区贫困(区域剥夺指数)被确定为炎症活动的预测因子(newT2w: OR 1.69, 95% CI 1.10-2.59, p = 0.017;CEL: OR 1.54, 95% CI 1.01-2.34, p = 0.046)。解释:使用大型语言模型从叙事影像报告中提取离散信息是可行且有效的。这种方法可以增加许多MS疾病演变和治疗反应的现实世界分析。Ann neurol 2025。
Detecting New Lesions Using a Large Language Model: Applications in Real-World Multiple Sclerosis Datasets.
Objective: Neuroimaging is routinely utilized to identify new inflammatory activity in multiple sclerosis (MS). A large language model to classify narrative magnetic resonance imaging reports in the electronic health record (EHR) as discrete data could provide significant benefits for MS research. The objectives of the current study were to develop such a prompt and to illustrate its research applications through a common clinical scenario: monitoring response to B-cell depleting therapy (BCDT).
Methods: An institutional ecosystem that securely connects healthcare data with ChatGPT4 was applied to clinical MS magnetic resonance imaging reports in a single institutional EHR (2000-2022). A prompt (msLesionprompt) was developed and iteratively refined to classify the presence or absence of new T2-weighted lesions (newT2w) and contrast-enhancing lesions (CEL). The multistep validation included evaluating efficiency (time and cost), comparison with manually annotated reports using standard confusion matrix, and application to identifying predictors of newT2w/CEL after BCDT start.
Results: Accuracy of msLesionprompt was high for detection of newT2w (97%) and CEL (96.8%). All 14,888 available reports were categorized in 4.13 hours ($28); 79% showed no newT2w or CEL. Data extracted showed expected suppression of new activity by BCDT (>97% monitoring magnetic resonance images after an initial "rebaseline" scan). Neighborhood poverty (Area Deprivation Index) was identified as a predictor of inflammatory activity (newT2w: OR 1.69, 95% CI 1.10-2.59, p = 0.017; CEL: OR 1.54, 95% CI 1.01-2.34, p = 0.046).
Interpretation: Extracting discrete information from narrative imaging reports using an large language model is feasible and efficient. This approach could augment many real-world analyses of MS disease evolution and treatment response. ANN NEUROL 2025.
期刊介绍:
Annals of Neurology publishes original articles with potential for high impact in understanding the pathogenesis, clinical and laboratory features, diagnosis, treatment, outcomes and science underlying diseases of the human nervous system. Articles should ideally be of broad interest to the academic neurological community rather than solely to subspecialists in a particular field. Studies involving experimental model system, including those in cell and organ cultures and animals, of direct translational relevance to the understanding of neurological disease are also encouraged.