基于计算机断层扫描或磁共振成像报告，用于提取新识别的急性脑梗死的微调大型语言模型。

IF 1.3 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Emergency Radiology Pub Date : 2025-08-01 Epub Date: 2025-06-02 DOI:10.1007/s10140-025-02354-1

Nana Fujita, Koichiro Yasaka, Shigeru Kiryu, Osamu Abe

{"title":"基于计算机断层扫描或磁共振成像报告，用于提取新识别的急性脑梗死的微调大型语言模型。","authors":"Nana Fujita, Koichiro Yasaka, Shigeru Kiryu, Osamu Abe","doi":"10.1007/s10140-025-02354-1","DOIUrl":null,"url":null,"abstract":"Purpose: This study aimed to develop an automated early warning system using a large language model (LLM) to identify acute to subacute brain infarction from free-text computed tomography (CT) or magnetic resonance imaging (MRI) radiology reports.Methods: In this retrospective study, 5,573, 1,883, and 834 patients were included in the training (mean age, 67.5 ± 17.2 years; 2,831 males), validation (mean age, 61.5 ± 18.3 years; 994 males), and test (mean age, 66.5 ± 16.1 years; 488 males) datasets. An LLM (Japanese Bidirectional Encoder Representations from Transformers model) was fine-tuned to classify the CT and MRI reports into three groups (group 0, newly identified acute to subacute infarction; group 1, known acute to subacute infarction or old infarction; group 2, without infarction). The training and validation processes were repeated 15 times, and the best-performing model on the validation dataset was selected to further evaluate its performance on the test dataset.Results: The best fine-tuned model exhibited sensitivities of 0.891, 0.905, and 0.959 for groups 0, 1, and 2, respectively, in the test dataset. The macrosensitivity (the average of sensitivity for all groups) and accuracy were 0.918 and 0.923, respectively. The model's performance in extracting newly identified acute brain infarcts was high, with an area under the receiver operating characteristic curve of 0.979 (95% confidence interval, 0.956-1.000). The average prediction time was 0.115 ± 0.037 s per patient.Conclusion: A fine-tuned LLM could extract newly identified acute to subacute brain infarcts based on CT or MRI findings with high performance.","PeriodicalId":11623,"journal":{"name":"Emergency Radiology","volume":" ","pages":"495-501"},"PeriodicalIF":1.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12328549/pdf/","citationCount":"0","resultStr":"{\"title\":\"Fine-tuned large Language model for extracting newly identified acute brain infarcts based on computed tomography or magnetic resonance imaging reports.\",\"authors\":\"Nana Fujita, Koichiro Yasaka, Shigeru Kiryu, Osamu Abe\",\"doi\":\"10.1007/s10140-025-02354-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Purpose: This study aimed to develop an automated early warning system using a large language model (LLM) to identify acute to subacute brain infarction from free-text computed tomography (CT) or magnetic resonance imaging (MRI) radiology reports.Methods: In this retrospective study, 5,573, 1,883, and 834 patients were included in the training (mean age, 67.5 ± 17.2 years; 2,831 males), validation (mean age, 61.5 ± 18.3 years; 994 males), and test (mean age, 66.5 ± 16.1 years; 488 males) datasets. An LLM (Japanese Bidirectional Encoder Representations from Transformers model) was fine-tuned to classify the CT and MRI reports into three groups (group 0, newly identified acute to subacute infarction; group 1, known acute to subacute infarction or old infarction; group 2, without infarction). The training and validation processes were repeated 15 times, and the best-performing model on the validation dataset was selected to further evaluate its performance on the test dataset.Results: The best fine-tuned model exhibited sensitivities of 0.891, 0.905, and 0.959 for groups 0, 1, and 2, respectively, in the test dataset. The macrosensitivity (the average of sensitivity for all groups) and accuracy were 0.918 and 0.923, respectively. The model's performance in extracting newly identified acute brain infarcts was high, with an area under the receiver operating characteristic curve of 0.979 (95% confidence interval, 0.956-1.000). The average prediction time was 0.115 ± 0.037 s per patient.Conclusion: A fine-tuned LLM could extract newly identified acute to subacute brain infarcts based on CT or MRI findings with high performance.\",\"PeriodicalId\":11623,\"journal\":{\"name\":\"Emergency Radiology\",\"volume\":\" \",\"pages\":\"495-501\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12328549/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Emergency Radiology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s10140-025-02354-1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/2 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Emergency Radiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10140-025-02354-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/2 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

摘要

目的：本研究旨在开发一种使用大语言模型（LLM）的自动预警系统，从自由文本计算机断层扫描（CT）或磁共振成像（MRI）放射学报告中识别急性至亚急性脑梗死。方法：本回顾性研究共纳入5573例、1883例和834例患者(平均年龄67.5±17.2岁；男性2831人)，验证(平均年龄61.5±18.3岁；994名男性)，平均年龄66.5±16.1岁；488名男性)数据集。LLM（日本双向编码器表示从变压器模型）进行微调，将CT和MRI报告分为三组(0组，新发现的急性至亚急性梗死；1组，已知急性至亚急性梗死或陈旧性梗死；第二组，无梗死)。训练和验证过程重复15次，选择在验证数据集上表现最好的模型，进一步评估其在测试数据集上的性能。结果：在测试数据集中，对于第0、1和2组，最佳微调模型的灵敏度分别为0.891、0.905和0.959。宏观灵敏度（各组灵敏度平均值）和准确度分别为0.918和0.923。该模型对新识别急性脑梗死的提取性能较高，受试者工作特征曲线下面积为0.979（95%置信区间为0.956 ~ 1.000）。平均预测时间为0.115±0.037 s /例。结论：调整后的LLM可以根据CT或MRI的表现高效提取新发现的急性至亚急性脑梗死。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Fine-tuned large Language model for extracting newly identified acute brain infarcts based on computed tomography or magnetic resonance imaging reports.

查看原文本刊更多论文

Fine-tuned large Language model for extracting newly identified acute brain infarcts based on computed tomography or magnetic resonance imaging reports.

Purpose: This study aimed to develop an automated early warning system using a large language model (LLM) to identify acute to subacute brain infarction from free-text computed tomography (CT) or magnetic resonance imaging (MRI) radiology reports.

Methods: In this retrospective study, 5,573, 1,883, and 834 patients were included in the training (mean age, 67.5 ± 17.2 years; 2,831 males), validation (mean age, 61.5 ± 18.3 years; 994 males), and test (mean age, 66.5 ± 16.1 years; 488 males) datasets. An LLM (Japanese Bidirectional Encoder Representations from Transformers model) was fine-tuned to classify the CT and MRI reports into three groups (group 0, newly identified acute to subacute infarction; group 1, known acute to subacute infarction or old infarction; group 2, without infarction). The training and validation processes were repeated 15 times, and the best-performing model on the validation dataset was selected to further evaluate its performance on the test dataset.

Results: The best fine-tuned model exhibited sensitivities of 0.891, 0.905, and 0.959 for groups 0, 1, and 2, respectively, in the test dataset. The macrosensitivity (the average of sensitivity for all groups) and accuracy were 0.918 and 0.923, respectively. The model's performance in extracting newly identified acute brain infarcts was high, with an area under the receiver operating characteristic curve of 0.979 (95% confidence interval, 0.956-1.000). The average prediction time was 0.115 ± 0.037 s per patient.

Conclusion: A fine-tuned LLM could extract newly identified acute to subacute brain infarcts based on CT or MRI findings with high performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Emergency Radiology RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.60

自引率

4.50%

发文量

期刊介绍： To advance and improve the radiologic aspects of emergency careTo establish Emergency Radiology as an area of special interest in the field of diagnostic imagingTo improve methods of education in Emergency RadiologyTo provide, through formal meetings, a mechanism for presentation of scientific papers on various aspects of Emergency Radiology and continuing educationTo promote research in Emergency Radiology by clinical and basic science investigators, including residents and other traineesTo act as the resource body on Emergency Radiology for those interested in emergency patient care Members of the American Society of Emergency Radiology (ASER) receive the Emergency Radiology journal as a benefit of membership!