Role of Large Language Models for Suggesting Nerve Involvement in Upper Limbs MRI Reports with Muscle Denervation Signs.

IF 2.8 3区 医学 Q2 Medicine
Teodoro Martín-Noguerol, Pilar López-Úbeda, Antonio Luna, Manuel Gómez-Río, Juan M Górriz
{"title":"Role of Large Language Models for Suggesting Nerve Involvement in Upper Limbs MRI Reports with Muscle Denervation Signs.","authors":"Teodoro Martín-Noguerol, Pilar López-Úbeda, Antonio Luna, Manuel Gómez-Río, Juan M Górriz","doi":"10.1007/s00062-025-01533-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Determining the involvement of specific peripheral nerves (PNs) in the upper limb associated with signs of muscle denervation can be challenging. This study aims to develop, compare, and validate various large language models (LLMs) to automatically identify and establish potential relationships between denervated muscles and their corresponding PNs.</p><p><strong>Materials and methods: </strong>We collected 300 retrospective MRI reports in Spanish from upper limb examinations conducted between 2018 and 2024 that showed signs of muscle denervation. An expert radiologist manually annotated these reports based on the affected peripheral nerves (median, ulnar, radial, axillary, and suprascapular). BERT, DistilBERT, mBART, RoBERTa, and Medical-ELECTRA models were fine-tuned and evaluated on the reports. Additionally, an automatic voting system was implemented to consolidate predictions through majority voting.</p><p><strong>Results: </strong>The voting system achieved the highest F1 scores for the median, ulnar, and radial nerves, with scores of 0.88, 1.00, and 0.90, respectively. Medical-ELECTRA also performed well, achieving F1 scores above 0.82 for the axillary and suprascapular nerves. In contrast, mBART demonstrated lower performance, particularly with an F1 score of 0.38 for the median nerve.</p><p><strong>Conclusions: </strong>Our voting system generally outperforms the individually tested LLMs in determining the specific PN likely associated with muscle denervation patterns detected in upper limb MRI reports. This system can thereby assist radiologists by suggesting the implicated PN when generating their radiology reports.</p>","PeriodicalId":10391,"journal":{"name":"Clinical Neuroradiology","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Neuroradiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00062-025-01533-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: Determining the involvement of specific peripheral nerves (PNs) in the upper limb associated with signs of muscle denervation can be challenging. This study aims to develop, compare, and validate various large language models (LLMs) to automatically identify and establish potential relationships between denervated muscles and their corresponding PNs.

Materials and methods: We collected 300 retrospective MRI reports in Spanish from upper limb examinations conducted between 2018 and 2024 that showed signs of muscle denervation. An expert radiologist manually annotated these reports based on the affected peripheral nerves (median, ulnar, radial, axillary, and suprascapular). BERT, DistilBERT, mBART, RoBERTa, and Medical-ELECTRA models were fine-tuned and evaluated on the reports. Additionally, an automatic voting system was implemented to consolidate predictions through majority voting.

Results: The voting system achieved the highest F1 scores for the median, ulnar, and radial nerves, with scores of 0.88, 1.00, and 0.90, respectively. Medical-ELECTRA also performed well, achieving F1 scores above 0.82 for the axillary and suprascapular nerves. In contrast, mBART demonstrated lower performance, particularly with an F1 score of 0.38 for the median nerve.

Conclusions: Our voting system generally outperforms the individually tested LLMs in determining the specific PN likely associated with muscle denervation patterns detected in upper limb MRI reports. This system can thereby assist radiologists by suggesting the implicated PN when generating their radiology reports.

在有肌肉去神经体征的上肢MRI报告中,大语言模型提示神经受累的作用。
目的:确定与肌肉去神经支配体征相关的上肢特定周围神经(PNs)的受累可能具有挑战性。本研究旨在开发、比较和验证各种大型语言模型(llm),以自动识别和建立失神经支配肌肉与其相应的pn之间的潜在关系。材料和方法:我们收集了300份西班牙语的回顾性MRI报告,这些报告来自2018年至2024年间进行的上肢检查,显示出肌肉去神经支配的迹象。放射科专家根据受影响的周围神经(正中神经、尺骨神经、桡骨神经、腋窝神经和肩胛上神经)对这些报告进行了手工注释。BERT、DistilBERT、mBART、RoBERTa和Medical-ELECTRA模型在报告中进行了微调和评估。此外,还实施了自动投票系统,通过多数投票来巩固预测。结果:该投票系统对正中神经、尺神经和桡神经的F1评分最高,分别为0.88、1.00和0.90。Medical-ELECTRA也表现良好,腋窝神经和肩胛上神经的F1评分均在0.82以上。相比之下,mBART表现出较低的表现,特别是正中神经的F1得分为0.38。结论:我们的投票系统在确定可能与上肢MRI报告中检测到的肌肉去神经支配模式相关的特定PN方面,通常优于单独测试的LLMs。因此,该系统可以帮助放射科医生在生成放射报告时,通过提示所涉及的PN。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Clinical Neuroradiology
Clinical Neuroradiology Medicine-Radiology, Nuclear Medicine and Imaging
CiteScore
4.90
自引率
3.60%
发文量
0
期刊介绍: Clinical Neuroradiology provides current information, original contributions, and reviews in the field of neuroradiology. An interdisciplinary approach is accomplished by diagnostic and therapeutic contributions related to associated subjects. The international coverage and relevance of the journal is underlined by its being the official journal of the German, Swiss, and Austrian Societies of Neuroradiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信