Predicting language outcome after stroke using machine learning: in search of the big data benefit.

IF 3.6 2区 医学 Q2 NEUROIMAGING
Margarita Saranti, Douglas Neville, Adam White, Pia Rotshtein, Thomas M H Hope, Cathy J Price, Howard Bowman
{"title":"Predicting language outcome after stroke using machine learning: in search of the big data benefit.","authors":"Margarita Saranti, Douglas Neville, Adam White, Pia Rotshtein, Thomas M H Hope, Cathy J Price, Howard Bowman","doi":"10.1016/j.nicl.2025.103858","DOIUrl":null,"url":null,"abstract":"<p><p>Accurate prediction of post-stroke language outcomes using machine learning offers the potential to enhance clinical treatment and rehabilitation for aphasic patients. This study of 758 English speaking stroke patients from the PLORAS project explores the impact of sample size on the performance of logistic regression and a deep learning (ResNet-18) model in predicting language outcomes from neuroimaging and impairment-relevant tabular data. We assessed the performance of both models on two key language tasks from the Comprehensive Aphasia Test: Spoken Picture Description and Naming, using a learning curve approach. Contrary to expectations, the simpler logistic regression model performed comparably or better than the deep learning model (with overlapping confidence intervals), with both models showing an accuracy plateau around 80% for sample sizes larger than 300 patients. Principal Component Analysis revealed that the dimensionality of the neuroimaging data could be reduced to as few as 20 (or even 2) dominant components without significant loss in accuracy, suggesting that classification may be driven by simple patterns such as lesion size. The study highlights both the potential limitations of current dataset size in achieving further accuracy gains and the need for larger datasets to capture more complex patterns, as some of our results indicate that we might not have reached an absolute classification performance ceiling. Overall, these findings provide insights into the practical use of machine learning for predicting aphasia outcomes and the potential benefits of much larger datasets in enhancing model performance.</p>","PeriodicalId":54359,"journal":{"name":"Neuroimage-Clinical","volume":"48 ","pages":"103858"},"PeriodicalIF":3.6000,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12351174/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroimage-Clinical","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.nicl.2025.103858","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"NEUROIMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate prediction of post-stroke language outcomes using machine learning offers the potential to enhance clinical treatment and rehabilitation for aphasic patients. This study of 758 English speaking stroke patients from the PLORAS project explores the impact of sample size on the performance of logistic regression and a deep learning (ResNet-18) model in predicting language outcomes from neuroimaging and impairment-relevant tabular data. We assessed the performance of both models on two key language tasks from the Comprehensive Aphasia Test: Spoken Picture Description and Naming, using a learning curve approach. Contrary to expectations, the simpler logistic regression model performed comparably or better than the deep learning model (with overlapping confidence intervals), with both models showing an accuracy plateau around 80% for sample sizes larger than 300 patients. Principal Component Analysis revealed that the dimensionality of the neuroimaging data could be reduced to as few as 20 (or even 2) dominant components without significant loss in accuracy, suggesting that classification may be driven by simple patterns such as lesion size. The study highlights both the potential limitations of current dataset size in achieving further accuracy gains and the need for larger datasets to capture more complex patterns, as some of our results indicate that we might not have reached an absolute classification performance ceiling. Overall, these findings provide insights into the practical use of machine learning for predicting aphasia outcomes and the potential benefits of much larger datasets in enhancing model performance.

使用机器学习预测中风后的语言结果:寻找大数据的好处。
使用机器学习对中风后语言结果的准确预测为加强失语症患者的临床治疗和康复提供了潜力。本研究对来自PLORAS项目的758名说英语的中风患者进行了研究,探讨了样本量对逻辑回归和深度学习(ResNet-18)模型在从神经影像学和损伤相关表格数据预测语言结果方面的影响。我们使用学习曲线方法评估了两种模型在综合失语症测试中的两个关键语言任务的表现:口头图片描述和命名。与预期相反,更简单的逻辑回归模型的表现与深度学习模型相当或更好(具有重叠的置信区间),对于超过300名患者的样本量,两种模型的准确率都在80%左右。主成分分析显示,神经成像数据的维数可以减少到20(甚至2)个主成分,而不会显著降低准确性,这表明分类可能是由病变大小等简单模式驱动的。该研究强调了当前数据集规模在实现进一步准确性提高方面的潜在限制,以及需要更大的数据集来捕获更复杂的模式,因为我们的一些结果表明,我们可能没有达到绝对的分类性能上限。总的来说,这些发现为机器学习在预测失语症结果方面的实际应用以及更大数据集在提高模型性能方面的潜在好处提供了见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Neuroimage-Clinical
Neuroimage-Clinical NEUROIMAGING-
CiteScore
7.50
自引率
4.80%
发文量
368
审稿时长
52 days
期刊介绍: NeuroImage: Clinical, a journal of diseases, disorders and syndromes involving the Nervous System, provides a vehicle for communicating important advances in the study of abnormal structure-function relationships of the human nervous system based on imaging. The focus of NeuroImage: Clinical is on defining changes to the brain associated with primary neurologic and psychiatric diseases and disorders of the nervous system as well as behavioral syndromes and developmental conditions. The main criterion for judging papers is the extent of scientific advancement in the understanding of the pathophysiologic mechanisms of diseases and disorders, in identification of functional models that link clinical signs and symptoms with brain function and in the creation of image based tools applicable to a broad range of clinical needs including diagnosis, monitoring and tracking of illness, predicting therapeutic response and development of new treatments. Papers dealing with structure and function in animal models will also be considered if they reveal mechanisms that can be readily translated to human conditions.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信