Predicting language outcome after stroke using machine learning: in search of the big data benefit.

IF 3.6 2区医学 Q2 NEUROIMAGING

Neuroimage-Clinical Pub Date : 2025-08-06 DOI:10.1016/j.nicl.2025.103858

Margarita Saranti, Douglas Neville, Adam White, Pia Rotshtein, Thomas M H Hope, Cathy J Price, Howard Bowman

{"title":"Predicting language outcome after stroke using machine learning: in search of the big data benefit.","authors":"Margarita Saranti, Douglas Neville, Adam White, Pia Rotshtein, Thomas M H Hope, Cathy J Price, Howard Bowman","doi":"10.1016/j.nicl.2025.103858","DOIUrl":null,"url":null,"abstract":"<p><p>Accurate prediction of post-stroke language outcomes using machine learning offers the potential to enhance clinical treatment and rehabilitation for aphasic patients. This study of 758 English speaking stroke patients from the PLORAS project explores the impact of sample size on the performance of logistic regression and a deep learning (ResNet-18) model in predicting language outcomes from neuroimaging and impairment-relevant tabular data. We assessed the performance of both models on two key language tasks from the Comprehensive Aphasia Test: Spoken Picture Description and Naming, using a learning curve approach. Contrary to expectations, the simpler logistic regression model performed comparably or better than the deep learning model (with overlapping confidence intervals), with both models showing an accuracy plateau around 80% for sample sizes larger than 300 patients. Principal Component Analysis revealed that the dimensionality of the neuroimaging data could be reduced to as few as 20 (or even 2) dominant components without significant loss in accuracy, suggesting that classification may be driven by simple patterns such as lesion size. The study highlights both the potential limitations of current dataset size in achieving further accuracy gains and the need for larger datasets to capture more complex patterns, as some of our results indicate that we might not have reached an absolute classification performance ceiling. Overall, these findings provide insights into the practical use of machine learning for predicting aphasia outcomes and the potential benefits of much larger datasets in enhancing model performance.</p>","PeriodicalId":54359,"journal":{"name":"Neuroimage-Clinical","volume":"48 ","pages":"103858"},"PeriodicalIF":3.6000,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12351174/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroimage-Clinical","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.nicl.2025.103858","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"NEUROIMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate prediction of post-stroke language outcomes using machine learning offers the potential to enhance clinical treatment and rehabilitation for aphasic patients. This study of 758 English speaking stroke patients from the PLORAS project explores the impact of sample size on the performance of logistic regression and a deep learning (ResNet-18) model in predicting language outcomes from neuroimaging and impairment-relevant tabular data. We assessed the performance of both models on two key language tasks from the Comprehensive Aphasia Test: Spoken Picture Description and Naming, using a learning curve approach. Contrary to expectations, the simpler logistic regression model performed comparably or better than the deep learning model (with overlapping confidence intervals), with both models showing an accuracy plateau around 80% for sample sizes larger than 300 patients. Principal Component Analysis revealed that the dimensionality of the neuroimaging data could be reduced to as few as 20 (or even 2) dominant components without significant loss in accuracy, suggesting that classification may be driven by simple patterns such as lesion size. The study highlights both the potential limitations of current dataset size in achieving further accuracy gains and the need for larger datasets to capture more complex patterns, as some of our results indicate that we might not have reached an absolute classification performance ceiling. Overall, these findings provide insights into the practical use of machine learning for predicting aphasia outcomes and the potential benefits of much larger datasets in enhancing model performance.

查看原文本刊更多论文

使用机器学习预测中风后的语言结果：寻找大数据的好处。

使用机器学习对中风后语言结果的准确预测为加强失语症患者的临床治疗和康复提供了潜力。本研究对来自PLORAS项目的758名说英语的中风患者进行了研究，探讨了样本量对逻辑回归和深度学习（ResNet-18）模型在从神经影像学和损伤相关表格数据预测语言结果方面的影响。我们使用学习曲线方法评估了两种模型在综合失语症测试中的两个关键语言任务的表现：口头图片描述和命名。与预期相反，更简单的逻辑回归模型的表现与深度学习模型相当或更好（具有重叠的置信区间），对于超过300名患者的样本量，两种模型的准确率都在80%左右。主成分分析显示，神经成像数据的维数可以减少到20（甚至2）个主成分，而不会显著降低准确性，这表明分类可能是由病变大小等简单模式驱动的。该研究强调了当前数据集规模在实现进一步准确性提高方面的潜在限制，以及需要更大的数据集来捕获更复杂的模式，因为我们的一些结果表明，我们可能没有达到绝对的分类性能上限。总的来说，这些发现为机器学习在预测失语症结果方面的实际应用以及更大数据集在提高模型性能方面的潜在好处提供了见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neuroimage-Clinical NEUROIMAGING-

CiteScore

7.50

自引率

4.80%

发文量

368

审稿时长

52 days

期刊介绍： NeuroImage: Clinical, a journal of diseases, disorders and syndromes involving the Nervous System, provides a vehicle for communicating important advances in the study of abnormal structure-function relationships of the human nervous system based on imaging. The focus of NeuroImage: Clinical is on defining changes to the brain associated with primary neurologic and psychiatric diseases and disorders of the nervous system as well as behavioral syndromes and developmental conditions. The main criterion for judging papers is the extent of scientific advancement in the understanding of the pathophysiologic mechanisms of diseases and disorders, in identification of functional models that link clinical signs and symptoms with brain function and in the creation of image based tools applicable to a broad range of clinical needs including diagnosis, monitoring and tracking of illness, predicting therapeutic response and development of new treatments. Papers dealing with structure and function in animal models will also be considered if they reveal mechanisms that can be readily translated to human conditions.