Lin Zhang , Liwen Jian , Yiming Long , Zhihong Ren , Vince D. Calhoun , Ives Cavalcante Passos , Xinyu Tian , Yuhong Xiang
{"title":"Machine learning approaches for classifying major depressive disorder using biological and neuropsychological markers: A meta-analysis","authors":"Lin Zhang , Liwen Jian , Yiming Long , Zhihong Ren , Vince D. Calhoun , Ives Cavalcante Passos , Xinyu Tian , Yuhong Xiang","doi":"10.1016/j.neubiorev.2025.106201","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional diagnostic methods for major depressive disorder (MDD), which rely on subjective assessments, may compromise diagnostic accuracy. In contrast, machine learning models have the potential to classify and diagnose MDD more effectively, reducing the risk of misdiagnosis associated with conventional methods. The aim of this meta-analysis is to evaluate the overall classification accuracy of machine learning models in MDD and examine the effects of machine learning algorithms, biomarkers, diagnostic comparison groups, validation procedures, and participant age on classification performance. As of September 2024, a total of 176 studies were ultimately included in the meta-analysis, encompassing a total of 60,926 participants. A random-effects model was applied to analyze the extracted data, resulting in an overall classification accuracy of 0.825 (95 % CI [0.810; 0.839]). Convolutional neural networks significantly outperformed support vector machines (SVM) when using electroencephalography and magnetoencephalography data. Additionally, SVM demonstrated significantly better performance with functional magnetic resonance imaging data compared to graph neural networks and gaussian process classification. The sample size was negatively correlated to classification accuracy. Furthermore, evidence of publication bias was also detected. Therefore, while this study indicates that machine learning models show high accuracy in distinguishing MDD from healthy controls and other psychiatric disorders, further research is required before these findings can be generalized to large-scale clinical practice.</div></div>","PeriodicalId":56105,"journal":{"name":"Neuroscience and Biobehavioral Reviews","volume":"174 ","pages":"Article 106201"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroscience and Biobehavioral Reviews","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0149763425002015","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Traditional diagnostic methods for major depressive disorder (MDD), which rely on subjective assessments, may compromise diagnostic accuracy. In contrast, machine learning models have the potential to classify and diagnose MDD more effectively, reducing the risk of misdiagnosis associated with conventional methods. The aim of this meta-analysis is to evaluate the overall classification accuracy of machine learning models in MDD and examine the effects of machine learning algorithms, biomarkers, diagnostic comparison groups, validation procedures, and participant age on classification performance. As of September 2024, a total of 176 studies were ultimately included in the meta-analysis, encompassing a total of 60,926 participants. A random-effects model was applied to analyze the extracted data, resulting in an overall classification accuracy of 0.825 (95 % CI [0.810; 0.839]). Convolutional neural networks significantly outperformed support vector machines (SVM) when using electroencephalography and magnetoencephalography data. Additionally, SVM demonstrated significantly better performance with functional magnetic resonance imaging data compared to graph neural networks and gaussian process classification. The sample size was negatively correlated to classification accuracy. Furthermore, evidence of publication bias was also detected. Therefore, while this study indicates that machine learning models show high accuracy in distinguishing MDD from healthy controls and other psychiatric disorders, further research is required before these findings can be generalized to large-scale clinical practice.
传统的重度抑郁障碍(MDD)诊断方法依赖于主观评估,可能会降低诊断的准确性。相比之下,机器学习模型有可能更有效地分类和诊断MDD,降低与传统方法相关的误诊风险。本荟萃分析的目的是评估MDD中机器学习模型的总体分类准确性,并检查机器学习算法、生物标志物、诊断对照组、验证程序和参与者年龄对分类性能的影响。截至2024年9月,共有176项研究最终被纳入荟萃分析,共有60,926名参与者。采用随机效应模型对提取的数据进行分析,总体分类准确率为0.825 (95% CI [0.810;0.839])。卷积神经网络在使用脑电图和脑磁图数据时明显优于支持向量机(SVM)。此外,与图神经网络和高斯过程分类相比,SVM在功能磁共振成像数据上表现出明显更好的性能。样本量与分类准确率呈负相关。此外,还发现了发表偏倚的证据。因此,虽然本研究表明机器学习模型在区分重度抑郁症与健康对照和其他精神疾病方面具有很高的准确性,但在将这些发现推广到大规模临床实践之前,还需要进一步的研究。
期刊介绍:
The official journal of the International Behavioral Neuroscience Society publishes original and significant review articles that explore the intersection between neuroscience and the study of psychological processes and behavior. The journal also welcomes articles that primarily focus on psychological processes and behavior, as long as they have relevance to one or more areas of neuroscience.