MSARAE: Multiscale adversarial regularized autoencoders for cortical network classification

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-08-19 DOI:10.1016/j.media.2025.103775

Yihui Zhu , Yue Zhou , Xiaotong Zhang , Yueying Li , Yonggui Yuan , Youyong Kong

{"title":"MSARAE: Multiscale adversarial regularized autoencoders for cortical network classification","authors":"Yihui Zhu , Yue Zhou , Xiaotong Zhang , Yueying Li , Yonggui Yuan , Youyong Kong","doi":"10.1016/j.media.2025.103775","DOIUrl":null,"url":null,"abstract":"<div><div>Due to privacy regulations and technical limitations, current research on the cerebral cortex frequently faces challenges, including limited data availability. The number of samples significantly influences the performance and generalization ability of deep learning models. In general, these models require sufficient training data to effectively learn underlying distributions and features, enabling strong performance on unseen samples. A limited sample size can lead to overfitting, thereby weakening the model’s generalizability. To address these challenges from a data augmentation perspective, we propose a Multi-Scale Adversarial Regularized Autoencoder (MSARAE) for augmenting and classifying cortical structural connectivity. The approach begins with data preprocessing and the construction of cortical structural connectivity networks. To better capture cortical features, the model leverages Laplacian eigenvectors to enhance topological information. Structural connectivity is then generated using variational autoencoders, with multi-scale graph convolutional layers serving as encoders to capture graph representations at different scales. An adversarial regularization mechanism is introduced to minimize the distribution discrepancy in the latent space. By training a discriminator, the model encourages the encoder to produce latent representations that closely match the distribution of real data, thereby improving its representational capacity. Finally, extensive experiments on the major depression disorder (MDD) dataset, the Human Connectome Project (HCP) dataset, and the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset demonstrated the superiority of the model.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"107 ","pages":"Article 103775"},"PeriodicalIF":11.8000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525003214","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Due to privacy regulations and technical limitations, current research on the cerebral cortex frequently faces challenges, including limited data availability. The number of samples significantly influences the performance and generalization ability of deep learning models. In general, these models require sufficient training data to effectively learn underlying distributions and features, enabling strong performance on unseen samples. A limited sample size can lead to overfitting, thereby weakening the model’s generalizability. To address these challenges from a data augmentation perspective, we propose a Multi-Scale Adversarial Regularized Autoencoder (MSARAE) for augmenting and classifying cortical structural connectivity. The approach begins with data preprocessing and the construction of cortical structural connectivity networks. To better capture cortical features, the model leverages Laplacian eigenvectors to enhance topological information. Structural connectivity is then generated using variational autoencoders, with multi-scale graph convolutional layers serving as encoders to capture graph representations at different scales. An adversarial regularization mechanism is introduced to minimize the distribution discrepancy in the latent space. By training a discriminator, the model encourages the encoder to produce latent representations that closely match the distribution of real data, thereby improving its representational capacity. Finally, extensive experiments on the major depression disorder (MDD) dataset, the Human Connectome Project (HCP) dataset, and the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset demonstrated the superiority of the model.

查看原文本刊更多论文

用于皮质网络分类的多尺度对抗正则化自编码器

由于隐私法规和技术限制，目前对大脑皮层的研究经常面临挑战，包括有限的数据可用性。样本数量显著影响深度学习模型的性能和泛化能力。一般来说，这些模型需要足够的训练数据来有效地学习底层分布和特征，从而在未见过的样本上实现强大的性能。有限的样本量可能导致过拟合，从而削弱模型的可泛化性。为了从数据增强的角度解决这些挑战，我们提出了一种多尺度对抗正则化自编码器（MSARAE）来增强和分类皮层结构连接。该方法从数据预处理和皮质结构连接网络的构建开始。为了更好地捕获皮质特征，该模型利用拉普拉斯特征向量来增强拓扑信息。然后使用变分自编码器生成结构连接，多尺度图卷积层作为编码器来捕获不同尺度的图表示。引入了一种对抗正则化机制来最小化潜在空间中的分布差异。通过训练鉴别器，该模型鼓励编码器产生与真实数据分布密切匹配的潜在表征，从而提高其表征能力。最后，在重度抑郁症（MDD）数据集、人类连接组计划（HCP）数据集和阿尔茨海默病神经成像倡议（ADNI）数据集上进行的大量实验证明了该模型的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.