Yihui Zhu , Yue Zhou , Xiaotong Zhang , Yueying Li , Yonggui Yuan , Youyong Kong
{"title":"MSARAE: Multiscale adversarial regularized autoencoders for cortical network classification","authors":"Yihui Zhu , Yue Zhou , Xiaotong Zhang , Yueying Li , Yonggui Yuan , Youyong Kong","doi":"10.1016/j.media.2025.103775","DOIUrl":null,"url":null,"abstract":"<div><div>Due to privacy regulations and technical limitations, current research on the cerebral cortex frequently faces challenges, including limited data availability. The number of samples significantly influences the performance and generalization ability of deep learning models. In general, these models require sufficient training data to effectively learn underlying distributions and features, enabling strong performance on unseen samples. A limited sample size can lead to overfitting, thereby weakening the model’s generalizability. To address these challenges from a data augmentation perspective, we propose a Multi-Scale Adversarial Regularized Autoencoder (MSARAE) for augmenting and classifying cortical structural connectivity. The approach begins with data preprocessing and the construction of cortical structural connectivity networks. To better capture cortical features, the model leverages Laplacian eigenvectors to enhance topological information. Structural connectivity is then generated using variational autoencoders, with multi-scale graph convolutional layers serving as encoders to capture graph representations at different scales. An adversarial regularization mechanism is introduced to minimize the distribution discrepancy in the latent space. By training a discriminator, the model encourages the encoder to produce latent representations that closely match the distribution of real data, thereby improving its representational capacity. Finally, extensive experiments on the major depression disorder (MDD) dataset, the Human Connectome Project (HCP) dataset, and the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset demonstrated the superiority of the model.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"107 ","pages":"Article 103775"},"PeriodicalIF":11.8000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525003214","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Due to privacy regulations and technical limitations, current research on the cerebral cortex frequently faces challenges, including limited data availability. The number of samples significantly influences the performance and generalization ability of deep learning models. In general, these models require sufficient training data to effectively learn underlying distributions and features, enabling strong performance on unseen samples. A limited sample size can lead to overfitting, thereby weakening the model’s generalizability. To address these challenges from a data augmentation perspective, we propose a Multi-Scale Adversarial Regularized Autoencoder (MSARAE) for augmenting and classifying cortical structural connectivity. The approach begins with data preprocessing and the construction of cortical structural connectivity networks. To better capture cortical features, the model leverages Laplacian eigenvectors to enhance topological information. Structural connectivity is then generated using variational autoencoders, with multi-scale graph convolutional layers serving as encoders to capture graph representations at different scales. An adversarial regularization mechanism is introduced to minimize the distribution discrepancy in the latent space. By training a discriminator, the model encourages the encoder to produce latent representations that closely match the distribution of real data, thereby improving its representational capacity. Finally, extensive experiments on the major depression disorder (MDD) dataset, the Human Connectome Project (HCP) dataset, and the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset demonstrated the superiority of the model.
期刊介绍:
Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.