Self-Supervised Pre-training Tasks for an fMRI Time-series Transformer in Autism Detection.

Machine learning in clinical neuroimaging : 7th international workshop, MLCN 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 10, 2024, proceedings. MLCN (Workshop) (7th : 2024 : Marrakesh, Morocco) Pub Date : 2025-01-01 Epub Date: 2024-12-06 DOI:10.1007/978-3-031-78761-4_14

Yinchi Zhou, Peiyu Duan, Yuexi Du, Nicha C Dvornek

{"title":"Self-Supervised Pre-training Tasks for an fMRI Time-series Transformer in Autism Detection.","authors":"Yinchi Zhou, Peiyu Duan, Yuexi Du, Nicha C Dvornek","doi":"10.1007/978-3-031-78761-4_14","DOIUrl":null,"url":null,"abstract":"<p><p>Autism Spectrum Disorder (ASD) is a neurodevelopmental condition that encompasses a wide variety of symptoms and degrees of impairment, which makes the diagnosis and treatment challenging. Functional magnetic resonance imaging (fMRI) has been extensively used to study brain activity in ASD, and machine learning methods have been applied to analyze resting state fMRI (rs-fMRI) data. However, fewer studies have explored the recent transformer-based models on rs-fMRI data. Given the superiority of transformer models in capturing long-range dependencies in sequence data, we have developed a transformer-based self-supervised framework that directly analyzes time-series fMRI data without computing functional connectivity. To address over-fitting in small datasets and enhance the model performance, we propose self-supervised pre-training tasks to reconstruct the randomly masked fMRI time-series data, investigating the effects of various masking strategies. We then fine-tune the model for the ASD classification task and evaluate it using two public datasets and five-fold cross-validation with different amounts of training data. The experiments show that randomly masking entire ROIs gives better model performance than randomly masking time points in the pre-training step, resulting in an average improvement of 10.8% for AUC and 9.3% for subject accuracy compared with the transformer model trained from scratch across different levels of training data availability. Our code is available on GitHub .</p>","PeriodicalId":520367,"journal":{"name":"Machine learning in clinical neuroimaging : 7th international workshop, MLCN 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 10, 2024, proceedings. MLCN (Workshop) (7th : 2024 : Marrakesh, Morocco)","volume":"15266 ","pages":"145-154"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11951341/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning in clinical neuroimaging : 7th international workshop, MLCN 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 10, 2024, proceedings. MLCN (Workshop) (7th : 2024 : Marrakesh, Morocco)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-031-78761-4_14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/6 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Autism Spectrum Disorder (ASD) is a neurodevelopmental condition that encompasses a wide variety of symptoms and degrees of impairment, which makes the diagnosis and treatment challenging. Functional magnetic resonance imaging (fMRI) has been extensively used to study brain activity in ASD, and machine learning methods have been applied to analyze resting state fMRI (rs-fMRI) data. However, fewer studies have explored the recent transformer-based models on rs-fMRI data. Given the superiority of transformer models in capturing long-range dependencies in sequence data, we have developed a transformer-based self-supervised framework that directly analyzes time-series fMRI data without computing functional connectivity. To address over-fitting in small datasets and enhance the model performance, we propose self-supervised pre-training tasks to reconstruct the randomly masked fMRI time-series data, investigating the effects of various masking strategies. We then fine-tune the model for the ASD classification task and evaluate it using two public datasets and five-fold cross-validation with different amounts of training data. The experiments show that randomly masking entire ROIs gives better model performance than randomly masking time points in the pre-training step, resulting in an average improvement of 10.8% for AUC and 9.3% for subject accuracy compared with the transformer model trained from scratch across different levels of training data availability. Our code is available on GitHub .

查看原文本刊更多论文

fMRI时间序列变压器自监督预训练任务在自闭症检测中的应用。

自闭症谱系障碍（ASD）是一种神经发育疾病，包括各种各样的症状和程度的损害，这使得诊断和治疗具有挑战性。功能磁共振成像（fMRI）已被广泛用于研究ASD患者的大脑活动，机器学习方法已被应用于分析静息状态fMRI （rs-fMRI）数据。然而，很少有研究在rs-fMRI数据上探索最近基于变压器的模型。考虑到变压器模型在捕获序列数据中的远程依赖关系方面的优势，我们开发了一个基于变压器的自监督框架，该框架可以直接分析时间序列fMRI数据，而无需计算功能连接。为了解决小数据集的过拟合问题并提高模型性能，我们提出了自监督预训练任务来重建随机掩蔽的fMRI时间序列数据，研究了各种掩蔽策略的影响。然后，我们对ASD分类任务的模型进行微调，并使用两个公共数据集和不同数量的训练数据进行五次交叉验证来评估它。实验表明，随机屏蔽整个roi比随机屏蔽预训练步骤中的时间点具有更好的模型性能，在不同的训练数据可用性水平上，与从头开始训练的变压器模型相比，AUC平均提高10.8%，受试者准确率平均提高9.3%。我们的代码可以在GitHub上找到。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Machine learning in clinical neuroimaging : 7th international workshop, MLCN 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 10, 2024, proceedings. MLCN (Workshop) (7th : 2024 : Marrakesh, Morocco)

自引率

0.00%

发文量