Self-Supervised Pre-training Tasks for an fMRI Time-series Transformer in Autism Detection.

Yinchi Zhou, Peiyu Duan, Yuexi Du, Nicha C Dvornek
{"title":"Self-Supervised Pre-training Tasks for an fMRI Time-series Transformer in Autism Detection.","authors":"Yinchi Zhou, Peiyu Duan, Yuexi Du, Nicha C Dvornek","doi":"10.1007/978-3-031-78761-4_14","DOIUrl":null,"url":null,"abstract":"<p><p>Autism Spectrum Disorder (ASD) is a neurodevelopmental condition that encompasses a wide variety of symptoms and degrees of impairment, which makes the diagnosis and treatment challenging. Functional magnetic resonance imaging (fMRI) has been extensively used to study brain activity in ASD, and machine learning methods have been applied to analyze resting state fMRI (rs-fMRI) data. However, fewer studies have explored the recent transformer-based models on rs-fMRI data. Given the superiority of transformer models in capturing long-range dependencies in sequence data, we have developed a transformer-based self-supervised framework that directly analyzes time-series fMRI data without computing functional connectivity. To address over-fitting in small datasets and enhance the model performance, we propose self-supervised pre-training tasks to reconstruct the randomly masked fMRI time-series data, investigating the effects of various masking strategies. We then fine-tune the model for the ASD classification task and evaluate it using two public datasets and five-fold cross-validation with different amounts of training data. The experiments show that randomly masking entire ROIs gives better model performance than randomly masking time points in the pre-training step, resulting in an average improvement of 10.8% for AUC and 9.3% for subject accuracy compared with the transformer model trained from scratch across different levels of training data availability. Our code is available on GitHub .</p>","PeriodicalId":520367,"journal":{"name":"Machine learning in clinical neuroimaging : 7th international workshop, MLCN 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 10, 2024, proceedings. MLCN (Workshop) (7th : 2024 : Marrakesh, Morocco)","volume":"15266 ","pages":"145-154"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11951341/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning in clinical neuroimaging : 7th international workshop, MLCN 2024, held in conjunction with MICCAI 2024, Marrakesh, Morocco, October 10, 2024, proceedings. MLCN (Workshop) (7th : 2024 : Marrakesh, Morocco)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-031-78761-4_14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/6 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Autism Spectrum Disorder (ASD) is a neurodevelopmental condition that encompasses a wide variety of symptoms and degrees of impairment, which makes the diagnosis and treatment challenging. Functional magnetic resonance imaging (fMRI) has been extensively used to study brain activity in ASD, and machine learning methods have been applied to analyze resting state fMRI (rs-fMRI) data. However, fewer studies have explored the recent transformer-based models on rs-fMRI data. Given the superiority of transformer models in capturing long-range dependencies in sequence data, we have developed a transformer-based self-supervised framework that directly analyzes time-series fMRI data without computing functional connectivity. To address over-fitting in small datasets and enhance the model performance, we propose self-supervised pre-training tasks to reconstruct the randomly masked fMRI time-series data, investigating the effects of various masking strategies. We then fine-tune the model for the ASD classification task and evaluate it using two public datasets and five-fold cross-validation with different amounts of training data. The experiments show that randomly masking entire ROIs gives better model performance than randomly masking time points in the pre-training step, resulting in an average improvement of 10.8% for AUC and 9.3% for subject accuracy compared with the transformer model trained from scratch across different levels of training data availability. Our code is available on GitHub .

fMRI时间序列变压器自监督预训练任务在自闭症检测中的应用。
自闭症谱系障碍(ASD)是一种神经发育疾病,包括各种各样的症状和程度的损害,这使得诊断和治疗具有挑战性。功能磁共振成像(fMRI)已被广泛用于研究ASD患者的大脑活动,机器学习方法已被应用于分析静息状态fMRI (rs-fMRI)数据。然而,很少有研究在rs-fMRI数据上探索最近基于变压器的模型。考虑到变压器模型在捕获序列数据中的远程依赖关系方面的优势,我们开发了一个基于变压器的自监督框架,该框架可以直接分析时间序列fMRI数据,而无需计算功能连接。为了解决小数据集的过拟合问题并提高模型性能,我们提出了自监督预训练任务来重建随机掩蔽的fMRI时间序列数据,研究了各种掩蔽策略的影响。然后,我们对ASD分类任务的模型进行微调,并使用两个公共数据集和不同数量的训练数据进行五次交叉验证来评估它。实验表明,随机屏蔽整个roi比随机屏蔽预训练步骤中的时间点具有更好的模型性能,在不同的训练数据可用性水平上,与从头开始训练的变压器模型相比,AUC平均提高10.8%,受试者准确率平均提高9.3%。我们的代码可以在GitHub上找到。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信