DUNE:一种多功能神经成像编码器,可捕获3种主要疾病的大脑复杂性:癌症、痴呆和精神分裂症。

IF 11.8 2区 生物学 Q1 MULTIDISCIPLINARY SCIENCES
Thomas Barba, Bryce A Bagley, Sandra Steyaert, Francisco Carrillo-Perez, Christoph Sadée, Michael Iv, Olivier Gevaert
{"title":"DUNE:一种多功能神经成像编码器,可捕获3种主要疾病的大脑复杂性:癌症、痴呆和精神分裂症。","authors":"Thomas Barba, Bryce A Bagley, Sandra Steyaert, Francisco Carrillo-Perez, Christoph Sadée, Michael Iv, Olivier Gevaert","doi":"10.1093/gigascience/giaf116","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Magnetic resonance imaging (MRI) of the brain contains complex data that pose significant challenges for computational analysis. While models proposed for brain MRI analyses yield encouraging results, the high complexity of neuroimaging data hinders generalizability and clinical application. We introduce DUNE, a neuroimaging-oriented workflow that transforms raw brain MRI scans into standardized compact patient-level embeddings through integrated preprocessing and deep feature extraction, thereby enabling their processing by basic machine learning algorithms. A UNet-based autoencoder was trained using 3,814 selected scans of morphologically normal (healthy volunteers) or abnormal (glioma patients) brains, to generate comprehensive compact representations of the full-sized images. To evaluate their quality, these embeddings were utilized to train machine learning models to predict a wide range of clinical variables.</p><p><strong>Results: </strong>Embeddings were extracted for cohorts used for the model development (21,102 individuals), along with 3 additional independent cohorts (Alzheimer's disease, schizophrenia, and glioma cohorts, 1,322 individuals), to evaluate the model's generalization capabilities. The embeddings extracted from healthy volunteers' scans could predict a broad spectrum of clinical parameters, including volumetry metrics, cardiovascular disease (area under the receiver operating characteristic curve [AUROC] = 0.80) and alcohol consumption (AUROC = 0.99), and more nuanced parameters such as the Alzheimer's predisposing APOE4 allele (AUROC = 0.67). Embeddings derived from the validation cohorts successfully predicted the diagnoses of Alzheimer's dementia (AUROC = 0.92) and schizophrenia (AUROC = 0.64). Embeddings extracted from glioma scans successfully predicted survival (C-index = 0.608) and IDH molecular status (AUROC = 0.92), matching the performances of previous task-oriented models.</p><p><strong>Conclusion: </strong>DUNE efficiently represents clinically relevant patterns from full-size brain MRI scans across several disease areas, opening ways for innovative clinical applications in neurology.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":"14 ","pages":""},"PeriodicalIF":11.8000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527335/pdf/","citationCount":"0","resultStr":"{\"title\":\"DUNE: a versatile neuroimaging encoder captures brain complexity across 3 major diseases: cancer, dementia, and schizophrenia.\",\"authors\":\"Thomas Barba, Bryce A Bagley, Sandra Steyaert, Francisco Carrillo-Perez, Christoph Sadée, Michael Iv, Olivier Gevaert\",\"doi\":\"10.1093/gigascience/giaf116\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Magnetic resonance imaging (MRI) of the brain contains complex data that pose significant challenges for computational analysis. While models proposed for brain MRI analyses yield encouraging results, the high complexity of neuroimaging data hinders generalizability and clinical application. We introduce DUNE, a neuroimaging-oriented workflow that transforms raw brain MRI scans into standardized compact patient-level embeddings through integrated preprocessing and deep feature extraction, thereby enabling their processing by basic machine learning algorithms. A UNet-based autoencoder was trained using 3,814 selected scans of morphologically normal (healthy volunteers) or abnormal (glioma patients) brains, to generate comprehensive compact representations of the full-sized images. To evaluate their quality, these embeddings were utilized to train machine learning models to predict a wide range of clinical variables.</p><p><strong>Results: </strong>Embeddings were extracted for cohorts used for the model development (21,102 individuals), along with 3 additional independent cohorts (Alzheimer's disease, schizophrenia, and glioma cohorts, 1,322 individuals), to evaluate the model's generalization capabilities. The embeddings extracted from healthy volunteers' scans could predict a broad spectrum of clinical parameters, including volumetry metrics, cardiovascular disease (area under the receiver operating characteristic curve [AUROC] = 0.80) and alcohol consumption (AUROC = 0.99), and more nuanced parameters such as the Alzheimer's predisposing APOE4 allele (AUROC = 0.67). Embeddings derived from the validation cohorts successfully predicted the diagnoses of Alzheimer's dementia (AUROC = 0.92) and schizophrenia (AUROC = 0.64). Embeddings extracted from glioma scans successfully predicted survival (C-index = 0.608) and IDH molecular status (AUROC = 0.92), matching the performances of previous task-oriented models.</p><p><strong>Conclusion: </strong>DUNE efficiently represents clinically relevant patterns from full-size brain MRI scans across several disease areas, opening ways for innovative clinical applications in neurology.</p>\",\"PeriodicalId\":12581,\"journal\":{\"name\":\"GigaScience\",\"volume\":\"14 \",\"pages\":\"\"},\"PeriodicalIF\":11.8000,\"publicationDate\":\"2025-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527335/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GigaScience\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/gigascience/giaf116\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GigaScience","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/gigascience/giaf116","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

背景:大脑的磁共振成像(MRI)包含复杂的数据,对计算分析构成重大挑战。虽然为脑MRI分析提出的模型产生了令人鼓舞的结果,但神经成像数据的高度复杂性阻碍了推广和临床应用。我们介绍了DUNE,这是一种面向神经成像的工作流程,通过集成预处理和深度特征提取,将原始的大脑MRI扫描转换为标准化的紧凑的患者级嵌入,从而使其能够通过基本的机器学习算法进行处理。使用3814张形态学正常(健康志愿者)或异常(神经胶质瘤患者)大脑的扫描图来训练基于unet的自动编码器,以生成全尺寸图像的全面紧凑表示。为了评估它们的质量,这些嵌入被用来训练机器学习模型来预测广泛的临床变量。结果:提取了用于模型开发的队列(21,102个体)的嵌入,以及3个额外的独立队列(阿尔茨海默病,精神分裂症和胶质瘤队列,1,322个体),以评估模型的泛化能力。从健康志愿者的扫描中提取的嵌入可以预测广泛的临床参数,包括体积指标、心血管疾病(受试者工作特征曲线下面积[AUROC] = 0.80)和酒精消耗(AUROC = 0.99),以及更细微的参数,如阿尔茨海默氏症易感性APOE4等位基因(AUROC = 0.67)。来自验证队列的嵌入成功地预测了阿尔茨海默氏痴呆(AUROC = 0.92)和精神分裂症(AUROC = 0.64)的诊断。从胶质瘤扫描中提取的嵌入成功地预测了生存(C-index = 0.608)和IDH分子状态(AUROC = 0.92),与之前的任务导向模型的性能相匹配。结论:DUNE有效地代表了多个疾病区域的全尺寸脑MRI扫描的临床相关模式,为神经病学的创新临床应用开辟了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
DUNE: a versatile neuroimaging encoder captures brain complexity across 3 major diseases: cancer, dementia, and schizophrenia.

Background: Magnetic resonance imaging (MRI) of the brain contains complex data that pose significant challenges for computational analysis. While models proposed for brain MRI analyses yield encouraging results, the high complexity of neuroimaging data hinders generalizability and clinical application. We introduce DUNE, a neuroimaging-oriented workflow that transforms raw brain MRI scans into standardized compact patient-level embeddings through integrated preprocessing and deep feature extraction, thereby enabling their processing by basic machine learning algorithms. A UNet-based autoencoder was trained using 3,814 selected scans of morphologically normal (healthy volunteers) or abnormal (glioma patients) brains, to generate comprehensive compact representations of the full-sized images. To evaluate their quality, these embeddings were utilized to train machine learning models to predict a wide range of clinical variables.

Results: Embeddings were extracted for cohorts used for the model development (21,102 individuals), along with 3 additional independent cohorts (Alzheimer's disease, schizophrenia, and glioma cohorts, 1,322 individuals), to evaluate the model's generalization capabilities. The embeddings extracted from healthy volunteers' scans could predict a broad spectrum of clinical parameters, including volumetry metrics, cardiovascular disease (area under the receiver operating characteristic curve [AUROC] = 0.80) and alcohol consumption (AUROC = 0.99), and more nuanced parameters such as the Alzheimer's predisposing APOE4 allele (AUROC = 0.67). Embeddings derived from the validation cohorts successfully predicted the diagnoses of Alzheimer's dementia (AUROC = 0.92) and schizophrenia (AUROC = 0.64). Embeddings extracted from glioma scans successfully predicted survival (C-index = 0.608) and IDH molecular status (AUROC = 0.92), matching the performances of previous task-oriented models.

Conclusion: DUNE efficiently represents clinically relevant patterns from full-size brain MRI scans across several disease areas, opening ways for innovative clinical applications in neurology.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
GigaScience
GigaScience MULTIDISCIPLINARY SCIENCES-
CiteScore
15.50
自引率
1.10%
发文量
119
审稿时长
1 weeks
期刊介绍: GigaScience seeks to transform data dissemination and utilization in the life and biomedical sciences. As an online open-access open-data journal, it specializes in publishing "big-data" studies encompassing various fields. Its scope includes not only "omic" type data and the fields of high-throughput biology currently serviced by large public repositories, but also the growing range of more difficult-to-access data, such as imaging, neuroscience, ecology, cohort data, systems biology and other new types of large-scale shareable data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信