Federated learning for multi-omics: A performance evaluation in Parkinson’s disease

IF 6.7 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Benjamin P. Danek, Mary B. Makarious, Anant Dadu, Dan Vitale, Paul Suhwan Lee, Andrew B. Singleton, Mike A. Nalls, Jimeng Sun, Faraz Faghri
{"title":"Federated learning for multi-omics: A performance evaluation in Parkinson’s disease","authors":"Benjamin P. Danek, Mary B. Makarious, Anant Dadu, Dan Vitale, Paul Suhwan Lee, Andrew B. Singleton, Mike A. Nalls, Jimeng Sun, Faraz Faghri","doi":"10.1016/j.patter.2024.100945","DOIUrl":null,"url":null,"abstract":"While machine learning (ML) research has recently grown more in popularity, its application in the omics domain is constrained by access to sufficiently large, high-quality datasets needed to train ML models. Federated learning (FL) represents an opportunity to enable collaborative curation of such datasets among participating institutions. We compare the simulated performance of several models trained using FL against classically trained ML models on the task of multi-omics Parkinson’s disease prediction. We find that FL model performance tracks centrally trained ML models, where the most performant FL model achieves an AUC-PR of 0.876 ± 0.009, 0.014 ± 0.003 less than its centrally trained variation. We also determine that the dispersion of samples within a federation plays a meaningful role in model performance. Our study implements several open-source FL frameworks and aims to highlight some of the challenges and opportunities when applying these collaborative methods in multi-omics studies.","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"7 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Patterns","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.patter.2024.100945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

While machine learning (ML) research has recently grown more in popularity, its application in the omics domain is constrained by access to sufficiently large, high-quality datasets needed to train ML models. Federated learning (FL) represents an opportunity to enable collaborative curation of such datasets among participating institutions. We compare the simulated performance of several models trained using FL against classically trained ML models on the task of multi-omics Parkinson’s disease prediction. We find that FL model performance tracks centrally trained ML models, where the most performant FL model achieves an AUC-PR of 0.876 ± 0.009, 0.014 ± 0.003 less than its centrally trained variation. We also determine that the dispersion of samples within a federation plays a meaningful role in model performance. Our study implements several open-source FL frameworks and aims to highlight some of the challenges and opportunities when applying these collaborative methods in multi-omics studies.
多组学联合学习:帕金森病的性能评估
虽然机器学习(ML)研究近来越来越受欢迎,但其在全生命科学领域的应用却受限于获取训练 ML 模型所需的足够大的高质量数据集。联盟学习(FL)为参与机构之间合作整理此类数据集提供了机会。我们比较了在多组学帕金森病预测任务中使用 FL 训练的几个模型与经典训练的 ML 模型的模拟性能。我们发现,FL 模型的性能跟踪了集中训练的 ML 模型,其中性能最好的 FL 模型的 AUC-PR 为 0.876 ± 0.009,比其集中训练的变体低 0.014 ± 0.003。我们还确定,联盟内样本的分散性对模型性能有重要影响。我们的研究实施了几个开源的 FL 框架,旨在强调在多组学研究中应用这些协作方法时所面临的一些挑战和机遇。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Patterns
Patterns Decision Sciences-Decision Sciences (all)
CiteScore
10.60
自引率
4.60%
发文量
153
审稿时长
19 weeks
期刊介绍:
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信