一种提高混合模型关联能力的可伸缩变分推理方法

IF 31.7 1区 生物学 Q1 GENETICS & HEREDITY
Hrushikesh Loya, Georgios Kalantzis, Fergus Cooper, Pier Francesco Palamara
{"title":"一种提高混合模型关联能力的可伸缩变分推理方法","authors":"Hrushikesh Loya, Georgios Kalantzis, Fergus Cooper, Pier Francesco Palamara","doi":"10.1038/s41588-024-02044-7","DOIUrl":null,"url":null,"abstract":"The rapid growth of modern biobanks is creating new opportunities for large-scale genome-wide association studies (GWASs) and the analysis of complex traits. However, performing GWASs on millions of samples often leads to trade-offs between computational efficiency and statistical power, reducing the benefits of large-scale data collection efforts. We developed Quickdraws, a method that increases association power in quantitative and binary traits without sacrificing computational efficiency, leveraging a spike-and-slab prior on variant effects, stochastic variational inference and graphics processing unit acceleration. We applied Quickdraws to 79 quantitative and 50 binary traits in 405,088 UK Biobank samples, identifying 4.97% and 3.25% more associations than REGENIE and 22.71% and 7.07% more than FastGWA. Quickdraws had costs comparable to REGENIE, FastGWA and SAIGE on the UK Biobank Research Analysis Platform service, while being substantially faster than BOLT-LMM. These results highlight the promise of leveraging machine learning techniques for scalable GWASs without sacrificing power or robustness. Quickdraws is a mixed-model association tool with a noninfinitesimal prior for analyzing binary and quantitative traits, using a scalable variational inference that allows analysis of biobank-scale cohorts.","PeriodicalId":18985,"journal":{"name":"Nature genetics","volume":"57 2","pages":"461-468"},"PeriodicalIF":31.7000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41588-024-02044-7.pdf","citationCount":"0","resultStr":"{\"title\":\"A scalable variational inference approach for increased mixed-model association power\",\"authors\":\"Hrushikesh Loya, Georgios Kalantzis, Fergus Cooper, Pier Francesco Palamara\",\"doi\":\"10.1038/s41588-024-02044-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rapid growth of modern biobanks is creating new opportunities for large-scale genome-wide association studies (GWASs) and the analysis of complex traits. However, performing GWASs on millions of samples often leads to trade-offs between computational efficiency and statistical power, reducing the benefits of large-scale data collection efforts. We developed Quickdraws, a method that increases association power in quantitative and binary traits without sacrificing computational efficiency, leveraging a spike-and-slab prior on variant effects, stochastic variational inference and graphics processing unit acceleration. We applied Quickdraws to 79 quantitative and 50 binary traits in 405,088 UK Biobank samples, identifying 4.97% and 3.25% more associations than REGENIE and 22.71% and 7.07% more than FastGWA. Quickdraws had costs comparable to REGENIE, FastGWA and SAIGE on the UK Biobank Research Analysis Platform service, while being substantially faster than BOLT-LMM. These results highlight the promise of leveraging machine learning techniques for scalable GWASs without sacrificing power or robustness. Quickdraws is a mixed-model association tool with a noninfinitesimal prior for analyzing binary and quantitative traits, using a scalable variational inference that allows analysis of biobank-scale cohorts.\",\"PeriodicalId\":18985,\"journal\":{\"name\":\"Nature genetics\",\"volume\":\"57 2\",\"pages\":\"461-468\"},\"PeriodicalIF\":31.7000,\"publicationDate\":\"2025-01-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.nature.com/articles/s41588-024-02044-7.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature genetics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.nature.com/articles/s41588-024-02044-7\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature genetics","FirstCategoryId":"99","ListUrlMain":"https://www.nature.com/articles/s41588-024-02044-7","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

现代生物库的快速发展为大规模全基因组关联研究(GWASs)和复杂性状分析创造了新的机遇。然而,在数百万个样本上执行gwas通常会导致在计算效率和统计能力之间进行权衡,从而降低大规模数据收集工作的好处。我们开发了quickdraw,这是一种在不牺牲计算效率的情况下增加定量和二进制特征关联能力的方法,利用变异效应、随机变分推理和图形处理单元加速的spike-and-slab先验。我们将quickdraw应用于405,088份UK Biobank样本的79个定量性状和50个二元性状,鉴定出的关联比REGENIE高4.97%和3.25%,比FastGWA高22.71%和7.07%。quickdraw的成本与英国生物银行研究分析平台上的REGENIE、FastGWA和SAIGE相当,但比BOLT-LMM快得多。这些结果强调了在不牺牲功率或鲁棒性的情况下利用机器学习技术实现可扩展的GWASs的前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

A scalable variational inference approach for increased mixed-model association power

A scalable variational inference approach for increased mixed-model association power

A scalable variational inference approach for increased mixed-model association power
The rapid growth of modern biobanks is creating new opportunities for large-scale genome-wide association studies (GWASs) and the analysis of complex traits. However, performing GWASs on millions of samples often leads to trade-offs between computational efficiency and statistical power, reducing the benefits of large-scale data collection efforts. We developed Quickdraws, a method that increases association power in quantitative and binary traits without sacrificing computational efficiency, leveraging a spike-and-slab prior on variant effects, stochastic variational inference and graphics processing unit acceleration. We applied Quickdraws to 79 quantitative and 50 binary traits in 405,088 UK Biobank samples, identifying 4.97% and 3.25% more associations than REGENIE and 22.71% and 7.07% more than FastGWA. Quickdraws had costs comparable to REGENIE, FastGWA and SAIGE on the UK Biobank Research Analysis Platform service, while being substantially faster than BOLT-LMM. These results highlight the promise of leveraging machine learning techniques for scalable GWASs without sacrificing power or robustness. Quickdraws is a mixed-model association tool with a noninfinitesimal prior for analyzing binary and quantitative traits, using a scalable variational inference that allows analysis of biobank-scale cohorts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Nature genetics
Nature genetics 生物-遗传学
CiteScore
43.00
自引率
2.60%
发文量
241
审稿时长
3 months
期刊介绍: Nature Genetics publishes the very highest quality research in genetics. It encompasses genetic and functional genomic studies on human and plant traits and on other model organisms. Current emphasis is on the genetic basis for common and complex diseases and on the functional mechanism, architecture and evolution of gene networks, studied by experimental perturbation. Integrative genetic topics comprise, but are not limited to: -Genes in the pathology of human disease -Molecular analysis of simple and complex genetic traits -Cancer genetics -Agricultural genomics -Developmental genetics -Regulatory variation in gene expression -Strategies and technologies for extracting function from genomic data -Pharmacological genomics -Genome evolution
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信