Leveraging Global Genetics Resources to Enhance Polygenic Prediction Across Ancestrally Diverse Populations.

IF 3.3 Q2 GENETICS & HEREDITY
Oliver Pain
{"title":"Leveraging Global Genetics Resources to Enhance Polygenic Prediction Across Ancestrally Diverse Populations.","authors":"Oliver Pain","doi":"10.1016/j.xhgg.2025.100482","DOIUrl":null,"url":null,"abstract":"<p><p>Genome-wide association studies (GWAS) from multiple ancestral populations are increasingly available, offering opportunities to improve the accuracy and equity of polygenic scores (PGS). Several methods now aim to leverage multiple GWAS sources, but predictive performance and computational efficiency remain unclear, particularly when individual-level tuning data are unavailable. This study evaluates a comprehensive set of PGS methods across African (AFR), East Asian (EAS), and European (EUR) ancestries for 10 complex traits, using summary statistics from the Ugandan Genome Resource, Biobank Japan, UK Biobank, and the Million Veteran Program. Single-source PGS were derived using methods including DBSLMM, lassosum, LDpred2, MegaPRS, pT+clump, PRS-CS, QuickPRS, and SBayesRC. Multi-source approaches included PRS-CSx, TL-PRS, X-Wing, and combinations of independently optimised single-source scores. All methods were restricted to HapMap3 variants and used linkage disequilibrium reference panels matching the GWAS super population. A key contribution is a novel application of the LEOPARD method to estimate optimal linear combinations of population-specific PGS using only summary statistics. Analyses were implemented using the open-source GenoPred pipeline. In AFR and EAS populations, PGS combining ancestry-aligned and European GWAS outperformed single-source models. Linear combinations of independently optimised scores consistently outperformed current jointly optimised multi-source methods, while being substantially more computationally efficient. The LEOPARD extension offered a practical solution for tuning these combinations when only summary statistics were available, achieving performance comparable to tuning with individual-level data. These findings highlight a flexible and generalisable framework for multi-source PGS construction. The GenoPred pipeline supports more equitable, accurate, and accessible polygenic prediction.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100482"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"HGG Advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xhgg.2025.100482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Genome-wide association studies (GWAS) from multiple ancestral populations are increasingly available, offering opportunities to improve the accuracy and equity of polygenic scores (PGS). Several methods now aim to leverage multiple GWAS sources, but predictive performance and computational efficiency remain unclear, particularly when individual-level tuning data are unavailable. This study evaluates a comprehensive set of PGS methods across African (AFR), East Asian (EAS), and European (EUR) ancestries for 10 complex traits, using summary statistics from the Ugandan Genome Resource, Biobank Japan, UK Biobank, and the Million Veteran Program. Single-source PGS were derived using methods including DBSLMM, lassosum, LDpred2, MegaPRS, pT+clump, PRS-CS, QuickPRS, and SBayesRC. Multi-source approaches included PRS-CSx, TL-PRS, X-Wing, and combinations of independently optimised single-source scores. All methods were restricted to HapMap3 variants and used linkage disequilibrium reference panels matching the GWAS super population. A key contribution is a novel application of the LEOPARD method to estimate optimal linear combinations of population-specific PGS using only summary statistics. Analyses were implemented using the open-source GenoPred pipeline. In AFR and EAS populations, PGS combining ancestry-aligned and European GWAS outperformed single-source models. Linear combinations of independently optimised scores consistently outperformed current jointly optimised multi-source methods, while being substantially more computationally efficient. The LEOPARD extension offered a practical solution for tuning these combinations when only summary statistics were available, achieving performance comparable to tuning with individual-level data. These findings highlight a flexible and generalisable framework for multi-source PGS construction. The GenoPred pipeline supports more equitable, accurate, and accessible polygenic prediction.

利用全球遗传资源加强祖先多样性人群的多基因预测。
来自多个祖先群体的全基因组关联研究(GWAS)越来越多,为提高多基因评分(PGS)的准确性和公平性提供了机会。现在有几种方法旨在利用多个GWAS源,但是预测性能和计算效率仍然不清楚,特别是在无法获得个人级别调优数据的情况下。本研究利用来自乌干达基因组资源、日本生物银行、英国生物银行和百万退伍军人计划的汇总统计数据,对非洲(AFR)、东亚(EAS)和欧洲(EUR)祖先的10个复杂性状的综合PGS方法进行了评估。采用DBSLMM、lassosum、LDpred2、MegaPRS、pT+ cluster、PRS-CS、QuickPRS、SBayesRC等方法推导单源PGS。多源方法包括PRS-CSx、TL-PRS、X-Wing以及独立优化的单源评分组合。所有方法均局限于HapMap3变异,并使用与GWAS超级群体匹配的连锁不平衡参考面板。一个关键贡献是LEOPARD方法的新应用,该方法仅使用汇总统计来估计种群特定PGS的最佳线性组合。分析是使用开源GenoPred管道实现的。在AFR和EAS人群中,结合祖先对齐和欧洲GWAS的PGS优于单一来源模型。独立优化分数的线性组合始终优于当前联合优化的多源方法,同时具有更高的计算效率。LEOPARD扩展提供了一个实用的解决方案,可以在只有汇总统计数据可用时调优这些组合,从而实现与使用个人级别数据调优相当的性能。这些发现强调了多源PGS构建的灵活和通用框架。GenoPred管道支持更公平、更准确和更容易获得的多基因预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
HGG Advances
HGG Advances Biochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
4.30
自引率
4.50%
发文量
69
审稿时长
14 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信