Konrad J Karczewski,Rahul Gupta,Masahiro Kanai,Wenhan Lu,Kristin Tsuo,Ying Wang,Raymond K Walters,Patrick Turley,Shawneequa Callier,Nirav N Shah,Nikolas Baya,Duncan S Palmer,Jacqueline I Goldstein,Gopal Sarma,Matthew Solomonson,Nathan Cheng,Sam Bryant,Claire Churchhouse,Caroline M Cusick,Timothy Poterba,John Compitello,Daniel King,Wei Zhou,Cotton Seed,Hilary K Finucane,Mark J Daly,Benjamin M Neale,Elizabeth G Atkinson,Alicia R Martin
{"title":"Pan-UK Biobank genome-wide association analyses enhance discovery and resolution of ancestry-enriched effects.","authors":"Konrad J Karczewski,Rahul Gupta,Masahiro Kanai,Wenhan Lu,Kristin Tsuo,Ying Wang,Raymond K Walters,Patrick Turley,Shawneequa Callier,Nirav N Shah,Nikolas Baya,Duncan S Palmer,Jacqueline I Goldstein,Gopal Sarma,Matthew Solomonson,Nathan Cheng,Sam Bryant,Claire Churchhouse,Caroline M Cusick,Timothy Poterba,John Compitello,Daniel King,Wei Zhou,Cotton Seed,Hilary K Finucane,Mark J Daly,Benjamin M Neale,Elizabeth G Atkinson,Alicia R Martin","doi":"10.1038/s41588-025-02335-7","DOIUrl":null,"url":null,"abstract":"Large biobanks, such as the UK Biobank (UKB), enable massive phenome by genome-wide association studies that elucidate genetic etiology of complex traits. However, people from diverse genetic ancestry groups are often excluded from association analyses due to concerns about population structure introducing false positive associations. Here we generate mixed model associations and meta-analyses across genetic ancestry groups, inclusive of a larger fraction of the UK Biobank than previous efforts, to produce freely available summary statistics for 7,266 traits. We build a quality control and analysis framework informed by genetic architecture. Overall, we identify 14,676 significant loci (P < 5 × 10-8) in the meta-analysis that were not found in the EUR genetic ancestry group alone, including new associations, for example between CAMK2D and triglycerides. We also highlight associations from ancestry-enriched variation, including a known pleiotropic missense variant in G6PD associated with several biomarker traits. We release these results publicly alongside frequently asked questions that describe caveats for interpretation of results, enhancing available resources for interpretation of risk variants across diverse populations.","PeriodicalId":18985,"journal":{"name":"Nature genetics","volume":"16 1","pages":""},"PeriodicalIF":29.0000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41588-025-02335-7","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Large biobanks, such as the UK Biobank (UKB), enable massive phenome by genome-wide association studies that elucidate genetic etiology of complex traits. However, people from diverse genetic ancestry groups are often excluded from association analyses due to concerns about population structure introducing false positive associations. Here we generate mixed model associations and meta-analyses across genetic ancestry groups, inclusive of a larger fraction of the UK Biobank than previous efforts, to produce freely available summary statistics for 7,266 traits. We build a quality control and analysis framework informed by genetic architecture. Overall, we identify 14,676 significant loci (P < 5 × 10-8) in the meta-analysis that were not found in the EUR genetic ancestry group alone, including new associations, for example between CAMK2D and triglycerides. We also highlight associations from ancestry-enriched variation, including a known pleiotropic missense variant in G6PD associated with several biomarker traits. We release these results publicly alongside frequently asked questions that describe caveats for interpretation of results, enhancing available resources for interpretation of risk variants across diverse populations.
期刊介绍:
Nature Genetics publishes the very highest quality research in genetics. It encompasses genetic and functional genomic studies on human and plant traits and on other model organisms. Current emphasis is on the genetic basis for common and complex diseases and on the functional mechanism, architecture and evolution of gene networks, studied by experimental perturbation.
Integrative genetic topics comprise, but are not limited to:
-Genes in the pathology of human disease
-Molecular analysis of simple and complex genetic traits
-Cancer genetics
-Agricultural genomics
-Developmental genetics
-Regulatory variation in gene expression
-Strategies and technologies for extracting function from genomic data
-Pharmacological genomics
-Genome evolution