Archit Singh, Lorraine Southam, Konstantinos Hatzikotoulas, Nigel W. Rayner, Ken Suzuki, Henry J. Taylor, Xianyong Yin, Ravi Mandla, Alicia Huerta-Chagoya, Andrew P. Morris, Eleftheria Zeggini, Ozvan Bocher
{"title":"Correcting for Genomic Inflation Leads to Loss of Power in Large-Scale Genome-Wide Association Study Meta-Analysis","authors":"Archit Singh, Lorraine Southam, Konstantinos Hatzikotoulas, Nigel W. Rayner, Ken Suzuki, Henry J. Taylor, Xianyong Yin, Ravi Mandla, Alicia Huerta-Chagoya, Andrew P. Morris, Eleftheria Zeggini, Ozvan Bocher","doi":"10.1002/gepi.70016","DOIUrl":null,"url":null,"abstract":"<p>Inflation in genome-wide association studies (GWAS) summary statistics represents a major challenge, for which correction methods have been developed. These include the genomic control (GC) method, which uses the λ-value to correct summary statistics, and the linkage disequilibrium score regression (LDSR) method, which uses the LDSR intercept. By using type 2 diabetes (T2D) as an exemplar, we explore factors influencing λ-values and the impact of these corrections on association signals. We find that larger sample sizes increase λ-values due to increased captured polygenicity, while including lower frequency variants decreases λ-values due to reduced power. Comparing T2D genetic associations described in overlapping GWAS meta-analyses of increasing sample size, we find that GC correction reduces the false positive rate and leads to the loss of robust associations. In one of the largest meta-analysis, GC correction results in 39.7% loss of independent loci, substantially reducing the number of detected associations. In comparison, the LDSR intercept correction leads to a loss of up to 25.2% of the independent loci, being therefore less conservative than the GC correction. We conclude that in large, well-powered GWAS meta-analysis of polygenic traits, both GC and LDSR intercept correction leads to power loss, highlighting the need for improved genomic inflation correction methods.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"49 6","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.70016","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gepi.70016","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Inflation in genome-wide association studies (GWAS) summary statistics represents a major challenge, for which correction methods have been developed. These include the genomic control (GC) method, which uses the λ-value to correct summary statistics, and the linkage disequilibrium score regression (LDSR) method, which uses the LDSR intercept. By using type 2 diabetes (T2D) as an exemplar, we explore factors influencing λ-values and the impact of these corrections on association signals. We find that larger sample sizes increase λ-values due to increased captured polygenicity, while including lower frequency variants decreases λ-values due to reduced power. Comparing T2D genetic associations described in overlapping GWAS meta-analyses of increasing sample size, we find that GC correction reduces the false positive rate and leads to the loss of robust associations. In one of the largest meta-analysis, GC correction results in 39.7% loss of independent loci, substantially reducing the number of detected associations. In comparison, the LDSR intercept correction leads to a loss of up to 25.2% of the independent loci, being therefore less conservative than the GC correction. We conclude that in large, well-powered GWAS meta-analysis of polygenic traits, both GC and LDSR intercept correction leads to power loss, highlighting the need for improved genomic inflation correction methods.
期刊介绍:
Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations.
Genetic Epidemiology primarily publishes papers in statistical genetics, a research field that is primarily concerned with development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.