Karina Patasova, Bahar Sedaghati-Khayat, Rachel Knevel, Heather J. Cordell, Arthur G. Pratt
{"title":"Methods for Prioritizing Causal Genes in Molecular Studies of Human Disease: The State of the Art","authors":"Karina Patasova, Bahar Sedaghati-Khayat, Rachel Knevel, Heather J. Cordell, Arthur G. Pratt","doi":"10.1002/gepi.70037","DOIUrl":null,"url":null,"abstract":"<p>In the last decade, genome-wide association studies (GWAS) have identified tens of thousands of common variants associated with a wide array of complex traits and diseases. Integration of GWAS with molecular data has informed the development of statistical tools for causal gene discovery. In this paper, we give an overview of commonly used causal inference methods and discuss the strengths and limitations of colocalization, Mendelian randomization (MR) and network-based approaches. Colocalization is often used to assess whether the genetic association signals for two traits arise from the same causal variant, thereby strengthening inferred causal associations. MR was developed to tackle issues of confounding and reverse causality, providing a rigorous approach to causal inference and demonstrating improved false discovery rates. Unlike MR, network-based analyses employ a discovery approach and model complex relationships between multiple variables. All causal inference methods are, to varying degrees, susceptible to spurious associations due to genetic confounding, pleiotropy and linkage disequilibrium. Here, we discuss the latest developments in the field of causal gene inference and limitations of these methods. We give an overview of interplay between different approaches as well as practical applications with reference to published examples in context of heart disease.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"50 3","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2026-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12952701/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gepi.70037","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
In the last decade, genome-wide association studies (GWAS) have identified tens of thousands of common variants associated with a wide array of complex traits and diseases. Integration of GWAS with molecular data has informed the development of statistical tools for causal gene discovery. In this paper, we give an overview of commonly used causal inference methods and discuss the strengths and limitations of colocalization, Mendelian randomization (MR) and network-based approaches. Colocalization is often used to assess whether the genetic association signals for two traits arise from the same causal variant, thereby strengthening inferred causal associations. MR was developed to tackle issues of confounding and reverse causality, providing a rigorous approach to causal inference and demonstrating improved false discovery rates. Unlike MR, network-based analyses employ a discovery approach and model complex relationships between multiple variables. All causal inference methods are, to varying degrees, susceptible to spurious associations due to genetic confounding, pleiotropy and linkage disequilibrium. Here, we discuss the latest developments in the field of causal gene inference and limitations of these methods. We give an overview of interplay between different approaches as well as practical applications with reference to published examples in context of heart disease.
期刊介绍:
Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations.
Genetic Epidemiology primarily publishes papers in statistical genetics, a research field that is primarily concerned with development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.