Methods for Prioritizing Causal Genes in Molecular Studies of Human Disease: The State of the Art

IF 3.8 4区医学 Q3 GENETICS & HEREDITY

Genetic Epidemiology Pub Date : 2026-03-02 DOI:10.1002/gepi.70037

Karina Patasova, Bahar Sedaghati-Khayat, Rachel Knevel, Heather J. Cordell, Arthur G. Pratt

{"title":"Methods for Prioritizing Causal Genes in Molecular Studies of Human Disease: The State of the Art","authors":"Karina Patasova, Bahar Sedaghati-Khayat, Rachel Knevel, Heather J. Cordell, Arthur G. Pratt","doi":"10.1002/gepi.70037","DOIUrl":null,"url":null,"abstract":"<p>In the last decade, genome-wide association studies (GWAS) have identified tens of thousands of common variants associated with a wide array of complex traits and diseases. Integration of GWAS with molecular data has informed the development of statistical tools for causal gene discovery. In this paper, we give an overview of commonly used causal inference methods and discuss the strengths and limitations of colocalization, Mendelian randomization (MR) and network-based approaches. Colocalization is often used to assess whether the genetic association signals for two traits arise from the same causal variant, thereby strengthening inferred causal associations. MR was developed to tackle issues of confounding and reverse causality, providing a rigorous approach to causal inference and demonstrating improved false discovery rates. Unlike MR, network-based analyses employ a discovery approach and model complex relationships between multiple variables. All causal inference methods are, to varying degrees, susceptible to spurious associations due to genetic confounding, pleiotropy and linkage disequilibrium. Here, we discuss the latest developments in the field of causal gene inference and limitations of these methods. We give an overview of interplay between different approaches as well as practical applications with reference to published examples in context of heart disease.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"50 3","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2026-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12952701/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetic Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/gepi.70037","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}

引用次数: 0

Abstract

In the last decade, genome-wide association studies (GWAS) have identified tens of thousands of common variants associated with a wide array of complex traits and diseases. Integration of GWAS with molecular data has informed the development of statistical tools for causal gene discovery. In this paper, we give an overview of commonly used causal inference methods and discuss the strengths and limitations of colocalization, Mendelian randomization (MR) and network-based approaches. Colocalization is often used to assess whether the genetic association signals for two traits arise from the same causal variant, thereby strengthening inferred causal associations. MR was developed to tackle issues of confounding and reverse causality, providing a rigorous approach to causal inference and demonstrating improved false discovery rates. Unlike MR, network-based analyses employ a discovery approach and model complex relationships between multiple variables. All causal inference methods are, to varying degrees, susceptible to spurious associations due to genetic confounding, pleiotropy and linkage disequilibrium. Here, we discuss the latest developments in the field of causal gene inference and limitations of these methods. We give an overview of interplay between different approaches as well as practical applications with reference to published examples in context of heart disease.

Abstract Image

查看原文本刊更多论文

人类疾病分子研究中致病基因优先排序的方法：最新进展。

在过去的十年中，全基因组关联研究（GWAS）已经确定了数以万计的与一系列复杂性状和疾病相关的常见变异。GWAS与分子数据的整合为因果基因发现的统计工具的发展提供了信息。在本文中，我们概述了常用的因果推理方法，并讨论了共定位，孟德尔随机化（MR）和基于网络的方法的优势和局限性。共定位通常用于评估两个性状的遗传关联信号是否来自相同的因果变异，从而加强推断的因果关联。MR的发展是为了解决混淆和反向因果关系的问题，为因果推理提供了严格的方法，并证明了错误发现率的提高。与MR不同，基于网络的分析采用发现方法并对多个变量之间的复杂关系进行建模。由于遗传混杂、多效性和连锁不平衡，所有的因果推理方法都在不同程度上容易产生虚假的关联。在这里，我们讨论了因果基因推断领域的最新发展和这些方法的局限性。我们给出了不同的方法之间的相互作用的概述，以及参考在心脏疾病的背景下发表的例子实际应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Genetic Epidemiology 医学-公共卫生、环境卫生与职业卫生

CiteScore

4.40

自引率

9.50%

发文量

审稿时长

6-12 weeks

期刊介绍： Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations. Genetic Epidemiology primarily publishes papers in statistical genetics, a research field that is primarily concerned with development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.