{"title":"因果推断和基因组研究:鲁宾、珀尔和孟德尔随机化。","authors":"Rodolfo Juan Carlos Cantet, Just Jensen","doi":"10.1111/jbg.12898","DOIUrl":null,"url":null,"abstract":"<p><p>Although Genome Wide Analysis (GWAS) have been widely used to understand the genetic architecture of complex quantitative traits, interpreting their results in terms of the biological processes that determine those traits has been difficult or even lacking, because of the variability in responses to the tests of hypotheses within a trait, species, and breed or cross, and the lack of follow-up studies. It is then essential employing appropriate statistical tests that point out to the causal genes responsible of the relevant fraction of the genetic variability observed. We briefly review the main theoretical aspects of the two schools of causal inference (Rubin's Causal Model, RCM, and Pearl's causal inference, PCI). RCM approachs the hypothesis testing from a randomization perspective by considering a wider space of the observation, i.e. the \"potential outcomes\", rather than the narrower space that results from defining \"treatment\" effects after observing the data. Next, we discuss the assumptions involved to meet the requirements of randomization for RCM with observational data (non-designed experiments) with special emphasis on the Stable Unit Treatment Analysis (SUTVA). Due to the presence of \"confounders\" (i.e. systematic fixed effects, environmental permanent effects, interaction among genes, etc.), causal average treatment effects are viewed through the familiar lens of normal linear (or mixed) models. To overcome the difficulties of association analyses, a tests of causal effects is introduced using independent predicted residual breeding values from animal models of genetic evaluation that avoids the effects of population structure and confounder effects. An independent section discusses the issue of whether the additive effects defined at the \"gene\" level by R. A. Fisher and popularized in D. S. Falconer's textbook of quantitative genetics can be termed causal from either RCM or PCI.</p>","PeriodicalId":54885,"journal":{"name":"Journal of Animal Breeding and Genetics","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Causal inference and GWAS: Rubin, Pearl, and Mendelian randomization.\",\"authors\":\"Rodolfo Juan Carlos Cantet, Just Jensen\",\"doi\":\"10.1111/jbg.12898\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Although Genome Wide Analysis (GWAS) have been widely used to understand the genetic architecture of complex quantitative traits, interpreting their results in terms of the biological processes that determine those traits has been difficult or even lacking, because of the variability in responses to the tests of hypotheses within a trait, species, and breed or cross, and the lack of follow-up studies. It is then essential employing appropriate statistical tests that point out to the causal genes responsible of the relevant fraction of the genetic variability observed. We briefly review the main theoretical aspects of the two schools of causal inference (Rubin's Causal Model, RCM, and Pearl's causal inference, PCI). RCM approachs the hypothesis testing from a randomization perspective by considering a wider space of the observation, i.e. the \\\"potential outcomes\\\", rather than the narrower space that results from defining \\\"treatment\\\" effects after observing the data. Next, we discuss the assumptions involved to meet the requirements of randomization for RCM with observational data (non-designed experiments) with special emphasis on the Stable Unit Treatment Analysis (SUTVA). Due to the presence of \\\"confounders\\\" (i.e. systematic fixed effects, environmental permanent effects, interaction among genes, etc.), causal average treatment effects are viewed through the familiar lens of normal linear (or mixed) models. To overcome the difficulties of association analyses, a tests of causal effects is introduced using independent predicted residual breeding values from animal models of genetic evaluation that avoids the effects of population structure and confounder effects. An independent section discusses the issue of whether the additive effects defined at the \\\"gene\\\" level by R. A. Fisher and popularized in D. S. Falconer's textbook of quantitative genetics can be termed causal from either RCM or PCI.</p>\",\"PeriodicalId\":54885,\"journal\":{\"name\":\"Journal of Animal Breeding and Genetics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Animal Breeding and Genetics\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1111/jbg.12898\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AGRICULTURE, DAIRY & ANIMAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Animal Breeding and Genetics","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1111/jbg.12898","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}
引用次数: 0
摘要
尽管全基因组分析(GWAS)已被广泛用于了解复杂数量性状的遗传结构,但由于在性状、物种、品种或杂交中对假设检验的反应存在差异,而且缺乏后续研究,因此从决定这些性状的生物学过程的角度解释其结果一直很困难,甚至是缺乏。因此,必须采用适当的统计检验方法,找出造成所观察到的遗传变异的相关基因。我们简要回顾一下因果推断的两个流派(鲁宾因果模型 RCM 和珀尔因果推断 PCI)的主要理论方面。RCM 从随机化的角度进行假设检验,考虑的是更广阔的观察空间,即 "潜在结果",而不是观察数据后定义 "治疗 "效果所产生的狭窄空间。接下来,我们将讨论使用观察数据(非设计实验)进行 RCM 随机化所需的假设条件,并特别强调稳定单位处理分析 (SUTVA)。由于存在 "混杂因素"(即系统固定效应、环境永久效应、基因间的交互作用等),因果平均处理效应需要通过我们熟悉的正态线性(或混合)模型来观察。为了克服关联分析的困难,利用遗传评估动物模型的独立预测育种残值引入了因果效应检验,避免了种群结构和混杂效应的影响。有一个独立的章节讨论了 R. A. Fisher 在 "基因 "水平上定义并在 D. S. Falconer 的定量遗传学教科书中推广的加法效应是否可以从 RCM 或 PCI 中称为因果效应的问题。
Causal inference and GWAS: Rubin, Pearl, and Mendelian randomization.
Although Genome Wide Analysis (GWAS) have been widely used to understand the genetic architecture of complex quantitative traits, interpreting their results in terms of the biological processes that determine those traits has been difficult or even lacking, because of the variability in responses to the tests of hypotheses within a trait, species, and breed or cross, and the lack of follow-up studies. It is then essential employing appropriate statistical tests that point out to the causal genes responsible of the relevant fraction of the genetic variability observed. We briefly review the main theoretical aspects of the two schools of causal inference (Rubin's Causal Model, RCM, and Pearl's causal inference, PCI). RCM approachs the hypothesis testing from a randomization perspective by considering a wider space of the observation, i.e. the "potential outcomes", rather than the narrower space that results from defining "treatment" effects after observing the data. Next, we discuss the assumptions involved to meet the requirements of randomization for RCM with observational data (non-designed experiments) with special emphasis on the Stable Unit Treatment Analysis (SUTVA). Due to the presence of "confounders" (i.e. systematic fixed effects, environmental permanent effects, interaction among genes, etc.), causal average treatment effects are viewed through the familiar lens of normal linear (or mixed) models. To overcome the difficulties of association analyses, a tests of causal effects is introduced using independent predicted residual breeding values from animal models of genetic evaluation that avoids the effects of population structure and confounder effects. An independent section discusses the issue of whether the additive effects defined at the "gene" level by R. A. Fisher and popularized in D. S. Falconer's textbook of quantitative genetics can be termed causal from either RCM or PCI.
期刊介绍:
The Journal of Animal Breeding and Genetics publishes original articles by international scientists on genomic selection, and any other topic related to breeding programmes, selection, quantitative genetic, genomics, diversity and evolution of domestic animals. Researchers, teachers, and the animal breeding industry will find the reports of interest. Book reviews appear in many issues.