{"title":"Average Causal Effect Estimation in DAGs with Hidden Variables: Extensions of Back-Door and Front-Door Criteria","authors":"Anna Guo, Razieh Nabi","doi":"arxiv-2409.03962","DOIUrl":null,"url":null,"abstract":"The identification theory for causal effects in directed acyclic graphs\n(DAGs) with hidden variables is well-developed, but methods for estimating and\ninferring functionals beyond the g-formula remain limited. Previous studies\nhave proposed semiparametric estimators for identifiable functionals in a broad\nclass of DAGs with hidden variables. While demonstrating double robustness in\nsome models, existing estimators face challenges, particularly with density\nestimation and numerical integration for continuous variables, and their\nestimates may fall outside the parameter space of the target estimand. Their\nasymptotic properties are also underexplored, especially when using flexible\nstatistical and machine learning models for nuisance estimation. This study\naddresses these challenges by introducing novel one-step corrected plug-in and\ntargeted minimum loss-based estimators of causal effects for a class of DAGs\nthat extend classical back-door and front-door criteria (known as the treatment\nprimal fixability criterion in prior literature). These estimators leverage\nmachine learning to minimize modeling assumptions while ensuring key\nstatistical properties such as asymptotic linearity, double robustness,\nefficiency, and staying within the bounds of the target parameter space. We\nestablish conditions for nuisance functional estimates in terms of L2(P)-norms\nto achieve root-n consistent causal effect estimates. To facilitate practical\napplication, we have developed the flexCausal package in R.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"60 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03962","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The identification theory for causal effects in directed acyclic graphs
(DAGs) with hidden variables is well-developed, but methods for estimating and
inferring functionals beyond the g-formula remain limited. Previous studies
have proposed semiparametric estimators for identifiable functionals in a broad
class of DAGs with hidden variables. While demonstrating double robustness in
some models, existing estimators face challenges, particularly with density
estimation and numerical integration for continuous variables, and their
estimates may fall outside the parameter space of the target estimand. Their
asymptotic properties are also underexplored, especially when using flexible
statistical and machine learning models for nuisance estimation. This study
addresses these challenges by introducing novel one-step corrected plug-in and
targeted minimum loss-based estimators of causal effects for a class of DAGs
that extend classical back-door and front-door criteria (known as the treatment
primal fixability criterion in prior literature). These estimators leverage
machine learning to minimize modeling assumptions while ensuring key
statistical properties such as asymptotic linearity, double robustness,
efficiency, and staying within the bounds of the target parameter space. We
establish conditions for nuisance functional estimates in terms of L2(P)-norms
to achieve root-n consistent causal effect estimates. To facilitate practical
application, we have developed the flexCausal package in R.
在具有隐藏变量的有向无环图(DAG)中,因果效应的识别理论已经发展得很成熟,但除了 g 公式之外,估计和推断函数的方法仍然有限。以前的研究提出了在一类广泛的具有隐藏变量的有向无环图(DAG)中可识别函数的半参数估计方法。现有估计器在某些模型中表现出双重稳健性,但也面临挑战,尤其是连续变量的密度测定和数值积分,而且估计结果可能超出目标估计值的参数空间。它们的渐近特性也未得到充分探索,尤其是在使用灵活的统计和机器学习模型进行滋扰估计时。本研究针对这些挑战,为一类 DAG 引入了新颖的一步校正插件和基于目标最小损失的因果效应估计器,它们扩展了经典的后门和前门标准(在以前的文献中称为处理原始固定性标准)。这些估计器利用机器学习来最小化建模假设,同时确保关键的统计特性,如渐近线性、双重稳健性、效率以及不超出目标参数空间的边界。我们用 L2(P)规范建立了滋扰函数估计的条件,以实现根 n 一致的因果效应估计。为了便于实际应用,我们在 R 中开发了 flexCausal 软件包。