Statistics and Its Interface: Latest Articles

Variable selection for doubly robust causal inference.

IF 0.7, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2025-01-01 Epub Date: 2024-10-22 DOI: 10.4310/sii.241023040813

Eunah Cho, Shu Yang

Abstract: Confounding control is crucial and yet challenging for causal inference based on observational studies. Under the typical unconfoundedness assumption, augmented inverse probability weighting (AIPW) has been popular for estimating the average causal effect (ACE) due to its double robustness, in the sense that it requires only one of the propensity score model or the outcome mean model to be correctly specified. To ensure the key assumption holds, an effort is often made to collect a sufficiently rich set of pretreatment variables, rendering variable selection imperative. It is well known that variable selection for the propensity score targeted for accurate prediction may produce a variable ACE estimator by including instrumental variables. Thus, many recent works recommend selecting all outcome predictors for both confounding control and efficient estimation. This article shows that the AIPW estimator with variable selection targeted for efficient estimation may lose the desirable double robustness property. Instead, we propose controlling the propensity score model for any covariate that is a predictor of either the treatment or the outcome or both, which preserves the double robustness of the AIPW estimator. Using this principle, we propose a two-stage procedure with penalization for variable selection and the AIPW estimator for estimation. We show the proposed procedure benefits from the desirable double robustness property. We evaluate the finite-sample performance of the AIPW estimator with various variable selection criteria through simulation and an application.

Statistics and Its Interface, Vol. 18 (1), pp. 93-105. Journal Article. Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12395465/pdf/
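The double robustness described in the abstract above can be illustrated with a minimal sketch. It uses the standard AIPW form of the ACE estimator on simulated data; the function names `aipw_ace` and `simulate`, the data-generating process, and the plugged-in nuisance values are all illustrative assumptions. It is not the authors' two-stage procedure and performs no variable selection or penalization.

```python
import math
import random


def aipw_ace(y, a, e_hat, m1_hat, m0_hat):
    """AIPW estimate of the ACE from per-unit nuisance values.

    y: outcomes; a: 0/1 treatment indicators; e_hat: propensity scores;
    m1_hat, m0_hat: predicted outcome means under treatment and control.
    """
    total = 0.0
    for yi, ai, ei, m1, m0 in zip(y, a, e_hat, m1_hat, m0_hat):
        total += (m1 - m0
                  + ai * (yi - m1) / ei
                  - (1 - ai) * (yi - m0) / (1 - ei))
    return total / len(y)


def simulate(n, seed=0):
    """Hypothetical data: one confounder x, true ACE equal to 2."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    e = [1.0 / (1.0 + math.exp(-0.5 * xi)) for xi in x]
    a = [1 if rng.random() < ei else 0 for ei in e]
    y = [2.0 * ai + xi + rng.gauss(0.0, 1.0) for ai, xi in zip(a, x)]
    return x, e, a, y


n = 20000
x, e_true, a, y = simulate(n)

# Correct outcome model, misspecified (constant) propensity score.
est_outcome_ok = aipw_ace(y, a, [0.5] * n, [xi + 2.0 for xi in x], x)

# Correct propensity score, misspecified (all-zero) outcome model.
est_ps_ok = aipw_ace(y, a, e_true, [0.0] * n, [0.0] * n)

print(est_outcome_ok, est_ps_ok)  # both should land near the true ACE of 2
```

In this sketch each estimate has exactly one nuisance model misspecified, which is the situation double robustness is meant to cover; misspecifying both at once would generally give a biased estimate.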

Citations: 0

High-dimensional Bayesian mediation analysis with adaptive Laplace priors.

IF 0.7, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2025-01-01 Epub Date: 2025-05-21 DOI: 10.4310/sii.250521220937

Qingzhao Yu, Joseph Hagan, Xiaocheng Wu, Jennifer Richmond-Bryant, Norman Urbanek, Bin Li

Abstract: The mediation analysis method is used to investigate effects of mediators that intervene in the pathways between an exposure variable and an outcome variable. Bayesian methods are naturally used in mediation analysis due to the hierarchical structure of Bayesian models. This paper introduces an innovative adaptive Bayesian mediation analysis method that incorporates adaptive Laplace priors into the predictive model to account for high-dimensional mediators. This approach introduces a penalization function on the estimated direct and indirect effects rather than solely on the coefficients of predictive models. Consequently, estimated effects that lack statistical significance may shrink to zero, facilitating a more robust analysis. We demonstrate the efficacy of our adaptive mediation analysis method on simulations and on a Louisiana triple negative breast cancer (TNBC) dataset to examine racial disparity in diagnosed stage among TNBC patients diagnosed between 2010 and 2017. The dataset is linked to the 2017 hazardous air pollutant emissions burden estimation database using patients' residential address. We effectively explain a portion of the disparity using currently collected variables. The analysis identifies crucial mediators and confounders, highlighting the significance of variables such as age of diagnosis, insurance status, tumor grades, and the concentration of Naphtha in the air.

Statistics and Its Interface, Vol. 18 (4), pp. 445-457. Journal Article. Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12962590/pdf/

Citations: 0

Composite quantile regression based robust empirical likelihood for partially linear spatial autoregressive models

IF 0.8, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2024-07-19 DOI: 10.4310/22-sii764

Peixin Zhao, Suli Cheng, Xiaoshuang Zhou

Citations: 0

A consistent specification test for functional linear quantile regression models

IF 0.8, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2024-07-19 DOI: 10.4310/22-sii754

Lili Xia, Zhongzhan Zhang, Gongming Shi

Citations: 0

A latent class selection model for categorical response variables with nonignorably missing data

IF 0.8, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2024-07-19 DOI: 10.4310/22-sii753

Jung Wun Lee, Ofer Harel

Citations: 0

Empirical likelihood-based weighted estimation of average treatment effects in randomized clinical trials with missing outcomes

IF 0.8, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2024-07-19 DOI: 10.4310/sii.2024.v17.n4.a7

Yuanyao Tan, Xialing Wen, Wei Liang, Ying Yan

Citations: 0

Modeling and identifiability of non-homogenous Poisson process cure rate model

IF 0.8, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2024-07-19 DOI: 10.4310/22-sii763

Soorya Surendren, Asha Gopalakrishnan, Anup Dewanji

Citations: 0

Variable selection and estimation for high-dimensional partially linear spatial autoregressive models with measurement errors

IF 0.8, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2024-07-19 DOI: 10.4310/22-sii758

Zhensheng Huang, Shuyu Meng, Linlin Zhang

Citations: 0

A double regression method for graphical modeling of high-dimensional nonlinear and non-Gaussian data

IF 0.8, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2024-07-19 DOI: 10.4310/22-sii756

Siqi Liang, Faming Liang

Citations: 0

Flexible quasi-beta prime regression models for dependent continuous positive data

IF 0.8, Zone 4 (Mathematics)

Statistics and Its Interface Pub Date : 2024-07-19 DOI: 10.4310/22-sii762

João Freitas, Juvêncio Nobre, Caio Azevedo

Abstract: In many situations of interest, it is common to observe positive responses measured along several assessment conditions, within the same subjects. Usually, such a scenario implies a positive skewness on the response distributions, along with the existence of within-subject dependency. It is known that neglecting these features can lead to a misleading inference. In this paper we extend the beta prime regression model for modeling asymmetric positive data, while taking into account the dependence structure. We consider a useful predictor for modeling a suitable transformation of the mean, along with homogeneous covariance structure. The proposed model is an interesting competitor of the flexible Tweedie regression models, which include distributions such as Gamma and Inverse Gaussian. Furthermore, residual analysis and influence diagnostic tools are proposed. A Monte Carlo experiment is conducted to evaluate the performance of the proposed methodology, under small and moderate sample sizes, along with suitable discussions. The methodology is illustrated with the analysis of a real longitudinal dataset. An R package was developed to allow the practitioners to use the methodology described in this paper.

Statistics and Its Interface, volume field as listed: "67 1" (pages not given). Journal Article. DOI link: https://doi.org/10.4310/22-sii762

Citations: 0