RANDOMIZATION-BASED CAUSAL INFERENCE FROM POSSIBLY UNBALANCED SPLIT-PLOT DESIGNS

R. Mukherjee, Tirthankar Dasgupta
{"title":"RANDOMIZATION-BASED CAUSAL INFERENCE FROM POSSIBLY UNBALANCED SPLIT-PLOT DESIGNS","authors":"R. Mukherjee, Tirthankar Dasgupta","doi":"10.20472/iac.2019.051.027","DOIUrl":null,"url":null,"abstract":"Factorial experiments are currently undergoing a popularity surge in social and behavioral sciences. A key challenge here arises from randomization restrictions. Consider an experiment to assess the causal effects of two factors, expert review and teacher bonus scheme, on 40 schools in a state. A completely randomized assignment can disperse the schools undergoing review all over the state, thus entailing prohibitively high cost. A practical alternative is to divide these schools by geographic proximity into four groups called whole-plots, two of which are randomly assigned to expert review. The teacher bonus scheme is then applied to half of the schools chosen randomly within each whole-plot. This is an example of a classic split-plot design. Randomization-based analysis, avoiding rigid linear model assumptions, is the most natural methodology to draw causal inference from finite population split-plot experiments as above. Recently, Zhao, Ding, Mukerjee and Dasgupta (2018, Annals of Statistics) investigated this for balanced split-plot designs, where whole-plots are of equal size. However, this can often pose practical difficulty in social sciences. Thus, if the 40 schools are spread over four counties with 8, 8, 12 and 12 schools, then each county is a natural whole-plot, the design is unbalanced, and the analysis in Zhao et al. (2018) is not applicable.We investigate causal inference in split-plot designs that are possibly unbalanced, using the potential outcomes framework. We start with an unbiased estimator of a typical treatment contrast and first examine how far Zhao et al.?s (2018) approach can be adapted to our more general setup. It is seen that this approach, aided by a variable transformation, yields an expression for the sampling variance of the treatment contrast estimator but runs into difficulty in variance estimation. Specifically, as in the balanced case and elsewhere in causal inference (Mukerjee, Dasgupta and Rubin, 2018, Journal of the American Statistical Association), the resulting variance estimator is conservative, i.e., has a nonnegative bias. But, unlike most standard situations, the bias does not vanish even under strict additivity of treatment effects. To overcome this problem, a careful matrix analysis is employed leading to a new variance estimator which is also conservative, but enjoys the nice property of becoming unbiased under a condition even milder than strict additivity. We also discuss the issue of minimaxity with a view to controlling the bias in variance estimation, and explore the bias via simulations.","PeriodicalId":419018,"journal":{"name":"Proceedings of the 51st International Academic Conference, Vienna","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 51st International Academic Conference, Vienna","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20472/iac.2019.051.027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Factorial experiments are currently undergoing a popularity surge in social and behavioral sciences. A key challenge here arises from randomization restrictions. Consider an experiment to assess the causal effects of two factors, expert review and teacher bonus scheme, on 40 schools in a state. A completely randomized assignment can disperse the schools undergoing review all over the state, thus entailing prohibitively high cost. A practical alternative is to divide these schools by geographic proximity into four groups called whole-plots, two of which are randomly assigned to expert review. The teacher bonus scheme is then applied to half of the schools chosen randomly within each whole-plot. This is an example of a classic split-plot design. Randomization-based analysis, avoiding rigid linear model assumptions, is the most natural methodology to draw causal inference from finite population split-plot experiments as above. Recently, Zhao, Ding, Mukerjee and Dasgupta (2018, Annals of Statistics) investigated this for balanced split-plot designs, where whole-plots are of equal size. However, this can often pose practical difficulty in social sciences. Thus, if the 40 schools are spread over four counties with 8, 8, 12 and 12 schools, then each county is a natural whole-plot, the design is unbalanced, and the analysis in Zhao et al. (2018) is not applicable.We investigate causal inference in split-plot designs that are possibly unbalanced, using the potential outcomes framework. We start with an unbiased estimator of a typical treatment contrast and first examine how far Zhao et al.?s (2018) approach can be adapted to our more general setup. It is seen that this approach, aided by a variable transformation, yields an expression for the sampling variance of the treatment contrast estimator but runs into difficulty in variance estimation. Specifically, as in the balanced case and elsewhere in causal inference (Mukerjee, Dasgupta and Rubin, 2018, Journal of the American Statistical Association), the resulting variance estimator is conservative, i.e., has a nonnegative bias. But, unlike most standard situations, the bias does not vanish even under strict additivity of treatment effects. To overcome this problem, a careful matrix analysis is employed leading to a new variance estimator which is also conservative, but enjoys the nice property of becoming unbiased under a condition even milder than strict additivity. We also discuss the issue of minimaxity with a view to controlling the bias in variance estimation, and explore the bias via simulations.
从可能不平衡的分裂图设计中随机化的因果推断
析因实验目前在社会科学和行为科学领域受到广泛欢迎。这里的一个关键挑战来自随机化限制。考虑一个实验来评估两个因素的因果关系,专家评审和教师奖金计划,在一个州的40所学校。一个完全随机的分配可能会把正在接受审查的学校分散到整个州,从而导致过高的成本。一个实际的替代方案是将这些学校按地理位置划分为四组,称为整体地块,其中两组随机分配给专家评审。然后,教师奖金计划应用于每个整块区域内随机选择的一半学校。这是一个典型的分裂情节设计的例子。基于随机化的分析,避免了严格的线性模型假设,是从上述有限总体分裂图实验中得出因果推理的最自然的方法。最近,Zhao, Ding, Mukerjee和Dasgupta (2018, Annals of Statistics)对平衡的分块设计进行了研究,其中整块大小相等。然而,这往往会给社会科学带来实际困难。因此,如果这40所学校分布在4个县,分别是8所、8所、12所和12所学校,那么每个县是一个自然的整块地,设计是不平衡的,Zhao et al.(2018)的分析不适用。我们使用潜在结果框架研究了可能不平衡的分裂图设计中的因果推理。我们从典型治疗对比的无偏估计量开始,首先检查赵等人在多大程度上。S(2018)的方法可以适应我们更一般的设置。可以看出,这种方法在变量变换的帮助下,产生了处理对比估计器的抽样方差的表达式,但在方差估计中遇到了困难。具体来说,就像平衡情况和其他因果推理一样(Mukerjee, Dasgupta和Rubin, 2018,《美国统计协会杂志》),得到的方差估计量是保守的,即具有非负偏倚。但是,与大多数标准情况不同,即使在治疗效果的严格可加性下,这种偏差也不会消失。为了克服这个问题,采用了仔细的矩阵分析,得到了一个新的方差估计量,它也是保守的,但在比严格可加性更温和的条件下具有无偏性。为了控制方差估计中的偏差,我们还讨论了极小值问题,并通过仿真对偏差进行了探讨。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信