A unified framework for covariate adjustment under stratified randomization

Fuyi Tu, Wei Ma, Hanzhong Liu
{"title":"A unified framework for covariate adjustment under stratified randomization","authors":"Fuyi Tu, Wei Ma, Hanzhong Liu","doi":"arxiv-2312.01266","DOIUrl":null,"url":null,"abstract":"Randomization, as a key technique in clinical trials, can eliminate sources\nof bias and produce comparable treatment groups. In randomized experiments, the\ntreatment effect is a parameter of general interest. Researchers have explored\nthe validity of using linear models to estimate the treatment effect and\nperform covariate adjustment and thus improve the estimation efficiency.\nHowever, the relationship between covariates and outcomes is not necessarily\nlinear, and is often intricate. Advances in statistical theory and related\ncomputer technology allow us to use nonparametric and machine learning methods\nto better estimate the relationship between covariates and outcomes and thus\nobtain further efficiency gains. However, theoretical studies on how to draw\nvalid inferences when using nonparametric and machine learning methods under\nstratified randomization are yet to be conducted. In this paper, we discuss a\nunified framework for covariate adjustment and corresponding statistical\ninference under stratified randomization and present a detailed proof of the\nvalidity of using local linear kernel-weighted least squares regression for\ncovariate adjustment in treatment effect estimators as a special case. In the\ncase of high-dimensional data, we additionally propose an algorithm for\nstatistical inference using machine learning methods under stratified\nrandomization, which makes use of sample splitting to alleviate the\nrequirements on the asymptotic properties of machine learning methods. Finally,\nwe compare the performances of treatment effect estimators using different\nmachine learning methods by considering various data generation scenarios, to\nguide practical research.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"83 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - MATH - Statistics Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2312.01266","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Randomization, as a key technique in clinical trials, can eliminate sources of bias and produce comparable treatment groups. In randomized experiments, the treatment effect is a parameter of general interest. Researchers have explored the validity of using linear models to estimate the treatment effect and perform covariate adjustment and thus improve the estimation efficiency. However, the relationship between covariates and outcomes is not necessarily linear, and is often intricate. Advances in statistical theory and related computer technology allow us to use nonparametric and machine learning methods to better estimate the relationship between covariates and outcomes and thus obtain further efficiency gains. However, theoretical studies on how to draw valid inferences when using nonparametric and machine learning methods under stratified randomization are yet to be conducted. In this paper, we discuss a unified framework for covariate adjustment and corresponding statistical inference under stratified randomization and present a detailed proof of the validity of using local linear kernel-weighted least squares regression for covariate adjustment in treatment effect estimators as a special case. In the case of high-dimensional data, we additionally propose an algorithm for statistical inference using machine learning methods under stratified randomization, which makes use of sample splitting to alleviate the requirements on the asymptotic properties of machine learning methods. Finally, we compare the performances of treatment effect estimators using different machine learning methods by considering various data generation scenarios, to guide practical research.
分层随机化下协变量调整的统一框架
随机化作为临床试验的一项关键技术,可以消除偏倚来源,产生可比较的治疗组。在随机实验中,治疗效果是一个普遍关注的参数。研究人员探索了使用线性模型估计治疗效果并进行协变量调整的有效性,从而提高了估计效率。然而,协变量和结果之间的关系并不一定是线性的,而且往往是复杂的。统计理论和相关计算机技术的进步使我们能够使用非参数和机器学习方法来更好地估计协变量和结果之间的关系,从而进一步提高效率。然而,如何在分层随机化下使用非参数和机器学习方法得出有效推论的理论研究尚未开展。本文讨论了分层随机化条件下协变量平差的统一框架和相应的统计推断,并以局部线性核加权最小二乘回归作为一个特例,详细证明了在治疗效果估计中使用协变量平差的有效性。在高维数据的情况下,我们还提出了一种在分层场随机化下使用机器学习方法进行统计推断的算法,该算法利用样本分裂来减轻机器学习方法对渐近性质的要求。最后,我们通过考虑不同的数据生成场景,比较了使用不同机器学习方法的治疗效果估计器的性能,以指导实际研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信