{"title":"分层随机化下协变量调整的统一框架","authors":"Fuyi Tu, Wei Ma, Hanzhong Liu","doi":"arxiv-2312.01266","DOIUrl":null,"url":null,"abstract":"Randomization, as a key technique in clinical trials, can eliminate sources\nof bias and produce comparable treatment groups. In randomized experiments, the\ntreatment effect is a parameter of general interest. Researchers have explored\nthe validity of using linear models to estimate the treatment effect and\nperform covariate adjustment and thus improve the estimation efficiency.\nHowever, the relationship between covariates and outcomes is not necessarily\nlinear, and is often intricate. Advances in statistical theory and related\ncomputer technology allow us to use nonparametric and machine learning methods\nto better estimate the relationship between covariates and outcomes and thus\nobtain further efficiency gains. However, theoretical studies on how to draw\nvalid inferences when using nonparametric and machine learning methods under\nstratified randomization are yet to be conducted. In this paper, we discuss a\nunified framework for covariate adjustment and corresponding statistical\ninference under stratified randomization and present a detailed proof of the\nvalidity of using local linear kernel-weighted least squares regression for\ncovariate adjustment in treatment effect estimators as a special case. In the\ncase of high-dimensional data, we additionally propose an algorithm for\nstatistical inference using machine learning methods under stratified\nrandomization, which makes use of sample splitting to alleviate the\nrequirements on the asymptotic properties of machine learning methods. Finally,\nwe compare the performances of treatment effect estimators using different\nmachine learning methods by considering various data generation scenarios, to\nguide practical research.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"83 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A unified framework for covariate adjustment under stratified randomization\",\"authors\":\"Fuyi Tu, Wei Ma, Hanzhong Liu\",\"doi\":\"arxiv-2312.01266\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Randomization, as a key technique in clinical trials, can eliminate sources\\nof bias and produce comparable treatment groups. In randomized experiments, the\\ntreatment effect is a parameter of general interest. Researchers have explored\\nthe validity of using linear models to estimate the treatment effect and\\nperform covariate adjustment and thus improve the estimation efficiency.\\nHowever, the relationship between covariates and outcomes is not necessarily\\nlinear, and is often intricate. Advances in statistical theory and related\\ncomputer technology allow us to use nonparametric and machine learning methods\\nto better estimate the relationship between covariates and outcomes and thus\\nobtain further efficiency gains. However, theoretical studies on how to draw\\nvalid inferences when using nonparametric and machine learning methods under\\nstratified randomization are yet to be conducted. In this paper, we discuss a\\nunified framework for covariate adjustment and corresponding statistical\\ninference under stratified randomization and present a detailed proof of the\\nvalidity of using local linear kernel-weighted least squares regression for\\ncovariate adjustment in treatment effect estimators as a special case. In the\\ncase of high-dimensional data, we additionally propose an algorithm for\\nstatistical inference using machine learning methods under stratified\\nrandomization, which makes use of sample splitting to alleviate the\\nrequirements on the asymptotic properties of machine learning methods. Finally,\\nwe compare the performances of treatment effect estimators using different\\nmachine learning methods by considering various data generation scenarios, to\\nguide practical research.\",\"PeriodicalId\":501330,\"journal\":{\"name\":\"arXiv - MATH - Statistics Theory\",\"volume\":\"83 2\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - MATH - Statistics Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2312.01266\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - MATH - Statistics Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2312.01266","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A unified framework for covariate adjustment under stratified randomization
Randomization, as a key technique in clinical trials, can eliminate sources
of bias and produce comparable treatment groups. In randomized experiments, the
treatment effect is a parameter of general interest. Researchers have explored
the validity of using linear models to estimate the treatment effect and
perform covariate adjustment and thus improve the estimation efficiency.
However, the relationship between covariates and outcomes is not necessarily
linear, and is often intricate. Advances in statistical theory and related
computer technology allow us to use nonparametric and machine learning methods
to better estimate the relationship between covariates and outcomes and thus
obtain further efficiency gains. However, theoretical studies on how to draw
valid inferences when using nonparametric and machine learning methods under
stratified randomization are yet to be conducted. In this paper, we discuss a
unified framework for covariate adjustment and corresponding statistical
inference under stratified randomization and present a detailed proof of the
validity of using local linear kernel-weighted least squares regression for
covariate adjustment in treatment effect estimators as a special case. In the
case of high-dimensional data, we additionally propose an algorithm for
statistical inference using machine learning methods under stratified
randomization, which makes use of sample splitting to alleviate the
requirements on the asymptotic properties of machine learning methods. Finally,
we compare the performances of treatment effect estimators using different
machine learning methods by considering various data generation scenarios, to
guide practical research.