{"title":"Incorporating external data for analyzing randomized clinical trials: A transfer learning approach","authors":"Yujia Gu, Hanzhong Liu, Wei Ma","doi":"arxiv-2409.04126","DOIUrl":null,"url":null,"abstract":"Randomized clinical trials are the gold standard for analyzing treatment\neffects, but high costs and ethical concerns can limit recruitment, potentially\nleading to invalid inferences. Incorporating external trial data with similar\ncharacteristics into the analysis using transfer learning appears promising for\naddressing these issues. In this paper, we present a formal framework for\napplying transfer learning to the analysis of clinical trials, considering\nthree key perspectives: transfer algorithm, theoretical foundation, and\ninference method. For the algorithm, we adopt a parameter-based transfer\nlearning approach to enhance the lasso-adjusted stratum-specific estimator\ndeveloped for estimating treatment effects. A key component in constructing the\ntransfer learning estimator is deriving the regression coefficient estimates\nwithin each stratum, accounting for the bias between source and target data. To\nprovide a theoretical foundation, we derive the $l_1$ convergence rate for the\nestimated regression coefficients and establish the asymptotic normality of the\ntransfer learning estimator. Our results show that when external trial data\nresembles current trial data, the sample size requirements can be reduced\ncompared to using only the current trial data. Finally, we propose a consistent\nnonparametric variance estimator to facilitate inference. Numerical studies\ndemonstrate the effectiveness and robustness of our proposed estimator across\nvarious scenarios.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"42 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.04126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Randomized clinical trials are the gold standard for analyzing treatment
effects, but high costs and ethical concerns can limit recruitment, potentially
leading to invalid inferences. Incorporating external trial data with similar
characteristics into the analysis using transfer learning appears promising for
addressing these issues. In this paper, we present a formal framework for
applying transfer learning to the analysis of clinical trials, considering
three key perspectives: transfer algorithm, theoretical foundation, and
inference method. For the algorithm, we adopt a parameter-based transfer
learning approach to enhance the lasso-adjusted stratum-specific estimator
developed for estimating treatment effects. A key component in constructing the
transfer learning estimator is deriving the regression coefficient estimates
within each stratum, accounting for the bias between source and target data. To
provide a theoretical foundation, we derive the $l_1$ convergence rate for the
estimated regression coefficients and establish the asymptotic normality of the
transfer learning estimator. Our results show that when external trial data
resembles current trial data, the sample size requirements can be reduced
compared to using only the current trial data. Finally, we propose a consistent
nonparametric variance estimator to facilitate inference. Numerical studies
demonstrate the effectiveness and robustness of our proposed estimator across
various scenarios.