Deep Learning Based Causal Inference for Large-Scale Combinatorial Experiments: Theory and Empirical Evidence

Proceedings of the 24th ACM Conference on Economics and Computation Pub Date : 2023-07-07 DOI:10.1145/3580507.3597718

Zikun Ye, Zhiqi Zhang, Dennis Zhang, Heng Zhang, Renyu Zhang

{"title":"Deep Learning Based Causal Inference for Large-Scale Combinatorial Experiments: Theory and Empirical Evidence","authors":"Zikun Ye, Zhiqi Zhang, Dennis Zhang, Heng Zhang, Renyu Zhang","doi":"10.1145/3580507.3597718","DOIUrl":null,"url":null,"abstract":"Large-scale online platforms launch hundreds of randomized experiments (a.k.a. A/B tests) every day to iterate their operations and marketing strategies, while the combinations of these treatments are typically not exhaustively tested. It triggers an important question of both academic and practical interests: Without observing the outcomes of all treatment combinations, how to estimate the causal effect of any treatment combination and identify the optimal treatment combination? We develop a novel framework combining deep learning and double machine learning to estimate the causal effect of any treatment combination for each user on the platform when observing only a small subset of treatment combinations. Our proposed framework (called debiased deep learning, DeDL) exploits Neyman orthogonality and combines interpretable and flexible structural layers in deep learning. We prove theoretically that this framework yields consistent and asymptotically normal estimators under mild assumptions, thus allowing for identifying the best treatment combination when only observing a few combinations. To empirically validate our method, we then collaborate with a large-scale video-sharing platform and implement our framework for three experiments involving three treatments where each combination of treatments is tested. When only observing a subset of treatment combinations, our DeDL approach significantly outperforms other benchmarks to accurately estimate and infer the average treatment effect (ATE) of any treatment combination and to identify the optimal treatment combination.","PeriodicalId":210555,"journal":{"name":"Proceedings of the 24th ACM Conference on Economics and Computation","volume":"103 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 24th ACM Conference on Economics and Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3580507.3597718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Large-scale online platforms launch hundreds of randomized experiments (a.k.a. A/B tests) every day to iterate their operations and marketing strategies, while the combinations of these treatments are typically not exhaustively tested. It triggers an important question of both academic and practical interests: Without observing the outcomes of all treatment combinations, how to estimate the causal effect of any treatment combination and identify the optimal treatment combination? We develop a novel framework combining deep learning and double machine learning to estimate the causal effect of any treatment combination for each user on the platform when observing only a small subset of treatment combinations. Our proposed framework (called debiased deep learning, DeDL) exploits Neyman orthogonality and combines interpretable and flexible structural layers in deep learning. We prove theoretically that this framework yields consistent and asymptotically normal estimators under mild assumptions, thus allowing for identifying the best treatment combination when only observing a few combinations. To empirically validate our method, we then collaborate with a large-scale video-sharing platform and implement our framework for three experiments involving three treatments where each combination of treatments is tested. When only observing a subset of treatment combinations, our DeDL approach significantly outperforms other benchmarks to accurately estimate and infer the average treatment effect (ATE) of any treatment combination and to identify the optimal treatment combination.

查看原文本刊更多论文

基于深度学习的大规模组合实验因果推理:理论与经验证据

大型在线平台每天都会启动数百个随机实验(又名A/B测试)来迭代他们的运营和营销策略，而这些治疗方法的组合通常没有经过详尽的测试。它引发了一个重要的学术和实践问题:在不观察所有治疗组合的结果的情况下，如何估计任何治疗组合的因果效应并确定最佳治疗组合?我们开发了一个结合深度学习和双机器学习的新框架，在只观察治疗组合的一小部分时，估计平台上每个用户的任何治疗组合的因果效应。我们提出的框架(称为去偏深度学习，DeDL)利用内曼正交性，并在深度学习中结合了可解释和灵活的结构层。我们从理论上证明了该框架在温和假设下产生一致和渐近正态估计，从而允许在仅观察少数组合时识别最佳治疗组合。为了从经验上验证我们的方法，我们与一个大型视频分享平台合作，并实施我们的三个实验框架，涉及三种治疗方法，其中每种治疗方法的组合都经过测试。当仅观察治疗组合的一个子集时，我们的DeDL方法在准确估计和推断任何治疗组合的平均治疗效果(ATE)并确定最佳治疗组合方面明显优于其他基准。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 24th ACM Conference on Economics and Computation

自引率

0.00%

发文量