Double machine learning methods for estimating average treatment effects: a comparative study.

IF 1.2 4区医学 Q4 PHARMACOLOGY & PHARMACY

Journal of Biopharmaceutical Statistics Pub Date : 2025-04-21 DOI:10.1080/10543406.2025.2489281

Xiaoqing Tan, Shu Yang, Wenyu Ye, Douglas E Faries, Ilya Lipkovich, Zbigniew Kadziola

{"title":"Double machine learning methods for estimating average treatment effects: a comparative study.","authors":"Xiaoqing Tan, Shu Yang, Wenyu Ye, Douglas E Faries, Ilya Lipkovich, Zbigniew Kadziola","doi":"10.1080/10543406.2025.2489281","DOIUrl":null,"url":null,"abstract":"<p><p>Observational cohort studies are increasingly being used for comparative effectiveness research to assess the safety of therapeutics. Recently, various doubly robust methods have been proposed for average treatment effect estimation by combining the treatment model and the outcome model via different vehicles, such as matching, weighting, and regression. The key advantage of doubly robust estimators is that they require either the treatment model or the outcome model to be correctly specified to obtain a consistent estimator of average treatment effects, and therefore lead to a more accurate and often more precise inference. However, little work has been done to understand how doubly robust estimators differ due to their unique strategies of using the treatment and outcome models and how machine learning techniques can be combined to boost their performance, which we call double machine learning estimators. Here, we examine multiple popular doubly robust methods and compare their performance using different treatment and outcome modeling via extensive simulations and a real-world application. We found that incorporating machine learning with doubly robust estimators such as the targeted maximum likelihood estimator gives the best overall performance. Practical guidance on how to apply doubly robust estimators is provided.</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"1-20"},"PeriodicalIF":1.2000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biopharmaceutical Statistics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/10543406.2025.2489281","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}

引用次数: 0

Abstract

Observational cohort studies are increasingly being used for comparative effectiveness research to assess the safety of therapeutics. Recently, various doubly robust methods have been proposed for average treatment effect estimation by combining the treatment model and the outcome model via different vehicles, such as matching, weighting, and regression. The key advantage of doubly robust estimators is that they require either the treatment model or the outcome model to be correctly specified to obtain a consistent estimator of average treatment effects, and therefore lead to a more accurate and often more precise inference. However, little work has been done to understand how doubly robust estimators differ due to their unique strategies of using the treatment and outcome models and how machine learning techniques can be combined to boost their performance, which we call double machine learning estimators. Here, we examine multiple popular doubly robust methods and compare their performance using different treatment and outcome modeling via extensive simulations and a real-world application. We found that incorporating machine learning with doubly robust estimators such as the targeted maximum likelihood estimator gives the best overall performance. Practical guidance on how to apply doubly robust estimators is provided.

查看原文本刊更多论文

估计平均治疗效果的双机器学习方法：比较研究。

观察性队列研究越来越多地被用于比较有效性研究，以评估治疗方法的安全性。近年来，人们通过匹配、加权、回归等不同手段，将治疗模型与结果模型相结合，提出了多种双稳健的平均治疗效果估计方法。双鲁棒估计器的主要优点是，它们需要正确指定治疗模型或结果模型，以获得平均治疗效果的一致估计量，从而导致更准确且通常更精确的推断。然而，由于双鲁棒估计器使用治疗和结果模型的独特策略，以及如何结合机器学习技术来提高其性能，我们称之为双机器学习估计器，因此很少有研究了解双鲁棒估计器的不同之处。在这里，我们研究了多种流行的双鲁棒方法，并通过广泛的模拟和现实世界的应用，使用不同的处理和结果建模来比较它们的性能。我们发现，将机器学习与双重鲁棒估计器（如目标最大似然估计器）相结合，可以获得最佳的整体性能。给出了如何应用双鲁棒估计的实用指导。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Biopharmaceutical Statistics 医学-统计学与概率论

CiteScore

2.50

自引率

18.20%

发文量

审稿时长

6-12 weeks

期刊介绍： The Journal of Biopharmaceutical Statistics, a rapid publication journal, discusses quality applications of statistics in biopharmaceutical research and development. Now publishing six times per year, it includes expositions of statistical methodology with immediate applicability to biopharmaceutical research in the form of full-length and short manuscripts, review articles, selected/invited conference papers, short articles, and letters to the editor. Addressing timely and provocative topics important to the biostatistical profession, the journal covers: Drug, device, and biological research and development; Drug screening and drug design; Assessment of pharmacological activity; Pharmaceutical formulation and scale-up; Preclinical safety assessment; Bioavailability, bioequivalence, and pharmacokinetics; Phase, I, II, and III clinical development including complex innovative designs; Premarket approval assessment of clinical safety; Postmarketing surveillance; Big data and artificial intelligence and applications.