Seo Young Park, Jaeil Ahn, Jae Hoon Lee, Jaewoo Kwon, Hana Lee
{"title":"Overlap Weights for Binary Outcomes: A Performance Assessment.","authors":"Seo Young Park, Jaeil Ahn, Jae Hoon Lee, Jaewoo Kwon, Hana Lee","doi":"10.1002/pds.70253","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Inverse probability weighting (IPW) is a widely used method to estimate the causal effect of treatment from observational data. However, it can be unstable when extreme propensity score (PS) values lead to very large weights. Overlap weights (OW), which emphasize subjects in areas of covariate overlap, reduce the influence of extreme PS without excluding participants. While the OW method has shown strong performance in simulations with continuous outcomes, its utility in binary outcome settings-common in health research-has not been thoroughly evaluated.</p><p><strong>Methods: </strong>We conducted simulation studies to evaluate the performance of OW in comparison to other PS weighting methods including IPW, trimmed IPW, and matching weights, in settings with extreme PS values and a binary outcome. Using simulated datasets with varying degrees of PS overlap and treatment prevalence, we assessed covariate balance and treatment effect estimation performance. The performance of the PS weighting methods was further illustrated through an application to data from a study on pancreatic ductal adenocarcinoma.</p><p><strong>Results: </strong>In simulation studies, IPW's performance deteriorated markedly as the overlap in the covariate distribution decreased. In contrast, OW achieved exact covariate balance and consistently showed the highest efficiency among all methods evaluated. In the application to real-world data characterized by low treatment prevalence and substantial covariate imbalance, OW also outperformed the other methods in terms of both standard error and covariate balance.</p><p><strong>Conclusion: </strong>These findings suggest superior performance of OW in terms of covariate balance and estimation efficiency in settings with extreme PS and a binary outcome.</p>","PeriodicalId":19782,"journal":{"name":"Pharmacoepidemiology and Drug Safety","volume":"34 11","pages":"e70253"},"PeriodicalIF":2.4000,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pharmacoepidemiology and Drug Safety","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/pds.70253","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Inverse probability weighting (IPW) is a widely used method to estimate the causal effect of treatment from observational data. However, it can be unstable when extreme propensity score (PS) values lead to very large weights. Overlap weights (OW), which emphasize subjects in areas of covariate overlap, reduce the influence of extreme PS without excluding participants. While the OW method has shown strong performance in simulations with continuous outcomes, its utility in binary outcome settings-common in health research-has not been thoroughly evaluated.
Methods: We conducted simulation studies to evaluate the performance of OW in comparison to other PS weighting methods including IPW, trimmed IPW, and matching weights, in settings with extreme PS values and a binary outcome. Using simulated datasets with varying degrees of PS overlap and treatment prevalence, we assessed covariate balance and treatment effect estimation performance. The performance of the PS weighting methods was further illustrated through an application to data from a study on pancreatic ductal adenocarcinoma.
Results: In simulation studies, IPW's performance deteriorated markedly as the overlap in the covariate distribution decreased. In contrast, OW achieved exact covariate balance and consistently showed the highest efficiency among all methods evaluated. In the application to real-world data characterized by low treatment prevalence and substantial covariate imbalance, OW also outperformed the other methods in terms of both standard error and covariate balance.
Conclusion: These findings suggest superior performance of OW in terms of covariate balance and estimation efficiency in settings with extreme PS and a binary outcome.
期刊介绍:
The aim of Pharmacoepidemiology and Drug Safety is to provide an international forum for the communication and evaluation of data, methods and opinion in the discipline of pharmacoepidemiology. The Journal publishes peer-reviewed reports of original research, invited reviews and a variety of guest editorials and commentaries embracing scientific, medical, statistical, legal and economic aspects of pharmacoepidemiology and post-marketing surveillance of drug safety. Appropriate material in these categories may also be considered for publication as a Brief Report.
Particular areas of interest include:
design, analysis, results, and interpretation of studies looking at the benefit or safety of specific pharmaceuticals, biologics, or medical devices, including studies in pharmacovigilance, postmarketing surveillance, pharmacoeconomics, patient safety, molecular pharmacoepidemiology, or any other study within the broad field of pharmacoepidemiology;
comparative effectiveness research relating to pharmaceuticals, biologics, and medical devices. Comparative effectiveness research is the generation and synthesis of evidence that compares the benefits and harms of alternative methods to prevent, diagnose, treat, and monitor a clinical condition, as these methods are truly used in the real world;
methodologic contributions of relevance to pharmacoepidemiology, whether original contributions, reviews of existing methods, or tutorials for how to apply the methods of pharmacoepidemiology;
assessments of harm versus benefit in drug therapy;
patterns of drug utilization;
relationships between pharmacoepidemiology and the formulation and interpretation of regulatory guidelines;
evaluations of risk management plans and programmes relating to pharmaceuticals, biologics and medical devices.