The risk of maternal complications after cesarean delivery: Near-far matching for instrumental variables study designs with large observational datasets
{"title":"The risk of maternal complications after cesarean delivery: Near-far matching for instrumental variables study designs with large observational datasets","authors":"Ruoqi Yu, R. Kelz, S. Lorch, Luke J. Keele","doi":"10.1214/22-aoas1691","DOIUrl":null,"url":null,"abstract":"Cesarean delivery is used when there are problems with the placenta or umbilical cord, for twin pregnancies, and breech births. How-ever, research has found that Cesarean delivery increases the risk of maternal complications like blood transfusions and admission to the intensive care unit. Here, we study whether Cesarean delivery increases the risk of maternal complications using an instrumental variables study design to reduce bias from unobserved confounders. We use a variant of matching – near-far matching – to render our study design more plausible. In a near-far match, the investigator seeks to strengthen the effect of the instrument on the exposure while balanc-ing observable characteristics between groups of subjects with low and high values of the instrument. Extant near-far matching methods are computationally intensive for large data sets, and computing time can be very lengthy. To reduce the computational complexity of near-far matching in large observational studies, we apply an iterative form of Glover’s algorithm for a doubly convex bipartite graph to de-termine an optimal reverse caliper for the instrument, which reduces the number of candidate matches and allows for an optimal match in a large but much sparser graph. We also incorporate a variety of balance constraints, including exact matching, fine and near-fine balance, and covariate balance prioritization. We illustrate this new matching method using medical claims data from Pennsylvania, New York, and Florida. In our application, we match on physician’s pref-erences for delivery via Cesarean section, which is the instrument in our study. We compare the computing time from our match to extant methods, and we find that we can reduce the computational time required for the match by more than 11 hours. If our matched sample came from a paired randomized experiment, we could conclude that Cesarean delivery elevates the risk of maternal complications and increases the time spent in the hospital. Sensitivity analysis shows that the estimates for complications could be the result of a minor amount of confounding due to an unobserved covariate. The effects on the length of stay outcome, however, are more insensitive to hidden confounders.","PeriodicalId":188068,"journal":{"name":"The Annals of Applied Statistics","volume":"128 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Annals of Applied Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/22-aoas1691","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Cesarean delivery is used when there are problems with the placenta or umbilical cord, for twin pregnancies, and breech births. How-ever, research has found that Cesarean delivery increases the risk of maternal complications like blood transfusions and admission to the intensive care unit. Here, we study whether Cesarean delivery increases the risk of maternal complications using an instrumental variables study design to reduce bias from unobserved confounders. We use a variant of matching – near-far matching – to render our study design more plausible. In a near-far match, the investigator seeks to strengthen the effect of the instrument on the exposure while balanc-ing observable characteristics between groups of subjects with low and high values of the instrument. Extant near-far matching methods are computationally intensive for large data sets, and computing time can be very lengthy. To reduce the computational complexity of near-far matching in large observational studies, we apply an iterative form of Glover’s algorithm for a doubly convex bipartite graph to de-termine an optimal reverse caliper for the instrument, which reduces the number of candidate matches and allows for an optimal match in a large but much sparser graph. We also incorporate a variety of balance constraints, including exact matching, fine and near-fine balance, and covariate balance prioritization. We illustrate this new matching method using medical claims data from Pennsylvania, New York, and Florida. In our application, we match on physician’s pref-erences for delivery via Cesarean section, which is the instrument in our study. We compare the computing time from our match to extant methods, and we find that we can reduce the computational time required for the match by more than 11 hours. If our matched sample came from a paired randomized experiment, we could conclude that Cesarean delivery elevates the risk of maternal complications and increases the time spent in the hospital. Sensitivity analysis shows that the estimates for complications could be the result of a minor amount of confounding due to an unobserved covariate. The effects on the length of stay outcome, however, are more insensitive to hidden confounders.