{"title":"CausalGPS:一个用于连续曝光因果推理的R包","authors":"Naeem Khoshnevis, Xiao Wu, Danielle Braun","doi":"arxiv-2310.00561","DOIUrl":null,"url":null,"abstract":"Quantifying the causal effects of continuous exposures on outcomes of\ninterest is critical for social, economic, health, and medical research.\nHowever, most existing software packages focus on binary exposures. We develop\nthe CausalGPS R package that implements a collection of algorithms to provide\nalgorithmic solutions for causal inference with continuous exposures. CausalGPS\nimplements a causal inference workflow, with algorithms based on generalized\npropensity scores (GPS) as the core, extending propensity scores (the\nprobability of a unit being exposed given pre-exposure covariates) from binary\nto continuous exposures. As the first step, the package implements efficient\nand flexible estimations of the GPS, allowing multiple user-specified modeling\noptions. As the second step, the package provides two ways to adjust for\nconfounding: weighting and matching, generating weighted and matched data sets,\nrespectively. Lastly, the package provides built-in functions to fit flexible\nparametric, semi-parametric, or non-parametric regression models on the\nweighted or matched data to estimate the exposure-response function relating\nthe outcome with the exposures. The computationally intensive tasks are\nimplemented in C++, and efficient shared-memory parallelization is achieved by\nOpenMP API. This paper outlines the main components of the CausalGPS R package\nand demonstrates its application to assess the effect of long-term exposure to\nPM2.5 on educational attainment using zip code-level data from the contiguous\nUnited States from 2000-2016.","PeriodicalId":501256,"journal":{"name":"arXiv - CS - Mathematical Software","volume":"19 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CausalGPS: An R Package for Causal Inference With Continuous Exposures\",\"authors\":\"Naeem Khoshnevis, Xiao Wu, Danielle Braun\",\"doi\":\"arxiv-2310.00561\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Quantifying the causal effects of continuous exposures on outcomes of\\ninterest is critical for social, economic, health, and medical research.\\nHowever, most existing software packages focus on binary exposures. We develop\\nthe CausalGPS R package that implements a collection of algorithms to provide\\nalgorithmic solutions for causal inference with continuous exposures. CausalGPS\\nimplements a causal inference workflow, with algorithms based on generalized\\npropensity scores (GPS) as the core, extending propensity scores (the\\nprobability of a unit being exposed given pre-exposure covariates) from binary\\nto continuous exposures. As the first step, the package implements efficient\\nand flexible estimations of the GPS, allowing multiple user-specified modeling\\noptions. As the second step, the package provides two ways to adjust for\\nconfounding: weighting and matching, generating weighted and matched data sets,\\nrespectively. Lastly, the package provides built-in functions to fit flexible\\nparametric, semi-parametric, or non-parametric regression models on the\\nweighted or matched data to estimate the exposure-response function relating\\nthe outcome with the exposures. The computationally intensive tasks are\\nimplemented in C++, and efficient shared-memory parallelization is achieved by\\nOpenMP API. This paper outlines the main components of the CausalGPS R package\\nand demonstrates its application to assess the effect of long-term exposure to\\nPM2.5 on educational attainment using zip code-level data from the contiguous\\nUnited States from 2000-2016.\",\"PeriodicalId\":501256,\"journal\":{\"name\":\"arXiv - CS - Mathematical Software\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Mathematical Software\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2310.00561\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Mathematical Software","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2310.00561","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CausalGPS: An R Package for Causal Inference With Continuous Exposures
Quantifying the causal effects of continuous exposures on outcomes of
interest is critical for social, economic, health, and medical research.
However, most existing software packages focus on binary exposures. We develop
the CausalGPS R package that implements a collection of algorithms to provide
algorithmic solutions for causal inference with continuous exposures. CausalGPS
implements a causal inference workflow, with algorithms based on generalized
propensity scores (GPS) as the core, extending propensity scores (the
probability of a unit being exposed given pre-exposure covariates) from binary
to continuous exposures. As the first step, the package implements efficient
and flexible estimations of the GPS, allowing multiple user-specified modeling
options. As the second step, the package provides two ways to adjust for
confounding: weighting and matching, generating weighted and matched data sets,
respectively. Lastly, the package provides built-in functions to fit flexible
parametric, semi-parametric, or non-parametric regression models on the
weighted or matched data to estimate the exposure-response function relating
the outcome with the exposures. The computationally intensive tasks are
implemented in C++, and efficient shared-memory parallelization is achieved by
OpenMP API. This paper outlines the main components of the CausalGPS R package
and demonstrates its application to assess the effect of long-term exposure to
PM2.5 on educational attainment using zip code-level data from the contiguous
United States from 2000-2016.