CausalGPS: An R Package for Causal Inference With Continuous Exposures

Naeem Khoshnevis, Xiao Wu, Danielle Braun
{"title":"CausalGPS: An R Package for Causal Inference With Continuous Exposures","authors":"Naeem Khoshnevis, Xiao Wu, Danielle Braun","doi":"arxiv-2310.00561","DOIUrl":null,"url":null,"abstract":"Quantifying the causal effects of continuous exposures on outcomes of\ninterest is critical for social, economic, health, and medical research.\nHowever, most existing software packages focus on binary exposures. We develop\nthe CausalGPS R package that implements a collection of algorithms to provide\nalgorithmic solutions for causal inference with continuous exposures. CausalGPS\nimplements a causal inference workflow, with algorithms based on generalized\npropensity scores (GPS) as the core, extending propensity scores (the\nprobability of a unit being exposed given pre-exposure covariates) from binary\nto continuous exposures. As the first step, the package implements efficient\nand flexible estimations of the GPS, allowing multiple user-specified modeling\noptions. As the second step, the package provides two ways to adjust for\nconfounding: weighting and matching, generating weighted and matched data sets,\nrespectively. Lastly, the package provides built-in functions to fit flexible\nparametric, semi-parametric, or non-parametric regression models on the\nweighted or matched data to estimate the exposure-response function relating\nthe outcome with the exposures. The computationally intensive tasks are\nimplemented in C++, and efficient shared-memory parallelization is achieved by\nOpenMP API. This paper outlines the main components of the CausalGPS R package\nand demonstrates its application to assess the effect of long-term exposure to\nPM2.5 on educational attainment using zip code-level data from the contiguous\nUnited States from 2000-2016.","PeriodicalId":501256,"journal":{"name":"arXiv - CS - Mathematical Software","volume":"19 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Mathematical Software","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2310.00561","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Quantifying the causal effects of continuous exposures on outcomes of interest is critical for social, economic, health, and medical research. However, most existing software packages focus on binary exposures. We develop the CausalGPS R package that implements a collection of algorithms to provide algorithmic solutions for causal inference with continuous exposures. CausalGPS implements a causal inference workflow, with algorithms based on generalized propensity scores (GPS) as the core, extending propensity scores (the probability of a unit being exposed given pre-exposure covariates) from binary to continuous exposures. As the first step, the package implements efficient and flexible estimations of the GPS, allowing multiple user-specified modeling options. As the second step, the package provides two ways to adjust for confounding: weighting and matching, generating weighted and matched data sets, respectively. Lastly, the package provides built-in functions to fit flexible parametric, semi-parametric, or non-parametric regression models on the weighted or matched data to estimate the exposure-response function relating the outcome with the exposures. The computationally intensive tasks are implemented in C++, and efficient shared-memory parallelization is achieved by OpenMP API. This paper outlines the main components of the CausalGPS R package and demonstrates its application to assess the effect of long-term exposure to PM2.5 on educational attainment using zip code-level data from the contiguous United States from 2000-2016.
CausalGPS:一个用于连续曝光因果推理的R包
对于社会、经济、健康和医学研究而言,量化持续暴露对相关结果的因果影响至关重要。然而,大多数现有的软件包都侧重于二进制曝光。我们开发了CausalGPS R包,它实现了一系列算法,为连续曝光的因果推理提供算法解决方案。causalgp简化了一个因果推理工作流,以基于广义倾向分数(GPS)的算法为核心,将倾向分数(给定暴露前协变量的单位暴露的概率)从二元暴露扩展到连续暴露。作为第一步,该包实现了GPS的有效和灵活的估计,允许多个用户指定的建模选项。第二步,该包提供了两种方法来调整混淆:加权和匹配,分别生成加权和匹配的数据集。最后,该软件包提供了内置函数来拟合加权或匹配数据上的灵活参数,半参数或非参数回归模型,以估计与暴露结果相关的暴露-响应函数。计算密集型任务用c++语言实现,并通过openmp API实现高效的共享内存并行化。本文概述了CausalGPS R包的主要组成部分,并利用2000-2016年美国邻近地区的邮政编码级别数据,展示了其在评估长期暴露于toPM2.5对教育成就的影响方面的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信