mexhaz: An R Package for Fitting Flexible Hazard-Based Regression Models for Overall and Excess Mortality with a Random Effect

IF 8.1 2区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Statistical Software Pub Date : 2021-06-01 DOI:10.18637/jss.v098.i14

H. Charvat, A. Belot

{"title":"mexhaz: An R Package for Fitting Flexible Hazard-Based Regression Models for Overall and Excess Mortality with a Random Effect","authors":"H. Charvat, A. Belot","doi":"10.18637/jss.v098.i14","DOIUrl":null,"url":null,"abstract":"We present mexhaz, an R package for fitting flexible hazard-based regression models with the possibility to add time-dependent effects of covariates and to account for a two level hierarchical structure in the data through the inclusion of a normally distributed random intercept (i.e., a log-normally distributed shared frailty). Moreover, mexhaz based models can be fitted within the excess hazard setting by allowing the specification of an expected hazard in the model. These models are of common use in the context of the analysis of population-based cancer registry data. Follow-up time can be entered in the right-censored or counting process input style, the latter allowing models with delayed entries. The logarithm of the baseline hazard can be flexibly modeled with B-splines or restricted cubic splines of time. Parameters estimation is based on likelihood maximization: in deriving the contribution of each observation to the cluster-specific conditional likelihood, Gauss-Legendre quadrature is used to calculate the cumulative hazard; the cluster-specific marginal likelihoods are then obtained by integrating over the random effects distribution, using adaptive Gauss-Hermite quadrature. Functions to compute and plot the predicted (excess) hazard and (net) survival (possibly with cluster-specific predictions in the case of random effect models) are provided. We illustrate the use of the different options of the mexhaz package and compare the results obtained with those of other available R packages.","PeriodicalId":17237,"journal":{"name":"Journal of Statistical Software","volume":"3 1","pages":""},"PeriodicalIF":8.1000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Statistical Software","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.18637/jss.v098.i14","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 10

Abstract

We present mexhaz, an R package for fitting flexible hazard-based regression models with the possibility to add time-dependent effects of covariates and to account for a two level hierarchical structure in the data through the inclusion of a normally distributed random intercept (i.e., a log-normally distributed shared frailty). Moreover, mexhaz based models can be fitted within the excess hazard setting by allowing the specification of an expected hazard in the model. These models are of common use in the context of the analysis of population-based cancer registry data. Follow-up time can be entered in the right-censored or counting process input style, the latter allowing models with delayed entries. The logarithm of the baseline hazard can be flexibly modeled with B-splines or restricted cubic splines of time. Parameters estimation is based on likelihood maximization: in deriving the contribution of each observation to the cluster-specific conditional likelihood, Gauss-Legendre quadrature is used to calculate the cumulative hazard; the cluster-specific marginal likelihoods are then obtained by integrating over the random effects distribution, using adaptive Gauss-Hermite quadrature. Functions to compute and plot the predicted (excess) hazard and (net) survival (possibly with cluster-specific predictions in the case of random effect models) are provided. We illustrate the use of the different options of the mexhaz package and compare the results obtained with those of other available R packages.

查看原文本刊更多论文

具有随机效应的总体死亡率和超额死亡率的灵活的基于风险的回归模型拟合

我们提出了mexhaz，这是一个R包，用于拟合灵活的基于风险的回归模型，可以添加协变量的时间依赖效应，并通过包含正态分布的随机截距(即对数正态分布的共享脆弱性)来解释数据中的两级层次结构。此外，通过允许在模型中指定预期危险，基于mexhaz的模型可以在过量危险设置内进行拟合。这些模型在分析基于人群的癌症登记数据的背景下是常用的。后续时间可以以右审查或计数过程输入样式输入，后者允许具有延迟输入的模型。基线危害的对数可以灵活地用b样条或时间的受限三次样条进行建模。参数估计基于似然最大化:在推导每个观测值对集群特定条件似然的贡献时，使用高斯-勒让德正交来计算累积风险;然后利用自适应高斯-埃尔米特正交法对随机效应分布进行积分，得到集群特有的边际似然。提供了计算和绘制预测(超额)风险和(净)生存(在随机效应模型的情况下可能具有特定于集群的预测)的函数。我们举例说明了mexhaz包的不同选项的使用，并将获得的结果与其他可用的R包的结果进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Statistical Software 工程技术-计算机：跨学科应用

CiteScore

10.70

自引率

1.70%

发文量

审稿时长

6-12 weeks

期刊介绍： The Journal of Statistical Software (JSS) publishes open-source software and corresponding reproducible articles discussing all aspects of the design, implementation, documentation, application, evaluation, comparison, maintainance and distribution of software dedicated to improvement of state-of-the-art in statistical computing in all areas of empirical research. Open-source code and articles are jointly reviewed and published in this journal and should be accessible to a broad community of practitioners, teachers, and researchers in the field of statistics.