IF 1.1 4区计算机科学

R Journal Pub Date : 2025-03-01 Epub Date: 2025-08-10

Seongwon Im, Ander Wilson, Daniel Mork

引用次数: 0

glmmPen: High Dimensional Penalized Generalized Linear Mixed Models. glmmPen：高维惩罚性广义线性混合模型。

IF 2.3 4区计算机科学

R Journal Pub Date : 2023-12-01 Epub Date: 2024-04-10 DOI: 10.32614/rj-2023-086

Hillary M Heiling, Naim U Rashid, Quefeng Li, Joseph G Ibrahim

{"title":"glmmPen: High Dimensional Penalized Generalized Linear Mixed Models.","authors":"Hillary M Heiling, Naim U Rashid, Quefeng Li, Joseph G Ibrahim","doi":"10.32614/rj-2023-086","DOIUrl":"10.32614/rj-2023-086","url":null,"abstract":"<p><p>Generalized linear mixed models (GLMMs) are widely used in research for their ability to model correlated outcomes with non-Gaussian conditional distributions. The proper selection of fixed and random effects is a critical part of the modeling process, where model misspecification may lead to significant bias. However, the joint selection of fixed and random effects has historically been limited to lower dimensional GLMMs, largely due to the use of criterion-based model selection strategies. Here we present the R package glmmPen, one of the first to select fixed and random effects in higher dimension using a penalized GLMM modeling framework. Model parameters are estimated using a Monte Carlo expectation conditional minimization (MCECM) algorithm, which leverages Stan and RcppArmadillo for increased computational efficiency. Our package supports the Binomial, Gaussian, and Poisson families and multiple penalty functions. In this manuscript we discuss the modeling procedure, estimation scheme, and software implementation through application to a pancreatic cancer subtyping study. Simulation results show our method has good performance in selecting both the fixed and random effects in high dimensional GLMMs.</p>","PeriodicalId":51285,"journal":{"name":"R Journal","volume":"15 4","pages":"106-128"},"PeriodicalIF":2.3,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11138212/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141181494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

binGroup2: Statistical Tools for Infection Identification via Group Testing. binGroup2：通过分组测试进行感染识别的统计工具。

IF 2.1 4区计算机科学

R Journal Pub Date : 2023-12-01 Epub Date: 2024-04-10 DOI: 10.32614/rj-2023-081

Christopher R Bilder, Brianna D Hitt, Brad J Biggerstaff, Joshua M Tebbs, Christopher S McMahan

引用次数: 0

Three-Way Correspondence Analysis in R R中的三向对应分析

4区计算机科学

R Journal Pub Date : 2023-11-09 DOI: 10.32614/rj-2023-049

Rosaria Lombardo, Michel van de Velden, Eric J. Beh

引用次数: 0

nlstac: Non-Gradient Separable Nonlinear Least Squares Fitting nlstac:非梯度可分非线性最小二乘拟合

4区计算机科学

R Journal Pub Date : 2023-11-08 DOI: 10.32614/rj-2023-040

J. A. F. Torvisco, R. Benítez, M. R. Arias, J. Cabello Sánchez

引用次数: 0

A Workflow for Estimating and Visualising Excess Mortality During the COVID-19 Pandemic COVID-19大流行期间超额死亡率估算和可视化工作流程

4区计算机科学

R Journal Pub Date : 2023-11-08 DOI: 10.32614/rj-2023-055

Garyfallos Konstantinoudis, Virgilio Gómez-Rubio, Michela Cameletti, Monica Pirani, Gianluca Baio, Marta Blangiardo

引用次数: 0

Estimating Heteroskedastic and Instrumental Variable Models for Binary Outcome Variables in R 估计二元结果变量的异方差和工具变量模型

4区计算机科学

R Journal Pub Date : 2023-11-08 DOI: 10.32614/rj-2023-050

Mauricio Sarrias

引用次数: 0

Generalized Estimating Equations using the new R package glmtoolbox 使用新R包glmtoolbox的广义估计方程

4区计算机科学

R Journal Pub Date : 2023-11-01 DOI: 10.32614/rj-2023-056

L.H. Vanegas, L.M. Rondón, G.A. Paula

{"title":"Generalized Estimating Equations using the new R package glmtoolbox","authors":"L.H. Vanegas, L.M. Rondón, G.A. Paula","doi":"10.32614/rj-2023-056","DOIUrl":"https://doi.org/10.32614/rj-2023-056","url":null,"abstract":"This paper introduces a very comprehensive implementation, available in the new `R` package `glmtoolbox`, of a very flexible statistical tool known as Generalized Estimating Equations (GEE), which analyzes cluster correlated data utilizing marginal models. As well as providing more built-in structures for the working correlation matrix than other GEE implementations in `R`, this GEE implementation also allows the user to: $(1)$ compute several estimates of the variance-covariance matrix of the estimators of the parameters of interest; $(2)$ compute several criteria to assist the selection of the structure for the working-correlation matrix; $(3)$ compare nested models using the Wald test as well as the generalized score test; $(4)$ assess the goodness-of-fit of the model using Pearson-, deviance- and Mahalanobis-type residuals; $(5)$ perform sensibility analysis using the global influence approach (that is, dfbeta statistic and Cook's distance) as well as the local influence approach; $(6)$ use several criteria to perform variable selection using a hybrid stepwise procedure; $(7)$ fit models with nonlinear predictors; $(8)$ handle dropout-type missing data under MAR rather than MCAR assumption by using observation-specific or cluster-specific weighted methods. The capabilities of this GEE implementation are illustrated by analyzing four real datasets obtained from longitudinal studies.","PeriodicalId":51285,"journal":{"name":"R Journal","volume":"107 5-6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135714472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Taking the Scenic Route: Interactive and Performant Tour Animations 走风景路线:交互式和高性能的游览动画

4区计算机科学

R Journal Pub Date : 2023-11-01 DOI: 10.32614/rj-2023-052

Casper Hart, Earo Wang

引用次数: 0

Gaussian Mixture Models in R R中的高斯混合模型

4区计算机科学

R Journal Pub Date : 2023-11-01 DOI: 10.32614/rj-2023-043

Bastien Chassagnol, Antoine Bichat, Cheïma Boudjeniba, Pierre-Henri Wuillemin, Mickaël Guedj, David Gohel, Gregory Nuel, Etienne Becht

{"title":"Gaussian Mixture Models in R","authors":"Bastien Chassagnol, Antoine Bichat, Cheïma Boudjeniba, Pierre-Henri Wuillemin, Mickaël Guedj, David Gohel, Gregory Nuel, Etienne Becht","doi":"10.32614/rj-2023-043","DOIUrl":"https://doi.org/10.32614/rj-2023-043","url":null,"abstract":"Gaussian mixture models (GMMs) are widely used for modelling stochastic problems. Indeed, a wide diversity of packages have been developed in R. However, no recent review describing the main features offered by these packages and comparing their performances has been performed. In this article, we first introduce GMMs and the EM algorithm used to retrieve the parameters of the model and analyse the main features implemented among seven of the most widely used R packages. We then empirically compare their statistical and computational performances in relation with the choice of the initialisation algorithm and the complexity of the mixture. We demonstrate that the best estimation with well-separated components or with a small number of components with distinguishable modes is obtained with REBMIX initialisation, implemented in the [rebmix](https://CRAN.R-project.org/package=rebmix) package, while the best estimation with highly overlapping components is obtained with *k*-means or random initialisation. Importantly, we show that implementation details in the EM algorithm yield differences in the parameters' estimation. Especially, packages [mixtools](https://CRAN.R-project.org/package=mixtools) (Young et al. 2020) and [Rmixmod](https://CRAN.R-project.org/package=Rmixmod) (Langrognet et al. 2021) estimate the parameters of the mixture with smaller bias, while the RMSE and variability of the estimates is smaller with packages [bgmm](https://CRAN.R-project.org/package=bgmm) (Ewa Szczurek 2021) , [EMCluster](https://CRAN.R-project.org/package=EMCluster) (W.-C. Chen and Maitra 2022) , [GMKMcharlie](https://CRAN.R-project.org/package=GMKMcharlie) (Liu 2021), [flexmix](https://CRAN.R-project.org/package=flexmix) (Gruen and Leisch 2022) and [mclust](https://CRAN.R-project.org/package=mclust) (Fraley, Raftery, and Scrucca 2022). The comparison of these packages provides R users with useful recommendations for improving the computational and statistical performance of their clustering and for identifying common deficiencies. Additionally, we propose several improvements in the development of a future, unified mixture model package.","PeriodicalId":51285,"journal":{"name":"R Journal","volume":"102 1-2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135714326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

R Journal最新文献