Gradient boosting for linear mixed models.

IF 1.2 4区数学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

International Journal of Biostatistics Pub Date : 2021-01-13 DOI:10.1515/ijb-2020-0136

Colin Griesbach, Benjamin Säfken, Elisabeth Waldmann

引用次数: 8

Abstract

Gradient boosting from the field of statistical learning is widely known as a powerful framework for estimation and selection of predictor effects in various regression models by adapting concepts from classification theory. Current boosting approaches also offer methods accounting for random effects and thus enable prediction of mixed models for longitudinal and clustered data. However, these approaches include several flaws resulting in unbalanced effect selection with falsely induced shrinkage and a low convergence rate on the one hand and biased estimates of the random effects on the other hand. We therefore propose a new boosting algorithm which explicitly accounts for the random structure by excluding it from the selection procedure, properly correcting the random effects estimates and in addition providing likelihood-based estimation of the random effects variance structure. The new algorithm offers an organic and unbiased fitting approach, which is shown via simulations and data examples.

查看原文本刊更多论文

线性混合模型的梯度增强。

来自统计学习领域的梯度增强被广泛认为是一个强大的框架，通过适应分类理论的概念，在各种回归模型中估计和选择预测器效应。目前的增强方法还提供了考虑随机效应的方法，从而能够预测纵向和聚类数据的混合模型。然而，这些方法存在一些缺陷，一方面导致不平衡的效应选择，错误地诱导收缩和低收敛率，另一方面导致对随机效应的估计有偏差。因此，我们提出了一种新的增强算法，该算法通过将随机结构排除在选择过程之外，适当地纠正随机效应估计，并提供基于似然的随机效应方差结构估计，从而明确地解释了随机结构。该算法提供了一种有机的、无偏的拟合方法，并通过仿真和数据实例进行了验证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Biostatistics MATHEMATICAL & COMPUTATIONAL BIOLOGY-STATISTICS & PROBABILITY

CiteScore

2.10

自引率

8.30%

发文量

审稿时长

>12 weeks

期刊介绍： The International Journal of Biostatistics (IJB) seeks to publish new biostatistical models and methods, new statistical theory, as well as original applications of statistical methods, for important practical problems arising from the biological, medical, public health, and agricultural sciences with an emphasis on semiparametric methods. Given many alternatives to publish exist within biostatistics, IJB offers a place to publish for research in biostatistics focusing on modern methods, often based on machine-learning and other data-adaptive methodologies, as well as providing a unique reading experience that compels the author to be explicit about the statistical inference problem addressed by the paper. IJB is intended that the journal cover the entire range of biostatistics, from theoretical advances to relevant and sensible translations of a practical problem into a statistical framework. Electronic publication also allows for data and software code to be appended, and opens the door for reproducible research allowing readers to easily replicate analyses described in a paper. Both original research and review articles will be warmly received, as will articles applying sound statistical methods to practical problems.