Modelling subject-specific childhood growth using linear mixed-effect models with cubic regression splines.

IF 4.1 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Emerging Themes in Epidemiology Pub Date : 2016-01-07 eCollection Date: 2016-01-01 DOI:10.1186/s12982-015-0038-3
Laura M Grajeda, Andrada Ivanescu, Mayuko Saito, Ciprian Crainiceanu, Devan Jaganath, Robert H Gilman, Jean E Crabtree, Dermott Kelleher, Lilia Cabrera, Vitaliano Cama, William Checkley
{"title":"Modelling subject-specific childhood growth using linear mixed-effect models with cubic regression splines.","authors":"Laura M Grajeda, Andrada Ivanescu, Mayuko Saito, Ciprian Crainiceanu, Devan Jaganath, Robert H Gilman, Jean E Crabtree, Dermott Kelleher, Lilia Cabrera, Vitaliano Cama, William Checkley","doi":"10.1186/s12982-015-0038-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Childhood growth is a cornerstone of pediatric research. Statistical models need to consider individual trajectories to adequately describe growth outcomes. Specifically, well-defined longitudinal models are essential to characterize both population and subject-specific growth. Linear mixed-effect models with cubic regression splines can account for the nonlinearity of growth curves and provide reasonable estimators of population and subject-specific growth, velocity and acceleration.</p><p><strong>Methods: </strong>We provide a stepwise approach that builds from simple to complex models, and account for the intrinsic complexity of the data. We start with standard cubic splines regression models and build up to a model that includes subject-specific random intercepts and slopes and residual autocorrelation. We then compared cubic regression splines vis-à-vis linear piecewise splines, and with varying number of knots and positions. Statistical code is provided to ensure reproducibility and improve dissemination of methods. Models are applied to longitudinal height measurements in a cohort of 215 Peruvian children followed from birth until their fourth year of life.</p><p><strong>Results: </strong>Unexplained variability, as measured by the variance of the regression model, was reduced from 7.34 when using ordinary least squares to 0.81 (p < 0.001) when using a linear mixed-effect models with random slopes and a first order continuous autoregressive error term. There was substantial heterogeneity in both the intercept (p < 0.001) and slopes (p < 0.001) of the individual growth trajectories. We also identified important serial correlation within the structure of the data (ρ = 0.66; 95 % CI 0.64 to 0.68; p < 0.001), which we modeled with a first order continuous autoregressive error term as evidenced by the variogram of the residuals and by a lack of association among residuals. The final model provides a parametric linear regression equation for both estimation and prediction of population- and individual-level growth in height. We show that cubic regression splines are superior to linear regression splines for the case of a small number of knots in both estimation and prediction with the full linear mixed effect model (AIC 19,352 vs. 19,598, respectively). While the regression parameters are more complex to interpret in the former, we argue that inference for any problem depends more on the estimated curve or differences in curves rather than the coefficients. Moreover, use of cubic regression splines provides biological meaningful growth velocity and acceleration curves despite increased complexity in coefficient interpretation.</p><p><strong>Conclusions: </strong>Through this stepwise approach, we provide a set of tools to model longitudinal childhood data for non-statisticians using linear mixed-effect models.</p>","PeriodicalId":39896,"journal":{"name":"Emerging Themes in Epidemiology","volume":"13 1","pages":"1"},"PeriodicalIF":4.1000,"publicationDate":"2016-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4705630/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Emerging Themes in Epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s12982-015-0038-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2016/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Childhood growth is a cornerstone of pediatric research. Statistical models need to consider individual trajectories to adequately describe growth outcomes. Specifically, well-defined longitudinal models are essential to characterize both population and subject-specific growth. Linear mixed-effect models with cubic regression splines can account for the nonlinearity of growth curves and provide reasonable estimators of population and subject-specific growth, velocity and acceleration.

Methods: We provide a stepwise approach that builds from simple to complex models, and account for the intrinsic complexity of the data. We start with standard cubic splines regression models and build up to a model that includes subject-specific random intercepts and slopes and residual autocorrelation. We then compared cubic regression splines vis-à-vis linear piecewise splines, and with varying number of knots and positions. Statistical code is provided to ensure reproducibility and improve dissemination of methods. Models are applied to longitudinal height measurements in a cohort of 215 Peruvian children followed from birth until their fourth year of life.

Results: Unexplained variability, as measured by the variance of the regression model, was reduced from 7.34 when using ordinary least squares to 0.81 (p < 0.001) when using a linear mixed-effect models with random slopes and a first order continuous autoregressive error term. There was substantial heterogeneity in both the intercept (p < 0.001) and slopes (p < 0.001) of the individual growth trajectories. We also identified important serial correlation within the structure of the data (ρ = 0.66; 95 % CI 0.64 to 0.68; p < 0.001), which we modeled with a first order continuous autoregressive error term as evidenced by the variogram of the residuals and by a lack of association among residuals. The final model provides a parametric linear regression equation for both estimation and prediction of population- and individual-level growth in height. We show that cubic regression splines are superior to linear regression splines for the case of a small number of knots in both estimation and prediction with the full linear mixed effect model (AIC 19,352 vs. 19,598, respectively). While the regression parameters are more complex to interpret in the former, we argue that inference for any problem depends more on the estimated curve or differences in curves rather than the coefficients. Moreover, use of cubic regression splines provides biological meaningful growth velocity and acceleration curves despite increased complexity in coefficient interpretation.

Conclusions: Through this stepwise approach, we provide a set of tools to model longitudinal childhood data for non-statisticians using linear mixed-effect models.

Abstract Image

Abstract Image

Abstract Image

利用三次回归样条线性混合效应模型对特定受试者的儿童成长进行建模。
背景:儿童成长是儿科研究的基石。统计模型需要考虑个体轨迹,以充分描述增长结果。具体来说,定义良好的纵向模型对于描述人口和特定学科的增长都是必不可少的。具有三次回归样条的线性混合效应模型可以解释生长曲线的非线性,并为群体和特定学科的生长、速度和加速度提供合理的估计。方法:我们提供了一个循序渐进的方法,从简单到复杂的模型,并考虑到数据的内在复杂性。我们从标准三次样条回归模型开始,并建立一个包括特定主题的随机截距和斜率以及残差自相关的模型。然后,我们比较了三次回归样条与-à-vis线性分段样条,以及不同数量的结和位置。提供了统计代码,以确保再现性和改进方法的传播。模型被应用于对215名秘鲁儿童的纵向身高测量,这些儿童从出生到四岁。结果:通过回归模型的方差测量,无法解释的可变性从使用普通最小二乘法时的7.34降低到0.81 (p)。结论:通过这种逐步方法,我们为使用线性混合效应模型的非统计学家提供了一套工具来模拟纵向儿童数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Emerging Themes in Epidemiology
Emerging Themes in Epidemiology Medicine-Epidemiology
CiteScore
4.40
自引率
4.30%
发文量
9
审稿时长
28 weeks
期刊介绍: Emerging Themes in Epidemiology is an open access, peer-reviewed, online journal that aims to promote debate and discussion on practical and theoretical aspects of epidemiology. Combining statistical approaches with an understanding of the biology of disease, epidemiologists seek to elucidate the social, environmental and host factors related to adverse health outcomes. Although research findings from epidemiologic studies abound in traditional public health journals, little publication space is devoted to discussion of the practical and theoretical concepts that underpin them. Because of its immediate impact on public health, an openly accessible forum is needed in the field of epidemiology to foster such discussion.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信