Penalized Empirical Likelihood-Based Variable Selection for Longitudinal Data Analysis

Q3 Business, Management and Accounting
Tharshanna Nadarajah, A. Variyath, J. Loredo-Osti
{"title":"Penalized Empirical Likelihood-Based Variable Selection for Longitudinal Data Analysis","authors":"Tharshanna Nadarajah, A. Variyath, J. Loredo-Osti","doi":"10.1080/01966324.2020.1837042","DOIUrl":null,"url":null,"abstract":"Abstract Longitudinal data with a large number of covariates have become common in many applications such as epidemiology, clinical research, and therapeutic evaluation. The identification of a sub-model that adequately represents the data are necessary for easy interpretation. Existing information theoretic-approaches such as AIC and BIC are useful, but computationally not efficient due to an evaluation of all possible subsets. A new class of penalized likelihood methods such as LASSO, SCAD, etc. are efficient in these situations. All these methods rely on the parametric modeling of the response of interest. The joint likelihood function for longitudinal data is challenging, particularly for correlated discrete outcome data. In such a situation, we propose penalized empirical likelihood (PEL) based on generalized estimating equations (GEE) by which the variable selection and the estimation of the coefficients are carried out simultaneously. We discuss its characteristics and asymptotic properties and present an efficient computational algorithm for optimizing PEL. Simulation studies show that when model assumptions are true, its performance is comparable to that of the existing methods and when the model is misspecified, our method has clear advantages over the existing methods. We have applied the method to two case examples.","PeriodicalId":35850,"journal":{"name":"American Journal of Mathematical and Management Sciences","volume":"40 1","pages":"241 - 260"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/01966324.2020.1837042","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Mathematical and Management Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/01966324.2020.1837042","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Business, Management and Accounting","Score":null,"Total":0}
引用次数: 1

Abstract

Abstract Longitudinal data with a large number of covariates have become common in many applications such as epidemiology, clinical research, and therapeutic evaluation. The identification of a sub-model that adequately represents the data are necessary for easy interpretation. Existing information theoretic-approaches such as AIC and BIC are useful, but computationally not efficient due to an evaluation of all possible subsets. A new class of penalized likelihood methods such as LASSO, SCAD, etc. are efficient in these situations. All these methods rely on the parametric modeling of the response of interest. The joint likelihood function for longitudinal data is challenging, particularly for correlated discrete outcome data. In such a situation, we propose penalized empirical likelihood (PEL) based on generalized estimating equations (GEE) by which the variable selection and the estimation of the coefficients are carried out simultaneously. We discuss its characteristics and asymptotic properties and present an efficient computational algorithm for optimizing PEL. Simulation studies show that when model assumptions are true, its performance is comparable to that of the existing methods and when the model is misspecified, our method has clear advantages over the existing methods. We have applied the method to two case examples.
基于惩罚经验似然的纵向数据分析变量选择
摘要具有大量协变量的纵向数据在流行病学、临床研究和治疗评估等许多应用中已经很常见。为了便于解释,有必要确定一个充分代表数据的子模型。现有的信息论方法,如AIC和BIC是有用的,但由于评估了所有可能的子集,计算效率不高。一类新的惩罚似然方法,如LASSO、SCAD等,在这种情况下是有效的。所有这些方法都依赖于感兴趣的响应的参数化建模。纵向数据的联合似然函数具有挑战性,尤其是对于相关离散结果数据。在这种情况下,我们提出了基于广义估计方程(GEE)的惩罚经验似然(PEL),通过广义估计方程同时进行变量选择和系数估计。我们讨论了它的特性和渐近性质,并提出了一种有效的PEL优化计算算法。仿真研究表明,当模型假设成立时,其性能与现有方法相当,当模型指定错误时,我们的方法比现有方法具有明显的优势。我们已将该方法应用于两个实例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
American Journal of Mathematical and Management Sciences
American Journal of Mathematical and Management Sciences Business, Management and Accounting-Business, Management and Accounting (all)
CiteScore
2.70
自引率
0.00%
发文量
5
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信