A Correlation Technique to Reduce the Number of Predictors to Estimate the Survival Time of HIV/ AIDS Patients on ART

V. Ravi, G. Grover, R. Das, M. Varshney, Anurag Sharma
{"title":"A Correlation Technique to Reduce the Number of Predictors to Estimate the Survival Time of HIV/ AIDS Patients on ART","authors":"V. Ravi, G. Grover, R. Das, M. Varshney, Anurag Sharma","doi":"10.6000/1929-6029.2018.07.04.3","DOIUrl":null,"url":null,"abstract":"Till now, many research papers have been published which aims to estimate the survivle time of the HIV/AIDS patients taking into consideration all the predictors viz, Age, Sex, CD4, MOT, Smoking, Weight, HB, Coinfection, Time, BMI, Location Status, Marital Status, Drug etc, although all the predictors need not to be included in the model. Since some of the predictors may be correlated/ associated and may have some influence on the outcome variable, therefore, instead of taking both the significantly correlated/ associated predictors, we may take only one of the two. In this way, we may be able to reduce the number of predictors without affecting the estimated survival time. In this paper we have tried to reduce the number of predictors by determining the highly positively correlated predictors and then evaluating the effect of correlation/ association on the survival time of HIV/AIDS patients. These predictors that we have considered in the starting are Age, Sex, State, Smoking, Alcohol, Drugs, Opportunistic Infections (OI), Living Status (LS), Occupation (OC), Marital Status (MS) and Spouse for the data collected from 2004 to 2014 of AIDS patients in an ART center of Delhi, India. We have performed one – way ANOVA to test the association between a quantitative and a categorical variable and Chi-square test to test between two categorical variables. To select one of the two highly correlated/ associated predictors, a suitable model is fitted keeping one predictor independent at a time and other dependent and the model having the smaller AIC is considered and the independent variable in the model is included in the modified model. The fitted models are logistic, linear and multinomial logistic depending on the type of the independent variable to be fitted. Then the true model (having all the predictors) and the modified model (with reduced number of predictors) are compared on the basis of their AICs and the model having minimum AIC is chosen. In this way we could reduce the number of predictors by almost 50% without affecting the estimated survival time with a reduced standard error.","PeriodicalId":73480,"journal":{"name":"International journal of statistics in medical research","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2018-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of statistics in medical research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.6000/1929-6029.2018.07.04.3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Till now, many research papers have been published which aims to estimate the survivle time of the HIV/AIDS patients taking into consideration all the predictors viz, Age, Sex, CD4, MOT, Smoking, Weight, HB, Coinfection, Time, BMI, Location Status, Marital Status, Drug etc, although all the predictors need not to be included in the model. Since some of the predictors may be correlated/ associated and may have some influence on the outcome variable, therefore, instead of taking both the significantly correlated/ associated predictors, we may take only one of the two. In this way, we may be able to reduce the number of predictors without affecting the estimated survival time. In this paper we have tried to reduce the number of predictors by determining the highly positively correlated predictors and then evaluating the effect of correlation/ association on the survival time of HIV/AIDS patients. These predictors that we have considered in the starting are Age, Sex, State, Smoking, Alcohol, Drugs, Opportunistic Infections (OI), Living Status (LS), Occupation (OC), Marital Status (MS) and Spouse for the data collected from 2004 to 2014 of AIDS patients in an ART center of Delhi, India. We have performed one – way ANOVA to test the association between a quantitative and a categorical variable and Chi-square test to test between two categorical variables. To select one of the two highly correlated/ associated predictors, a suitable model is fitted keeping one predictor independent at a time and other dependent and the model having the smaller AIC is considered and the independent variable in the model is included in the modified model. The fitted models are logistic, linear and multinomial logistic depending on the type of the independent variable to be fitted. Then the true model (having all the predictors) and the modified model (with reduced number of predictors) are compared on the basis of their AICs and the model having minimum AIC is chosen. In this way we could reduce the number of predictors by almost 50% without affecting the estimated survival time with a reduced standard error.
减少预测因子数量的相关技术估算抗逆转录病毒/艾滋病患者的生存时间
到目前为止,已经发表了许多研究论文,旨在估计艾滋病毒/艾滋病患者的生存时间,考虑到所有的预测因素,即年龄,性别,CD4, MOT,吸烟,体重,HB,合并感染,时间,BMI,地理位置状况,婚姻状况,药物等,尽管所有的预测因素都不需要包括在模型中。由于一些预测因素可能是相关的,可能对结果变量有一定的影响,因此,我们可以只取其中一个,而不是同时取两个显著相关/相关的预测因素。通过这种方式,我们可以在不影响估计生存时间的情况下减少预测因子的数量。在本文中,我们试图通过确定高度正相关的预测因子来减少预测因子的数量,然后评估相关/关联对HIV/AIDS患者生存时间的影响。我们在开始时考虑的预测因子是年龄、性别、状态、吸烟、酒精、药物、机会性感染(OI)、生活状况(LS)、职业(OC)、婚姻状况(MS)和配偶,用于收集印度德里某ART中心2004 - 2014年艾滋病患者的数据。我们用单因素方差分析来检验一个定量变量和一个分类变量之间的相关性,用卡方检验来检验两个分类变量之间的相关性。为了从两个高度相关/相关的预测因子中选择一个,拟合合适的模型,每次保持一个预测因子独立,其他预测因子依赖,并考虑具有较小AIC的模型,并将模型中的自变量包含在修改的模型中。根据拟合自变量的类型,拟合的模型有逻辑、线性和多项逻辑。然后将真实模型(包含所有预测因子)和修正模型(减少预测因子数量)的AIC进行比较,选择AIC最小的模型。通过这种方式,我们可以将预测因子的数量减少近50%,而不会影响标准误差降低的估计生存时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
0.40
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信