Additive Risk Analysis of microarray Gene Expression Data via Correlation Principal Component Regression

Yichuan Zhao, Guoshen Wang
{"title":"Additive Risk Analysis of microarray Gene Expression Data via Correlation Principal Component Regression","authors":"Yichuan Zhao, Guoshen Wang","doi":"10.1142/S0219720010004914","DOIUrl":null,"url":null,"abstract":"In order to predict future patients' survival time based on their microarray gene expression data, one interesting question is how to relate genes to survival outcomes. In this paper, by applying a semi-parametric additive risk model in survival analysis, we propose a new approach to conduct a careful analysis of gene expression data with the focus on the model's predictive ability. In the proposed method, we apply the correlation principal component regression to deal with right censoring survival data under the semi-parametric additive risk model frame with high-dimensional covariates. We also employ the time-dependent area under the receiver operating characteristic curve and root mean squared error for prediction to assess how well the model can predict the survival time. Furthermore, the proposed method is able to identify significant genes, which are significantly related to the disease. Finally, the proposed useful approach is illustrated by the diffuse large B-cell lymphoma data set and breast cancer data set. The results show that the model fits the data sets very well.","PeriodicalId":90783,"journal":{"name":"American journal of bioinformatics and computational biology","volume":"82 3 1","pages":"645-659"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of bioinformatics and computational biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/S0219720010004914","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

In order to predict future patients' survival time based on their microarray gene expression data, one interesting question is how to relate genes to survival outcomes. In this paper, by applying a semi-parametric additive risk model in survival analysis, we propose a new approach to conduct a careful analysis of gene expression data with the focus on the model's predictive ability. In the proposed method, we apply the correlation principal component regression to deal with right censoring survival data under the semi-parametric additive risk model frame with high-dimensional covariates. We also employ the time-dependent area under the receiver operating characteristic curve and root mean squared error for prediction to assess how well the model can predict the survival time. Furthermore, the proposed method is able to identify significant genes, which are significantly related to the disease. Finally, the proposed useful approach is illustrated by the diffuse large B-cell lymphoma data set and breast cancer data set. The results show that the model fits the data sets very well.
基于相关主成分回归的微阵列基因表达数据加性风险分析
为了根据微阵列基因表达数据预测未来患者的生存时间,一个有趣的问题是如何将基因与生存结果联系起来。在本文中,通过将半参数加性风险模型应用于生存分析,我们提出了一种新的方法来对基因表达数据进行仔细分析,重点关注模型的预测能力。该方法在具有高维协变量的半参数可加性风险模型框架下,应用相关主成分回归处理右删减生存数据。我们还利用受试者工作特征曲线下的时间依赖面积和均方根误差进行预测,以评估模型预测生存时间的效果。此外,该方法能够识别与疾病显著相关的重要基因。最后,弥漫性大b细胞淋巴瘤数据集和乳腺癌数据集说明了所提出的有用方法。结果表明,该模型能很好地拟合数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信