Development of network-guided transcriptomic risk score for disease prediction

IF 0.7 4区 数学 Q3 STATISTICS & PROBABILITY
Stat Pub Date : 2024-01-16 DOI:10.1002/sta4.648
Xuan Cao, Liangliang Zhang, Kyoungjae Lee
{"title":"Development of network-guided transcriptomic risk score for disease prediction","authors":"Xuan Cao, Liangliang Zhang, Kyoungjae Lee","doi":"10.1002/sta4.648","DOIUrl":null,"url":null,"abstract":"Omics data, routinely collected in various clinical settings, are of a complex and network-structured nature. Recent progress in RNA sequencing (RNA-seq) allows us to explore whole-genome gene expression profiles and to develop predictive model for disease risk. In this study, we propose a novel Bayesian approach to construct RNA-seq-based risk score leveraging gene expression network for disease risk prediction. Specifically, we consider a hierarchical model with spike and slab priors over regression coefficients as well as entries in the inverse covariance matrix for covariates to simultaneously perform variable selection and network estimation in high-dimensional logistic regression. Through theoretical investigation and simulation studies, our method is shown to both enjoy desirable consistency properties and achieve superior empirical performance compared with other state-of-the-art methods. We analyse RNA-seq gene expression data from 441 asthmatic and 254 non-asthmatic samples to form a weighted network-guided risk score and benchmark the proposed method against existing approaches for asthma risk stratification.","PeriodicalId":56159,"journal":{"name":"Stat","volume":"37 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stat","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1002/sta4.648","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

Abstract

Omics data, routinely collected in various clinical settings, are of a complex and network-structured nature. Recent progress in RNA sequencing (RNA-seq) allows us to explore whole-genome gene expression profiles and to develop predictive model for disease risk. In this study, we propose a novel Bayesian approach to construct RNA-seq-based risk score leveraging gene expression network for disease risk prediction. Specifically, we consider a hierarchical model with spike and slab priors over regression coefficients as well as entries in the inverse covariance matrix for covariates to simultaneously perform variable selection and network estimation in high-dimensional logistic regression. Through theoretical investigation and simulation studies, our method is shown to both enjoy desirable consistency properties and achieve superior empirical performance compared with other state-of-the-art methods. We analyse RNA-seq gene expression data from 441 asthmatic and 254 non-asthmatic samples to form a weighted network-guided risk score and benchmark the proposed method against existing approaches for asthma risk stratification.
开发用于疾病预测的网络引导转录组风险评分
在各种临床环境中常规收集的 Omics 数据具有复杂的网络结构性质。最近在 RNA 测序(RNA-seq)方面取得的进展使我们能够探索全基因组基因表达谱,并建立疾病风险预测模型。在本研究中,我们提出了一种新颖的贝叶斯方法,利用基因表达网络构建基于 RNA-seq 的风险评分,用于疾病风险预测。具体来说,我们考虑了一个分层模型,该模型对回归系数以及协变量的逆协方差矩阵中的条目具有尖峰和板块前验,可在高维逻辑回归中同时执行变量选择和网络估计。通过理论研究和模拟研究,我们的方法不仅具有理想的一致性,而且与其他最先进的方法相比具有更优越的经验性能。我们分析了 441 个哮喘样本和 254 个非哮喘样本的 RNA-seq 基因表达数据,形成了加权网络指导风险评分,并将所提出的方法与现有的哮喘风险分层方法进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Stat
Stat Decision Sciences-Statistics, Probability and Uncertainty
CiteScore
1.10
自引率
0.00%
发文量
85
期刊介绍: Stat is an innovative electronic journal for the rapid publication of novel and topical research results, publishing compact articles of the highest quality in all areas of statistical endeavour. Its purpose is to provide a means of rapid sharing of important new theoretical, methodological and applied research. Stat is a joint venture between the International Statistical Institute and Wiley-Blackwell. Stat is characterised by: • Speed - a high-quality review process that aims to reach a decision within 20 days of submission. • Concision - a maximum article length of 10 pages of text, not including references. • Supporting materials - inclusion of electronic supporting materials including graphs, video, software, data and images. • Scope - addresses all areas of statistics and interdisciplinary areas. Stat is a scientific journal for the international community of statisticians and researchers and practitioners in allied quantitative disciplines.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信