在多重响应回归中检测热点的全局-局部方法

IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY
Annals of Applied Statistics Pub Date : 2020-06-01 Epub Date: 2020-06-29 DOI:10.1214/20-AOAS1332
Hélène Ruffieux, Anthony C Davison, Jörg Hager, Jamie Inshaw, Benjamin P Fairfax, Sylvia Richardson, Leonardo Bottolo
{"title":"在多重响应回归中检测热点的全局-局部方法","authors":"Hélène Ruffieux, Anthony C Davison, Jörg Hager, Jamie Inshaw, Benjamin P Fairfax, Sylvia Richardson, Leonardo Bottolo","doi":"10.1214/20-AOAS1332","DOIUrl":null,"url":null,"abstract":"<p><p>We tackle modelling and inference for variable selection in regression problems with many predictors and many responses. We focus on detecting <i>hotspots</i>, that is, predictors associated with several responses. Such a task is critical in statistical genetics, as hotspot genetic variants shape the architecture of the genome by controlling the expression of many genes and may initiate decisive functional mechanisms underlying disease endpoints. Existing hierarchical regression approaches designed to model hotspots suffer from two limitations: their discrimination of hotspots is sensitive to the choice of top-level scale parameters for the propensity of predictors to be hotspots, and they do not scale to large predictor and response vectors, for example, of dimensions 10<sup>3</sup>-10<sup>5</sup> in genetic applications. We address these shortcomings by introducing a flexible hierarchical regression framework that is tailored to the detection of hotspots and scalable to the above dimensions. Our proposal implements a fully Bayesian model for hotspots based on the horseshoe shrinkage prior. Its global-local formulation shrinks noise globally and, hence, accommodates the highly sparse nature of genetic analyses while being robust to individual signals, thus leaving the effects of hotspots unshrunk. Inference is carried out using a fast variational algorithm coupled with a novel simulated annealing procedure that allows efficient exploration of multimodal distributions.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"14 2","pages":"905-928"},"PeriodicalIF":1.3000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7612176/pdf/EMS140637.pdf","citationCount":"0","resultStr":"{\"title\":\"A Global-Local Approach for Detecting Hotspots in Multiple-Response Regression.\",\"authors\":\"Hélène Ruffieux, Anthony C Davison, Jörg Hager, Jamie Inshaw, Benjamin P Fairfax, Sylvia Richardson, Leonardo Bottolo\",\"doi\":\"10.1214/20-AOAS1332\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>We tackle modelling and inference for variable selection in regression problems with many predictors and many responses. We focus on detecting <i>hotspots</i>, that is, predictors associated with several responses. Such a task is critical in statistical genetics, as hotspot genetic variants shape the architecture of the genome by controlling the expression of many genes and may initiate decisive functional mechanisms underlying disease endpoints. Existing hierarchical regression approaches designed to model hotspots suffer from two limitations: their discrimination of hotspots is sensitive to the choice of top-level scale parameters for the propensity of predictors to be hotspots, and they do not scale to large predictor and response vectors, for example, of dimensions 10<sup>3</sup>-10<sup>5</sup> in genetic applications. We address these shortcomings by introducing a flexible hierarchical regression framework that is tailored to the detection of hotspots and scalable to the above dimensions. Our proposal implements a fully Bayesian model for hotspots based on the horseshoe shrinkage prior. Its global-local formulation shrinks noise globally and, hence, accommodates the highly sparse nature of genetic analyses while being robust to individual signals, thus leaving the effects of hotspots unshrunk. Inference is carried out using a fast variational algorithm coupled with a novel simulated annealing procedure that allows efficient exploration of multimodal distributions.</p>\",\"PeriodicalId\":50772,\"journal\":{\"name\":\"Annals of Applied Statistics\",\"volume\":\"14 2\",\"pages\":\"905-928\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7612176/pdf/EMS140637.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of Applied Statistics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1214/20-AOAS1332\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2020/6/29 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Applied Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1214/20-AOAS1332","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/6/29 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

摘要

我们要解决的是在有许多预测因子和许多反应的回归问题中进行变量选择的建模和推理。我们的重点是检测热点,即与多个反应相关的预测因子。这种任务在统计遗传学中至关重要,因为热点遗传变异通过控制许多基因的表达来塑造基因组的结构,并可能启动疾病终点的决定性功能机制。现有的为建立热点模型而设计的分层回归方法有两个局限性:它们对热点的判别对选择预测因子成为热点的倾向的顶层尺度参数很敏感;它们不能扩展到大型预测因子和响应向量,例如遗传学应用中的 103-105 维向量。为了解决这些缺陷,我们引入了一个灵活的分层回归框架,该框架专为热点检测量身定制,并可扩展到上述维度。我们的建议是在马蹄形收缩先验的基础上实现一个完全贝叶斯的热点模型。它的全局-局部公式在全局范围内缩小了噪声,因此可以适应遗传分析的高度稀疏性,同时对单个信号具有鲁棒性,从而使热点的影响不被缩小。推理采用快速变异算法和新颖的模拟退火程序,可有效探索多模态分布。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Global-Local Approach for Detecting Hotspots in Multiple-Response Regression.

We tackle modelling and inference for variable selection in regression problems with many predictors and many responses. We focus on detecting hotspots, that is, predictors associated with several responses. Such a task is critical in statistical genetics, as hotspot genetic variants shape the architecture of the genome by controlling the expression of many genes and may initiate decisive functional mechanisms underlying disease endpoints. Existing hierarchical regression approaches designed to model hotspots suffer from two limitations: their discrimination of hotspots is sensitive to the choice of top-level scale parameters for the propensity of predictors to be hotspots, and they do not scale to large predictor and response vectors, for example, of dimensions 103-105 in genetic applications. We address these shortcomings by introducing a flexible hierarchical regression framework that is tailored to the detection of hotspots and scalable to the above dimensions. Our proposal implements a fully Bayesian model for hotspots based on the horseshoe shrinkage prior. Its global-local formulation shrinks noise globally and, hence, accommodates the highly sparse nature of genetic analyses while being robust to individual signals, thus leaving the effects of hotspots unshrunk. Inference is carried out using a fast variational algorithm coupled with a novel simulated annealing procedure that allows efficient exploration of multimodal distributions.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Annals of Applied Statistics
Annals of Applied Statistics 社会科学-统计学与概率论
CiteScore
3.10
自引率
5.60%
发文量
131
审稿时长
6-12 weeks
期刊介绍: Statistical research spans an enormous range from direct subject-matter collaborations to pure mathematical theory. The Annals of Applied Statistics, the newest journal from the IMS, is aimed at papers in the applied half of this range. Published quarterly in both print and electronic form, our goal is to provide a timely and unified forum for all areas of applied statistics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信