关联研究和基因组预测的新统计方法

Charles‐Elie Rabier, Céline Delmas
{"title":"关联研究和基因组预测的新统计方法","authors":"Charles‐Elie Rabier, Céline Delmas","doi":"10.11159/icsta22.135","DOIUrl":null,"url":null,"abstract":"Extended Abstract \"Selective genotyping\" is a very famous concept in genetics. It was introduced by Lebowitz et al. (1987) and was studied more in details by Lander and Botstein (1989). It consists in genotyping (collecting DNA information at specific positions) only the individuals with extreme phenotypes. Indeed, Lebowitz et al. (1987) noticed that the highest or the lowest observations contain most of the signal on Quantitative Trait Loci (QTL), i.e. genes with quantitative effect on a trait. Today, although the genotyping costs have drastically dropped, selective genotyping is still heavily used (e.g. [1]) since we can optimize the statistical experiment by focusing on extreme individuals instead of \"random\" individuals. Although \"selective genotyping\" was introduced in the eighties, biologists are still missing tools to analyze properly data sampled from this experimental design. Indeed, classical methods such as penalized regression (e.g. Lasso [2]) are not dedicated to extreme observations. As a consequence, we introduced recently the SgenoLasso [3], a new L1 penalized regression that models explicitly the extremes. SgenoLasso on the ``Interval Mapping\" a famous concept in genetics that consists in scanning the genome by testing the presence of a QTL at each location. From a statistical point of view, SgenoLasso is based on new limiting results on stochastic processes along the genome. SgenoLasso presents all the nice properties of Lasso since we have replaced the problem in a","PeriodicalId":325859,"journal":{"name":"Proceedings of the 4th International Conference on Statistics: Theory and Applications","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"New Statistical Methods For Association Studies And Genomic Predictio\",\"authors\":\"Charles‐Elie Rabier, Céline Delmas\",\"doi\":\"10.11159/icsta22.135\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Extended Abstract \\\"Selective genotyping\\\" is a very famous concept in genetics. It was introduced by Lebowitz et al. (1987) and was studied more in details by Lander and Botstein (1989). It consists in genotyping (collecting DNA information at specific positions) only the individuals with extreme phenotypes. Indeed, Lebowitz et al. (1987) noticed that the highest or the lowest observations contain most of the signal on Quantitative Trait Loci (QTL), i.e. genes with quantitative effect on a trait. Today, although the genotyping costs have drastically dropped, selective genotyping is still heavily used (e.g. [1]) since we can optimize the statistical experiment by focusing on extreme individuals instead of \\\"random\\\" individuals. Although \\\"selective genotyping\\\" was introduced in the eighties, biologists are still missing tools to analyze properly data sampled from this experimental design. Indeed, classical methods such as penalized regression (e.g. Lasso [2]) are not dedicated to extreme observations. As a consequence, we introduced recently the SgenoLasso [3], a new L1 penalized regression that models explicitly the extremes. SgenoLasso on the ``Interval Mapping\\\" a famous concept in genetics that consists in scanning the genome by testing the presence of a QTL at each location. From a statistical point of view, SgenoLasso is based on new limiting results on stochastic processes along the genome. SgenoLasso presents all the nice properties of Lasso since we have replaced the problem in a\",\"PeriodicalId\":325859,\"journal\":{\"name\":\"Proceedings of the 4th International Conference on Statistics: Theory and Applications\",\"volume\":\"59 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Conference on Statistics: Theory and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.11159/icsta22.135\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Statistics: Theory and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11159/icsta22.135","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

“选择性基因分型”是遗传学中一个非常著名的概念。它由Lebowitz等人(1987)提出,Lander和Botstein(1989)对其进行了更详细的研究。它包括基因分型(收集特定位置的DNA信息),仅针对具有极端表型的个体。事实上,Lebowitz等人(1987)注意到,最高或最低的观测值包含了数量性状位点(Quantitative Trait Loci, QTL)上的大部分信号,即对性状有数量效应的基因。今天,尽管基因分型成本大幅下降,但选择性基因分型仍然被大量使用(例如[1]),因为我们可以通过关注极端个体而不是“随机”个体来优化统计实验。尽管“选择性基因分型”在80年代被引入,生物学家仍然缺乏工具来正确分析从这种实验设计中取样的数据。事实上,惩罚回归等经典方法(如Lasso[2])并不专门用于极端观测。因此,我们最近引入了SgenoLasso[3],这是一种新的L1惩罚回归,可以明确地模拟极端情况。SgenoLasso对“区间映射”的一个著名的遗传学概念,包括扫描基因组通过测试在每个位置的QTL的存在。从统计学的角度来看,SgenoLasso是基于基因组随机过程的新限制结果。SgenoLasso给出了Lasso的所有好的性质因为我们把问题替换成了a
本文章由计算机程序翻译,如有差异,请以英文原文为准。
New Statistical Methods For Association Studies And Genomic Predictio
Extended Abstract "Selective genotyping" is a very famous concept in genetics. It was introduced by Lebowitz et al. (1987) and was studied more in details by Lander and Botstein (1989). It consists in genotyping (collecting DNA information at specific positions) only the individuals with extreme phenotypes. Indeed, Lebowitz et al. (1987) noticed that the highest or the lowest observations contain most of the signal on Quantitative Trait Loci (QTL), i.e. genes with quantitative effect on a trait. Today, although the genotyping costs have drastically dropped, selective genotyping is still heavily used (e.g. [1]) since we can optimize the statistical experiment by focusing on extreme individuals instead of "random" individuals. Although "selective genotyping" was introduced in the eighties, biologists are still missing tools to analyze properly data sampled from this experimental design. Indeed, classical methods such as penalized regression (e.g. Lasso [2]) are not dedicated to extreme observations. As a consequence, we introduced recently the SgenoLasso [3], a new L1 penalized regression that models explicitly the extremes. SgenoLasso on the ``Interval Mapping" a famous concept in genetics that consists in scanning the genome by testing the presence of a QTL at each location. From a statistical point of view, SgenoLasso is based on new limiting results on stochastic processes along the genome. SgenoLasso presents all the nice properties of Lasso since we have replaced the problem in a
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信