Statistical advances and challenges for analyzing correlated high dimensional SNP data in genomic study for complex diseases

IF 11 Q1 STATISTICS & PROBABILITY
Yulan Liang, A. Kelemen
{"title":"Statistical advances and challenges for analyzing correlated high dimensional SNP data in genomic study for complex diseases","authors":"Yulan Liang, A. Kelemen","doi":"10.1214/07-SS026","DOIUrl":null,"url":null,"abstract":"Recent advances of information technology in biomedical sciences and other applied areas have created numerous large diverse data sets with a high dimensional feature space, which provide us a tremendous amount of information and new opportunities for improving the quality of human life. Meanwhile, great challenges are also created driven by the continuous arrival of new data that requires researchers to convert these raw data into scientific knowledge in order to benefit from it. Association studies of complex diseases using SNP data have become more and more popular in biomedical research in recent years. In this paper, we present a review of recent statistical advances and challenges for analyzing correlated high dimensional SNP data in genomic association studies for complex diseases. The review includes both general feature reduction approaches for high dimensional correlated data and more specific approaches for SNPs data, which include unsupervised haplotype mapping, tag SNP selection, and supervised SNPs selection using statistical testing/scoring, statistical modeling and machine learning methods with an emphasis on how to identify interacting loci.","PeriodicalId":46627,"journal":{"name":"Statistics Surveys","volume":"15 1","pages":"43-60"},"PeriodicalIF":11.0000,"publicationDate":"2008-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"46","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics Surveys","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/07-SS026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 46

Abstract

Recent advances of information technology in biomedical sciences and other applied areas have created numerous large diverse data sets with a high dimensional feature space, which provide us a tremendous amount of information and new opportunities for improving the quality of human life. Meanwhile, great challenges are also created driven by the continuous arrival of new data that requires researchers to convert these raw data into scientific knowledge in order to benefit from it. Association studies of complex diseases using SNP data have become more and more popular in biomedical research in recent years. In this paper, we present a review of recent statistical advances and challenges for analyzing correlated high dimensional SNP data in genomic association studies for complex diseases. The review includes both general feature reduction approaches for high dimensional correlated data and more specific approaches for SNPs data, which include unsupervised haplotype mapping, tag SNP selection, and supervised SNPs selection using statistical testing/scoring, statistical modeling and machine learning methods with an emphasis on how to identify interacting loci.
复杂疾病基因组研究中相关高维SNP数据分析的统计学进展与挑战
近年来,信息技术在生物医学等应用领域的进步创造了大量具有高维特征空间的大型多样化数据集,为我们提供了大量的信息和提高人类生活质量的新机会。与此同时,新数据的不断到来也带来了巨大的挑战,这要求研究人员将这些原始数据转化为科学知识,以便从中受益。近年来,利用SNP数据进行复杂疾病的关联研究在生物医学研究中越来越受欢迎。本文综述了复杂疾病基因组关联研究中相关高维SNP数据分析的最新统计进展和挑战。本文综述了用于高维相关数据的一般特征约简方法和用于SNP数据的更具体的方法,包括无监督单倍型映射、标签SNP选择和使用统计测试/评分、统计建模和机器学习方法的监督SNP选择,重点是如何识别相互作用位点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Statistics Surveys
Statistics Surveys STATISTICS & PROBABILITY-
CiteScore
11.70
自引率
0.00%
发文量
5
期刊介绍: Statistics Surveys publishes survey articles in theoretical, computational, and applied statistics. The style of articles may range from reviews of recent research to graduate textbook exposition. Articles may be broad or narrow in scope. The essential requirements are a well specified topic and target audience, together with clear exposition. Statistics Surveys is sponsored by the American Statistical Association, the Bernoulli Society, the Institute of Mathematical Statistics, and by the Statistical Society of Canada.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信