{"title":"Computational Tools for SNP Interactions - How Good Are They?","authors":"F. R. B. Araujo, K. Guimaraes","doi":"10.1109/BIBE.2011.53","DOIUrl":null,"url":null,"abstract":"It is no trivial task to sift through huge amounts of SNP data to detect interactions between SNPs that can be relevant to identify propensity for a certain disease or a phenotype trait of interest, especially because many times it also involves the influence of environmental aspects. In a previous work, we analyzed the impact of different epistatic models on the accuracy of exhaustive computational methods. Those methods have good accuracy, but they are by nature, highly computationally demanding, hence not well suited for large population size or large number of SNPs, as found in genome-wide studies. In this paper, we report the results of a comparative study of methods for detecting epistatic interactions, based on recent trends, namely greedy and Bayesian computational approaches. Our experiments reveal that all methods have better performance in scenarios with higher values for heritability and minor allele frequency (MAF). In general, in terms of accuracy, BOOST outperformed the other methods studied. Even presenting an statistically significantly better performance, BOOST could not reach 40% accuracy when there were 50 or more SNPs, for cases with heritability 0.01 and MAF 0.2, even with a large number of individuals. Keywords-SNPs; Interactions; Computational Approaches;","PeriodicalId":391184,"journal":{"name":"2011 IEEE 11th International Conference on Bioinformatics and Bioengineering","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE 11th International Conference on Bioinformatics and Bioengineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2011.53","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
It is no trivial task to sift through huge amounts of SNP data to detect interactions between SNPs that can be relevant to identify propensity for a certain disease or a phenotype trait of interest, especially because many times it also involves the influence of environmental aspects. In a previous work, we analyzed the impact of different epistatic models on the accuracy of exhaustive computational methods. Those methods have good accuracy, but they are by nature, highly computationally demanding, hence not well suited for large population size or large number of SNPs, as found in genome-wide studies. In this paper, we report the results of a comparative study of methods for detecting epistatic interactions, based on recent trends, namely greedy and Bayesian computational approaches. Our experiments reveal that all methods have better performance in scenarios with higher values for heritability and minor allele frequency (MAF). In general, in terms of accuracy, BOOST outperformed the other methods studied. Even presenting an statistically significantly better performance, BOOST could not reach 40% accuracy when there were 50 or more SNPs, for cases with heritability 0.01 and MAF 0.2, even with a large number of individuals. Keywords-SNPs; Interactions; Computational Approaches;