Statistical Applications in Genetics and Molecular Biology最新文献

筛选
英文 中文
Bayesian mixed-effects model for the analysis of a series of FRAP images. 贝叶斯混合效应模型对一系列FRAP图像的分析。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2015-02-01 DOI: 10.1515/sagmb-2014-0013
Martina Feilke, Katrin Schneider, Volker J Schmid
{"title":"Bayesian mixed-effects model for the analysis of a series of FRAP images.","authors":"Martina Feilke,&nbsp;Katrin Schneider,&nbsp;Volker J Schmid","doi":"10.1515/sagmb-2014-0013","DOIUrl":"https://doi.org/10.1515/sagmb-2014-0013","url":null,"abstract":"<p><p>The binding behavior of molecules in nuclei of living cells can be studied through the analysis of images from fluorescence recovery after photobleaching experiments. However, there is still a lack of methodology for the statistical evaluation of FRAP data, especially for the joint analysis of multiple dynamic images. We propose a hierarchical Bayesian nonlinear model with mixed-effect priors based on local compartment models in order to obtain joint parameter estimates for all nuclei as well as to account for the heterogeneity of the nuclei population. We apply our method to a series of FRAP experiments of DNA methyltransferase 1 tagged to green fluorescent protein expressed in a somatic mouse cell line and compare the results to the application of three different fixed-effects models to the same series of FRAP experiments. With the proposed model, we get estimates of the off-rates of the interactions of the molecules under study together with credible intervals, and additionally gain information about the variability between nuclei. The proposed model is superior to and more robust than the tested fixed-effects models. Therefore, it can be used for the joint analysis of data from FRAP experiments on various similar nuclei.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2014-0013","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32905409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Bayesian mixture model for chromatin interaction data. 染色质相互作用数据的贝叶斯混合模型。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2015-02-01 DOI: 10.1515/sagmb-2014-0029
Liang Niu, Shili Lin
{"title":"A Bayesian mixture model for chromatin interaction data.","authors":"Liang Niu,&nbsp;Shili Lin","doi":"10.1515/sagmb-2014-0029","DOIUrl":"https://doi.org/10.1515/sagmb-2014-0029","url":null,"abstract":"<p><p>Chromatin interactions mediated by a particular protein are of interest for studying gene regulation, especially the regulation of genes that are associated with, or known to be causative of, a disease. A recent molecular technique, Chromatin interaction analysis by paired-end tag sequencing (ChIA-PET), that uses chromatin immunoprecipitation (ChIP) and high throughput paired-end sequencing, is able to detect such chromatin interactions genomewide. However, ChIA-PET may generate noise (i.e., pairings of DNA fragments by random chance) in addition to true signal (i.e., pairings of DNA fragments by interactions). In this paper, we propose MC_DIST based on a mixture modeling framework to identify true chromatin interactions from ChIA-PET count data (counts of DNA fragment pairs). The model is cast into a Bayesian framework to take into account the dependency among the data and the available information on protein binding sites and gene promoters to reduce false positives. A simulation study showed that MC_DIST outperforms the previously proposed hypergeometric model in terms of both power and type I error rate. A real data study showed that MC_DIST may identify potential chromatin interactions between protein binding sites and gene promoters that may be missed by the hypergeometric model. An R package implementing the MC_DIST model is available at http://www.stat.osu.edu/~statgen/SOFTWARE/MDM.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2014-0029","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32890093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A region-based multiple testing method for hypotheses ordered in space or time. 一种在空间或时间上有序的假设的基于区域的多重检验方法。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2015-02-01 DOI: 10.1515/sagmb-2013-0075
Rosa J Meijer, Thijmen J P Krebs, Jelle J Goeman
{"title":"A region-based multiple testing method for hypotheses ordered in space or time.","authors":"Rosa J Meijer,&nbsp;Thijmen J P Krebs,&nbsp;Jelle J Goeman","doi":"10.1515/sagmb-2013-0075","DOIUrl":"https://doi.org/10.1515/sagmb-2013-0075","url":null,"abstract":"<p><p>We present a multiple testing method for hypotheses that are ordered in space or time. Given such hypotheses, the elementary hypotheses as well as regions of consecutive hypotheses are of interest. These region hypotheses not only have intrinsic meaning but testing them also has the advantage that (potentially small) signals across a region are combined in one test. Because the expected number and length of potentially interesting regions are usually not available beforehand, we propose a method that tests all possible region hypotheses as well as all individual hypotheses in a single multiple testing procedure that controls the familywise error rate. We start at testing the global null-hypothesis and when this hypothesis can be rejected we continue with further specifying the exact location/locations of the effect present. The method is implemented in the R package cherry and is illustrated on a DNA copy number data set.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2013-0075","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32921938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A hidden Markov-model for gene mapping based on whole-genome next generation sequencing data. 基于下一代全基因组测序数据的基因定位隐马尔可夫模型。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2015-02-01 DOI: 10.1515/sagmb-2014-0007
Jürgen Claesen, Tomasz Burzykowski
{"title":"A hidden Markov-model for gene mapping based on whole-genome next generation sequencing data.","authors":"Jürgen Claesen,&nbsp;Tomasz Burzykowski","doi":"10.1515/sagmb-2014-0007","DOIUrl":"https://doi.org/10.1515/sagmb-2014-0007","url":null,"abstract":"<p><p>The analysis of polygenic, phenotypic characteristics such as quantitative traits or inheritable diseases requires reliable scoring of many genetic markers covering the entire genome. The advent of high-throughput sequencing technologies provides a new way to evaluate large numbers of single nucleotide polymorphisms as genetic markers. Combining the technologies with pooling of segregants, as performed in bulk segregant analysis, should, in principle, allow the simultaneous mapping of multiple genetic loci present throughout the genome. We propose a hidden Markov-model to analyze the marker data obtained by the bulk segregant next generation sequencing. The model includes several states, each associated with a different probability of observing the same/different nucleotide in an offspring as compared to the parent. The transitions between the molecular markers imply transitions between the states of the model. After estimating the transition probabilities and state-related probabilities of nucleotide (dis)similarity, the most probable state for each SNP is selected. The most probable states can then be used to indicate which genomic regions may be likely to contain trait-related genes. The application of the model is illustrated on the data from a study of ethanol tolerance in yeast. Software is written in R. R-functions, R-scripts and documentation are available on www.ibiostat.be/software/bioinformatics.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2014-0007","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32885132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Exploring homogeneity of correlation structures of gene expression datasets within and between etiological disease categories. 探索病原学疾病类别内部和之间基因表达数据集相关结构的同质性。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2014-12-01 DOI: 10.1515/sagmb-2014-0003
Victor L Jong, Putri W Novianti, Kit C B Roes, Marinus J C Eijkemans
{"title":"Exploring homogeneity of correlation structures of gene expression datasets within and between etiological disease categories.","authors":"Victor L Jong,&nbsp;Putri W Novianti,&nbsp;Kit C B Roes,&nbsp;Marinus J C Eijkemans","doi":"10.1515/sagmb-2014-0003","DOIUrl":"https://doi.org/10.1515/sagmb-2014-0003","url":null,"abstract":"<p><p>The literature shows that classifiers perform differently across datasets and that correlations within datasets affect the performance of classifiers. The question that arises is whether the correlation structure within datasets differ significantly across diseases. In this study, we evaluated the homogeneity of correlation structures within and between datasets of six etiological disease categories; inflammatory, immune, infectious, degenerative, hereditary and acute myeloid leukemia (AML). We also assessed the effect of filtering; detection call and variance filtering on correlation structures. We downloaded microarray datasets from ArrayExpress for experiments meeting predefined criteria and ended up with 12 datasets for non-cancerous diseases and six for AML. The datasets were preprocessed by a common procedure incorporating platform-specific recommendations and the two filtering methods mentioned above. Homogeneity of correlation matrices between and within datasets of etiological diseases was assessed using the Box's M statistic on permuted samples. We found that correlation structures significantly differ between datasets of the same and/or different etiological disease categories and that variance filtering eliminates more uncorrelated probesets than detection call filtering and thus renders the data highly correlated.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2014-0003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32906123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Covariate adjusted differential variability analysis of DNA methylation with propensity score method. 用倾向评分法分析DNA甲基化的协变量调整差异变异性。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2014-12-01 DOI: 10.1515/sagmb-2013-0072
Pei Fen Kuan
{"title":"Covariate adjusted differential variability analysis of DNA methylation with propensity score method.","authors":"Pei Fen Kuan","doi":"10.1515/sagmb-2013-0072","DOIUrl":"https://doi.org/10.1515/sagmb-2013-0072","url":null,"abstract":"<p><p>It has been proposed recently that differentially variable CpG methylation (DVC) may contribute to transcriptional aberrations in human diseases. In large scale epigenetic studies, potential confounders could affect the observed methylation variabilities and need to be accounted for. In this paper, we develop a robust statistical model for differential variability DVC analysis that accounts for potential confounding covariates by utilizing the propensity score method. Our method is based on a weighted score test on strata generated propensity score stratification. To the best of our knowledge, this is the first proposed statistical method for detecting DVCs that adjusts for confounding covariates. We show that this method is robust against model misspecification and achieves good operating characteristics based on extensive simulations and a case study.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2013-0072","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32761678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
P-value calibration for multiple testing problems in genomics. 基因组学中多重检测问题的p值校准。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2014-12-01 DOI: 10.1515/sagmb-2013-0074
John P Ferguson, Dean Palejev
{"title":"P-value calibration for multiple testing problems in genomics.","authors":"John P Ferguson,&nbsp;Dean Palejev","doi":"10.1515/sagmb-2013-0074","DOIUrl":"https://doi.org/10.1515/sagmb-2013-0074","url":null,"abstract":"<p><p>Conservative statistical tests are often used in complex multiple testing settings in which computing the type I error may be difficult. In such tests, the reported p-value for a hypothesis can understate the evidence against the null hypothesis and consequently statistical power may be lost. False Discovery Rate adjustments, used in multiple comparison settings, can worsen the unfavorable effect. We present a computationally efficient and test-agnostic calibration technique that can substantially reduce the conservativeness of such tests. As a consequence, a lower sample size might be sufficient to reject the null hypothesis for true alternatives, and experimental costs can be lowered. We apply the calibration technique to the results of DESeq, a popular method for detecting differentially expressed genes from RNA sequencing data. The increase in power may be particularly high in small sample size experiments, often used in preliminary experiments and funding applications.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2013-0074","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32761679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
When is Menzerath-Altmann law mathematically trivial? A new approach. 什么时候Menzerath-Altmann定律在数学上是微不足道的?一种新的方法。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2014-12-01 DOI: 10.1515/sagmb-2013-0034
Ramon Ferrer-i-Cancho, Antoni Hernández-Fernández, Jaume Baixeries, Łukasz Dębowski, Ján Mačutek
{"title":"When is Menzerath-Altmann law mathematically trivial? A new approach.","authors":"Ramon Ferrer-i-Cancho,&nbsp;Antoni Hernández-Fernández,&nbsp;Jaume Baixeries,&nbsp;Łukasz Dębowski,&nbsp;Ján Mačutek","doi":"10.1515/sagmb-2013-0034","DOIUrl":"https://doi.org/10.1515/sagmb-2013-0034","url":null,"abstract":"<p><p>Menzerath's law, the tendency of Z (the mean size of the parts) to decrease as X (the number of parts) increases, is found in language, music and genomes. Recently, it has been argued that the presence of the law in genomes is an inevitable consequence of the fact that Z=Y/X, which would imply that Z scales with X as Z ∼ 1/X. That scaling is a very particular case of Menzerath-Altmann law that has been rejected by means of a correlation test between X and Y in genomes, being X the number of chromosomes of a species, Y its genome size in bases and Z the mean chromosome size. Here we review the statistical foundations of that test and consider three non-parametric tests based upon different correlation metrics and one parametric test to evaluate if Z ∼ 1/X in genomes. The most powerful test is a new non-parametric one based upon the correlation ratio, which is able to reject Z ∼ 1/X in nine out of 11 taxonomic groups and detect a borderline group. Rather than a fact, Z ∼ 1/X is a baseline that real genomes do not meet. The view of Menzerath-Altmann law as inevitable is seriously flawed.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2013-0034","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32906121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Markovianness and conditional independence in annotated bacterial DNA. 注释细菌DNA的马尔可夫性和条件独立性。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2014-12-01 DOI: 10.1515/sagmb-2014-0002
Andrew Hart, Servet Martínez
{"title":"Markovianness and conditional independence in annotated bacterial DNA.","authors":"Andrew Hart,&nbsp;Servet Martínez","doi":"10.1515/sagmb-2014-0002","DOIUrl":"https://doi.org/10.1515/sagmb-2014-0002","url":null,"abstract":"<p><p>We explore the probabilistic structure of DNA in a number of bacterial genomes and conclude that a form of Markovianness is present at the boundaries between coding and non-coding regions, that is, the sequence of START and STOP codons annotated for the bacterial genome. This sequence is shown to satisfy a conditional independence property which allows its governing Markov chain to be uniquely identified from the abundances of START and STOP codons. Furthermore, we show that the annotated sequence of STARTs and STOPs complies with Chargaff's second parity rule.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2014-0002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32906122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Robust methods to detect disease-genotype association in genetic association studies: calculate p-values using exact conditional enumeration instead of simulated permutations or asymptotic approximations. 在遗传关联研究中检测疾病-基因型关联的鲁棒方法:使用精确条件枚举而不是模拟排列或渐近近似计算p值。
IF 0.9 4区 数学
Statistical Applications in Genetics and Molecular Biology Pub Date : 2014-12-01 DOI: 10.1515/sagmb-2013-0084
Mette Langaas, Øyvind Bakke
{"title":"Robust methods to detect disease-genotype association in genetic association studies: calculate p-values using exact conditional enumeration instead of simulated permutations or asymptotic approximations.","authors":"Mette Langaas,&nbsp;Øyvind Bakke","doi":"10.1515/sagmb-2013-0084","DOIUrl":"https://doi.org/10.1515/sagmb-2013-0084","url":null,"abstract":"<p><p>In genetic association studies, detecting disease-genotype association is a primary goal. We study seven robust test statistics for such association when the underlying genetic model is unknown, for data on disease status (case or control) and genotype (three genotypes of a biallelic genetic marker). In such studies, p-values have predominantly been calculated by asymptotic approximations or by simulated permutations. We consider an exact method, conditional enumeration. When the number of simulated permutations tends to infinity, the permutation p-value approaches the conditional enumeration p-value, but calculating the latter is much more efficient than performing simulated permutations. We have studied case-control sample sizes with 500-5000 cases and 500-15,000 controls, and significance levels from 5 × 10(-8) to 0.05, thus our results are applicable to genetic association studies with only a few genetic markers under study, intermediate follow-up studies, and genome-wide association studies. Our main findings are: (i) If all monotone genetic models are of interest, the best performance in the situations under study is achieved for the robust test statistics based on the maximum over a range of Cochran-Armitage trend tests with different scores and for the constrained likelihood ratio test. (ii) For significance levels below 0.05, for the test statistics under study, asymptotic approximations may give a test size up to 20 times the nominal level, and should therefore be used with caution. (iii) Calculating p-values based on exact conditional enumeration is a powerful, valid and computationally feasible approach, and we advocate its use in genetic association studies.</p>","PeriodicalId":48980,"journal":{"name":"Statistical Applications in Genetics and Molecular Biology","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/sagmb-2013-0084","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32755665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信