t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data.

Microarrays Pub Date : 2014-12-16 DOI:10.3390/microarrays3040340

Marcelo Boareto, Nestor Caticha

{"title":"t-Test at the Probe Level: An Alternative Method to Identify Statistically Significant Genes for Microarray Data.","authors":"Marcelo Boareto, Nestor Caticha","doi":"10.3390/microarrays3040340","DOIUrl":null,"url":null,"abstract":"<p><p>Microarray data analysis typically consists in identifying a list of differentially expressed genes (DEG), i.e., the genes that are differentially expressed between two experimental conditions. Variance shrinkage methods have been considered a better choice than the standard t-test for selecting the DEG because they correct the dependence of the error with the expression level. This dependence is mainly caused by errors in background correction, which more severely affects genes with low expression values. Here, we propose a new method for identifying the DEG that overcomes this issue and does not require background correction or variance shrinkage. Unlike current methods, our methodology is easy to understand and implement. It consists of applying the standard t-test directly on the normalized intensity data, which is possible because the probe intensity is proportional to the gene expression level and because the t-test is scale- and location-invariant. This methodology considerably improves the sensitivity and robustness of the list of DEG when compared with the t-test applied to preprocessed data and to the most widely used shrinkage methods, Significance Analysis of Microarrays (SAM) and Linear Models for Microarray Data (LIMMA). Our approach is useful especially when the genes of interest have small differences in expression and therefore get ignored by standard variance shrinkage methods. </p>","PeriodicalId":56355,"journal":{"name":"Microarrays","volume":"3 4","pages":"340-51"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3390/microarrays3040340","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microarrays","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/microarrays3040340","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Microarray data analysis typically consists in identifying a list of differentially expressed genes (DEG), i.e., the genes that are differentially expressed between two experimental conditions. Variance shrinkage methods have been considered a better choice than the standard t-test for selecting the DEG because they correct the dependence of the error with the expression level. This dependence is mainly caused by errors in background correction, which more severely affects genes with low expression values. Here, we propose a new method for identifying the DEG that overcomes this issue and does not require background correction or variance shrinkage. Unlike current methods, our methodology is easy to understand and implement. It consists of applying the standard t-test directly on the normalized intensity data, which is possible because the probe intensity is proportional to the gene expression level and because the t-test is scale- and location-invariant. This methodology considerably improves the sensitivity and robustness of the list of DEG when compared with the t-test applied to preprocessed data and to the most widely used shrinkage methods, Significance Analysis of Microarrays (SAM) and Linear Models for Microarray Data (LIMMA). Our approach is useful especially when the genes of interest have small differences in expression and therefore get ignored by standard variance shrinkage methods.

Abstract Image

查看原文本刊更多论文

探针水平的t检验:一种识别微阵列数据统计显著基因的替代方法。

微阵列数据分析通常包括识别差异表达基因(DEG)列表，即在两种实验条件下差异表达的基因。方差收缩法被认为是比标准t检验选择DEG更好的选择，因为它们纠正了误差与表达水平的依赖性。这种依赖主要是由于背景校正错误造成的，对低表达值基因的影响更严重。在这里，我们提出了一种新的方法来识别DEG，克服了这个问题，不需要背景校正或方差收缩。与当前的方法不同，我们的方法易于理解和实现。它包括直接对归一化强度数据应用标准t检验，这是可能的，因为探针强度与基因表达水平成正比，因为t检验是规模和位置不变的。与应用于预处理数据的t检验和最广泛使用的收缩方法，微阵列显著性分析(SAM)和微阵列数据线性模型(LIMMA)相比，该方法大大提高了DEG列表的灵敏度和稳健性。我们的方法是有用的，特别是当感兴趣的基因在表达上有很小的差异，因此被标准方差收缩方法忽略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Microarrays

自引率

0.00%

发文量

审稿时长

11 weeks

期刊介绍： High-Throughput (formerly Microarrays, ISSN 2076-3905) is a multidisciplinary peer-reviewed scientific journal that provides an advanced forum for the publication of studies reporting high-dimensional approaches and developments in Life Sciences, Chemistry and related fields. Our aim is to encourage scientists to publish their experimental and theoretical results based on high-throughput techniques as well as computational and statistical tools for data analysis and interpretation. The full experimental or methodological details must be provided so that the results can be reproduced. There is no restriction on the length of the papers. High-Throughput invites submissions covering several topics, including, but not limited to: Microarrays, DNA Sequencing, RNA Sequencing, Protein Identification and Quantification, Cell-based Approaches, Omics Technologies, Imaging, Bioinformatics, Computational Biology/Chemistry, Statistics, Integrative Omics, Drug Discovery and Development, Microfluidics, Lab-on-a-chip, Data Mining, Databases, Multiplex Assays.