{"title":"正则化分布:定义、性质和应用。","authors":"Zongliang Hu, Yiping Yang, Gaorong Li, T. Tong","doi":"10.1111/sjos.12655","DOIUrl":null,"url":null,"abstract":"For gene expression data analysis, an important task is to identify genes that are differentially expressed between two or more groups. Nevertheless, as biological experiments are often measured with a relatively small number of samples, how to accurately estimate the variances of gene expression becomes a challenging issue. To tackle this problem, we introduce a regularized t$$ t $$ distribution and derive its statistical properties including the probability density function and the moment generating function. The noncentral regularized t$$ t $$ distribution is also introduced for computing the statistical power of hypothesis testing. For practical applications, we apply the regularized t$$ t $$ distribution to establish the null distribution of the regularized t$$ t $$ statistic, and then formulate it as a regularized t$$ t $$ ‐test for detecting the differentially expressed genes. Simulation studies and real data analysis show that our regularized t$$ t $$ ‐test performs much better than the Bayesian t$$ t $$ ‐test in the “limma” package, in particular when the sample sizes are small.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2023-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Regularized t distribution: definition, properties and applications\",\"authors\":\"Zongliang Hu, Yiping Yang, Gaorong Li, T. Tong\",\"doi\":\"10.1111/sjos.12655\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For gene expression data analysis, an important task is to identify genes that are differentially expressed between two or more groups. Nevertheless, as biological experiments are often measured with a relatively small number of samples, how to accurately estimate the variances of gene expression becomes a challenging issue. To tackle this problem, we introduce a regularized t$$ t $$ distribution and derive its statistical properties including the probability density function and the moment generating function. The noncentral regularized t$$ t $$ distribution is also introduced for computing the statistical power of hypothesis testing. For practical applications, we apply the regularized t$$ t $$ distribution to establish the null distribution of the regularized t$$ t $$ statistic, and then formulate it as a regularized t$$ t $$ ‐test for detecting the differentially expressed genes. Simulation studies and real data analysis show that our regularized t$$ t $$ ‐test performs much better than the Bayesian t$$ t $$ ‐test in the “limma” package, in particular when the sample sizes are small.\",\"PeriodicalId\":49567,\"journal\":{\"name\":\"Scandinavian Journal of Statistics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2023-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scandinavian Journal of Statistics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1111/sjos.12655\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scandinavian Journal of Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1111/sjos.12655","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
Regularized t distribution: definition, properties and applications
For gene expression data analysis, an important task is to identify genes that are differentially expressed between two or more groups. Nevertheless, as biological experiments are often measured with a relatively small number of samples, how to accurately estimate the variances of gene expression becomes a challenging issue. To tackle this problem, we introduce a regularized t$$ t $$ distribution and derive its statistical properties including the probability density function and the moment generating function. The noncentral regularized t$$ t $$ distribution is also introduced for computing the statistical power of hypothesis testing. For practical applications, we apply the regularized t$$ t $$ distribution to establish the null distribution of the regularized t$$ t $$ statistic, and then formulate it as a regularized t$$ t $$ ‐test for detecting the differentially expressed genes. Simulation studies and real data analysis show that our regularized t$$ t $$ ‐test performs much better than the Bayesian t$$ t $$ ‐test in the “limma” package, in particular when the sample sizes are small.
期刊介绍:
The Scandinavian Journal of Statistics is internationally recognised as one of the leading statistical journals in the world. It was founded in 1974 by four Scandinavian statistical societies. Today more than eighty per cent of the manuscripts are submitted from outside Scandinavia.
It is an international journal devoted to reporting significant and innovative original contributions to statistical methodology, both theory and applications.
The journal specializes in statistical modelling showing particular appreciation of the underlying substantive research problems.
The emergence of specialized methods for analysing longitudinal and spatial data is just one example of an area of important methodological development in which the Scandinavian Journal of Statistics has a particular niche.