Computer Age Statistical Inference, Student Edition最新文献_第2页

Epilogue 后记

Computer Age Statistical Inference, Student Edition Pub Date : 2021-06-17 DOI: 10.1017/9781108914062.027

引用次数: 0

Survival Analysis and the EM Algorithm 生存分析与EM算法

Computer Age Statistical Inference, Student Edition Pub Date : 2021-06-17 DOI: 10.1017/CBO9781316576533.010

B. Efron, T. Hastie

引用次数: 0

Random Forests and Boosting 随机森林和提升

Computer Age Statistical Inference, Student Edition Pub Date : 2021-06-17 DOI: 10.1017/CBO9781316576533.018

B. Efron, T. Hastie

引用次数: 2

Large-Scale Hypothesis Testing and FDRs 大规模假设检验与fdr

Computer Age Statistical Inference, Student Edition Pub Date : 2021-06-17 DOI: 10.1017/CBO9781316576533.016

B. Efron, T. Hastie

引用次数: 3

Classic Statistical Inference 经典统计推断

Computer Age Statistical Inference, Student Edition Pub Date : 2021-06-17 DOI: 10.1017/9781108914062.003

引用次数: 0

Early Computer-Age Methods 早期计算机时代的方法

Computer Age Statistical Inference, Student Edition Pub Date : 2021-06-17 DOI: 10.1017/9781108914062.009

引用次数: 0

The Jackknife and the Bootstrap 折刀和Bootstrap

Computer Age Statistical Inference, Student Edition Pub Date : 2021-06-17 DOI: 10.1017/9781108914062.014

F. inJ, Xitao Fan, Lin Wang

{"title":"The Jackknife and the Bootstrap","authors":"F. inJ, Xitao Fan, Lin Wang","doi":"10.1017/9781108914062.014","DOIUrl":"https://doi.org/10.1017/9781108914062.014","url":null,"abstract":"The jackknife and bootstrap methods are becoming more popular in research. Although the two approaches have similar goals and employ similar strategies, information is lacking with regard to the comparability of their results. This study systematically investigate,1 the issue for a canonical correlation analysis, using data from four random samples from the National Education Longitudinal Study of 1988. Some conspicuous discrepancies are observed mainly under small sample size conditions, and this raises some concern when researchers need to choose between the two for their small samples. Due to the lack of theoretical sampling distributions in canonical analysis, it is unclear which method had superior performance. It is suggested that Monte Carlo simulation is needed for this kind of comparison. It is also suggested that caution is warranted in generalizing the results to other statistical techniques, since the validity of such generalizations is uncertain. (Contains 6 tables and 18 references.) (Author/SLD)*********************************************************************** Reproductions supplied by EDRS are the best that can be made * from the original document. *********************************************************************** * U S DEPAKTMENT OF EDUCATION Ofl)ce of Educid)onsf ROSCIrCh and improvernni EDUCATIONAL RE SOURCES INFORMATION CENTER (ERIC) TP.s docurnem! Pis beer) reproduced as ,ece.ved from Ihe person or ordsrutstoon or.grnat.op .1 o Manor changes hve been made to approve reproduct)on dowdy Po)nts of rrew opostons slated .ri pus docu. map? do not neCessanly rety.senl ottrcrli OEM 1)03.1.01, or pohcy Jackknife and Bootstrap 1 PERMISSION TO REPRODUCE THIS MA TERIAL HAS BEEN GRANTED BY X 1_1/96 F inJ TO THE EDUCATIONAL RESOURCES INFORMATION CENTER (ERIC) HOW COMPARABLE ARE THE JACKKNIFE AND BOOTSTRAP RESULTS: AN INVESTIGATION FOR A CASE OF CANONICAL CORRELATION ANALYSIS Xitao Fan Utah State University","PeriodicalId":430973,"journal":{"name":"Computer Age Statistical Inference, Student Edition","volume":"05 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127191417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Empirical Bayes Estimation Strategies 经验贝叶斯估计策略

Computer Age Statistical Inference, Student Edition Pub Date : 2021-06-17 DOI: 10.1017/CBO9781316576533.022

B. Efron, T. Hastie

引用次数: 0

Sparse Modeling and the Lasso 稀疏建模和套索

Computer Age Statistical Inference, Student Edition Pub Date : 2016-07-01 DOI: 10.1017/CBO9781316576533.017

B. Efron, T. Hastie

{"title":"Sparse Modeling and the Lasso","authors":"B. Efron, T. Hastie","doi":"10.1017/CBO9781316576533.017","DOIUrl":"https://doi.org/10.1017/CBO9781316576533.017","url":null,"abstract":"The amount of data we are faced with keeps growing. From around the late 1990s we started to see wide data sets, where the number of variables far exceeds the number of observations. This was largely due to our increasing ability to measure a large amount of information automatically. In genomics, for example, we can use a high-throughput experiment to automatically measure the expression of tens of thousands of genes in a sample in a short amount of time. Similarly, sequencing equipment allows us to genotype millions of SNPs (single-nucleotide polymorphisms) cheaply and quickly. In document retrieval and modeling, we represent a document by the presence or count of each word in the dictionary. This easily leads to a feature vector with 20,000 components, one for each distinct vocabulary word, although most would be zero for a small document. If we move to bi-grams or higher, the feature space gets really large. In even more modest situations, we can be faced with hundreds of variables. If these variables are to be predictors in a regression or logistic regression model, we probably do not want to use them all. It is likely that a subset will do the job well, and including all the redundant variables will degrade our fit. Hence we are often interested in identifying a good subset of variables. Note also that in these wide-data situations, even linear models are over-parametrized, so some form of reduction or regularization is essential. In this chapter we will discuss some of the popular methods for model selection, starting with the time-tested and worthy forward-stepwise approach. We then look at the lasso, a popular modern method that does selection and shrinkage via convex optimization. The LARs algorithm ties these two approaches together, and leads to methods that can deliver paths of solutions. Finally, we discuss some connections with other modern big-and widedata approaches, and mention some extensions. Forward Stepwise Regression Stepwise procedures have been around for a very long time. They were originally devised in times when data sets were quite modest in size, in particular in terms of the number of variables. Originally thought of as the poor cousins of “best-subset” selection, they had the advantage of being much cheaper to compute (and in fact possible to compute for large p).We will review best-subset regression first.","PeriodicalId":430973,"journal":{"name":"Computer Age Statistical Inference, Student Edition","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117246099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Parametric Models and Exponential Families 参数模型和指数族

Computer Age Statistical Inference, Student Edition Pub Date : 2016-07-01 DOI: 10.1017/CBO9781316576533.006

B. Efron, T. Hastie

引用次数: 0