American Statistician最新文献

筛选
英文 中文
Hitting a prime by rolling a die with infinitely many faces 掷一个有无限多个面的骰子来得到一个素数
IF 1.8 4区 数学
American Statistician Pub Date : 2023-12-01 DOI: 10.1080/00031305.2023.2290720
Shane Chern
{"title":"Hitting a prime by rolling a die with infinitely many faces","authors":"Shane Chern","doi":"10.1080/00031305.2023.2290720","DOIUrl":"https://doi.org/10.1080/00031305.2023.2290720","url":null,"abstract":"Alon and Malinovsky recently proved that it takes on average 2.42849… rolls of fair six-sided dice until the first time the total sum of all rolls arrives at a prime. Naturally, one may extend the...","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":" 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138473493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Conformal Win Probability to Predict the Winners of the Canceled 2020 NCAA Basketball Tournaments 利用保形获胜概率预测取消的2020年NCAA篮球锦标赛的获胜者
IF 1.8 4区 数学
American Statistician Pub Date : 2023-11-17 DOI: 10.1080/00031305.2023.2283199
Chancellor Johnstone, Dan Nettleton
{"title":"Using Conformal Win Probability to Predict the Winners of the Canceled 2020 NCAA Basketball Tournaments","authors":"Chancellor Johnstone, Dan Nettleton","doi":"10.1080/00031305.2023.2283199","DOIUrl":"https://doi.org/10.1080/00031305.2023.2283199","url":null,"abstract":"The COVID-19 pandemic was responsible for the cancellation of both the men’s and women’s 2020 National Collegiate Athletic Association (NCAA) Division I basketball tournaments. Starting from the po...","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"61 2","pages":""},"PeriodicalIF":1.8,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138438948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assignment-Control Plots: A Visual Companion for Causal Inference Study Design. 赋值-对照图:因果推理研究设计的可视化伴侣。
IF 1.8 4区 数学
American Statistician Pub Date : 2023-01-01 Epub Date: 2022-04-11 DOI: 10.1080/00031305.2022.2051605
Rachael C Aikens, Michael Baiocchi
{"title":"Assignment-Control Plots: A Visual Companion for Causal Inference Study Design.","authors":"Rachael C Aikens, Michael Baiocchi","doi":"10.1080/00031305.2022.2051605","DOIUrl":"10.1080/00031305.2022.2051605","url":null,"abstract":"<p><p>An important step for any causal inference study design is understanding the distribution of the subjects in terms of measured baseline covariates. However, not all baseline variation is equally important. We propose a set of visualizations that reduce the space of measured covariates into two components of baseline variation important to the design of an observational causal inference study: a propensity score summarizing baseline variation associated with treatment assignment, and prognostic score summarizing baseline variation associated with the untreated potential outcome. These <i>assignment-control plots</i> and variations thereof visualize study design trade-offs and illustrate core methodological concepts in causal inference. As a practical demonstration, we apply assignment-control plots to a hypothetical study of cardiothoracic surgery. To demonstrate how these plots can be used to illustrate nuanced concepts, we use them to visualize unmeasured confounding and to consider the relationship between propensity scores and instrumental variables. While the family of visualization tools for studies of causality is relatively sparse, simple visual tools can be an asset to education, application, and methods development.</p>","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"77 1","pages":"72-84"},"PeriodicalIF":1.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9916271/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10712591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
The Sign Test, Paired Data, and Asymmetric Dependence: A Cautionary Tale. 符号检验、配对数据和非对称依赖性:警示故事。
IF 1.8 4区 数学
American Statistician Pub Date : 2023-01-01 Epub Date: 2022-09-23 DOI: 10.1080/00031305.2022.2110938
Alan D Hutson, Han Yu
{"title":"The Sign Test, Paired Data, and Asymmetric Dependence: A Cautionary Tale.","authors":"Alan D Hutson, Han Yu","doi":"10.1080/00031305.2022.2110938","DOIUrl":"10.1080/00031305.2022.2110938","url":null,"abstract":"<p><p>In the paired data setting, the sign test is often described in statistical textbooks as a test for comparing differences between the medians of two marginal distributions. There is an implicit assumption that the median of the differences is equivalent to the difference of the medians when employing the sign test in this fashion. We demonstrate however that given asymmetry in the bivariate distribution of the paired data, there are often scenarios where the median of the differences is not equal to the difference of the medians. Further, we show that these scenarios will lead to a false interpretation of the sign test for its intended use in the paired data setting. We illustrate the false-interpretation concept via theory, a simulation study, and through a real-world example based on breast cancer RNA sequencing data obtained from the Cancer Genome Atlas (TCGA).</p>","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"77 1","pages":"35-40"},"PeriodicalIF":1.8,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10275333/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9708928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Expressing regret: a unified view of credible intervals. 表示遗憾:对可信时间间隔的统一看法。
IF 1.8 4区 数学
American Statistician Pub Date : 2022-01-01 DOI: 10.1080/00031305.2022.2039764
Kenneth Rice, Lingbo Ye
{"title":"Expressing regret: a unified view of credible intervals.","authors":"Kenneth Rice,&nbsp;Lingbo Ye","doi":"10.1080/00031305.2022.2039764","DOIUrl":"https://doi.org/10.1080/00031305.2022.2039764","url":null,"abstract":"<p><p>Posterior uncertainty is typically summarized as a credible interval, an interval in the parameter space that contains a fixed proportion - usually 95% - of the posterior's support. For multivariate parameters, credible sets perform the same role. There are of course many potential 95% intervals from which to choose, yet even standard choices are rarely justified in any formal way. In this paper we give a general method, focusing on the loss function that motivates an estimate - the Bayes rule - around which we construct a credible set. The set contains all points which, as estimates, would have minimally-worse expected loss than the Bayes rule: we call this excess expected loss 'regret'. The approach can be used for any model and prior, and we show how it justifies all widely-used choices of credible interval/set. Further examples show how it provides insights into more complex estimation problems.</p>","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"76 3","pages":"248-256"},"PeriodicalIF":1.8,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9401190/pdf/nihms-1798412.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9117292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Statistical implications of endogeneity induced by residential segregation in small-area modelling of health inequities. 居住隔离引起的内生性在卫生不平等小区域模型中的统计意义。
IF 1.8 4区 数学
American Statistician Pub Date : 2022-01-01 DOI: 10.1080/00031305.2021.2003245
Rachel C Nethery, Jarvis T Chen, Nancy Krieger, Pamela D Waterman, Emily Peterson, Lance A Waller, Brent A Coull
{"title":"Statistical implications of endogeneity induced by residential segregation in small-area modelling of health inequities.","authors":"Rachel C Nethery,&nbsp;Jarvis T Chen,&nbsp;Nancy Krieger,&nbsp;Pamela D Waterman,&nbsp;Emily Peterson,&nbsp;Lance A Waller,&nbsp;Brent A Coull","doi":"10.1080/00031305.2021.2003245","DOIUrl":"https://doi.org/10.1080/00031305.2021.2003245","url":null,"abstract":"<p><p>Health inequities are assessed by health departments to identify social groups disproportionately burdened by disease and by academic researchers to understand how social, economic, and environmental inequities manifest as health inequities. To characterize inequities, group-specific small-area health data are often modeled using log-linear generalized linear models (GLM) or generalized linear mixed models (GLMM) with a random intercept. These approaches estimate the same marginal rate ratio comparing disease rates across groups under standard assumptions. Here we explore how residential segregation combined with social group differences in disease risk can lead to contradictory findings from the GLM and GLMM. We show that this occurs because small-area disease rate data collected under these conditions induce endogeneity in the GLMM due to correlation between the model's offset and random effect. This results in GLMM estimates that represent conditional rather than marginal associations. We refer to endogeneity arising from the offset, which to our knowledge has not been noted previously, as \"offset endogeneity\". We illustrate this phenomenon in simulated data and real premature mortality data, and we propose alternative modeling approaches to address it. We also introduce to a statistical audience the social epidemiologic terminology for framing health inequities, which enables responsible interpretation of results.</p>","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"76 2","pages":"142-151"},"PeriodicalIF":1.8,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9070859/pdf/nihms-1762308.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10541651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Learning Hamiltonian Monte Carlo in R. 用 R 学习汉密尔顿蒙特卡洛算法
IF 1.8 4区 数学
American Statistician Pub Date : 2021-01-01 Epub Date: 2021-01-31 DOI: 10.1080/00031305.2020.1865198
Samuel Thomas, Wanzhu Tu
{"title":"Learning Hamiltonian Monte Carlo in R.","authors":"Samuel Thomas, Wanzhu Tu","doi":"10.1080/00031305.2020.1865198","DOIUrl":"10.1080/00031305.2020.1865198","url":null,"abstract":"<p><p>Hamiltonian Monte Carlo (HMC) is a powerful tool for Bayesian computation. In comparison with the traditional Metropolis-Hastings algorithm, HMC offers greater computational efficiency, especially in higher dimensional or more complex modeling situations. To most statisticians, however, the idea of HMC comes from a less familiar origin, one that is based on the theory of classical mechanics. Its implementation, either through Stan or one of its derivative programs, can appear opaque to beginners. A lack of understanding of the inner working of HMC, in our opinion, has hindered its application to a broader range of statistical problems. In this article, we review the basic concepts of HMC in a language that is more familiar to statisticians, and we describe an HMC implementation in R, one of the most frequently used statistical software environments. We also present hmclearn, an R package for learning HMC. This package contains a general-purpose HMC function for data analysis. We illustrate the use of this package in common statistical models. In doing so, we hope to promote this powerful computational tool for wider use. Example code for common statistical models is presented as supplementary material for online publication.</p>","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"75 4","pages":"403-413"},"PeriodicalIF":1.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10353725/pdf/nihms-1670958.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9852609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sampling Strategies for Fast Updating of Gaussian Markov Random Fields. 高斯马尔可夫随机场快速更新的采样策略
IF 1.8 4区 数学
American Statistician Pub Date : 2021-01-01 Epub Date: 2019-05-31 DOI: 10.1080/00031305.2019.1595144
D Andrew Brown, Christopher S McMahan, Stella Watson Self
{"title":"Sampling Strategies for Fast Updating of Gaussian Markov Random Fields.","authors":"D Andrew Brown, Christopher S McMahan, Stella Watson Self","doi":"10.1080/00031305.2019.1595144","DOIUrl":"10.1080/00031305.2019.1595144","url":null,"abstract":"<p><p>Gaussian Markov random fields (GMRFs) are popular for modeling dependence in large areal datasets due to their ease of interpretation and computational convenience afforded by the sparse precision matrices needed for random variable generation. Typically in Bayesian computation, GMRFs are updated jointly in a block Gibbs sampler or componentwise in a single-site sampler via the full conditional distributions. The former approach can speed convergence by updating correlated variables all at once, while the latter avoids solving large matrices. We consider a sampling approach in which the underlying graph can be cut so that conditionally independent sites are updated simultaneously. This algorithm allows a practitioner to parallelize updates of subsets of locations or to take advantage of 'vectorized' calculations in a high-level language such as R. Through both simulated and real data, we demonstrate computational savings that can be achieved versus both single-site and block updating, regardless of whether the data are on a regular or an irregular lattice. The approach provides a good compromise between statistical and computational efficiency and is accessible to statisticians without expertise in numerical analysis or advanced computing.</p>","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"75 1","pages":"52-65"},"PeriodicalIF":1.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7954130/pdf/nihms-1547742.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25485801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review of Bayesian Perspectives on Sample Size Derivation for Confirmatory Trials. 贝叶斯观点对确证试验样本大小推导的回顾。
IF 1.8 4区 数学
American Statistician Pub Date : 2021-01-01 Epub Date: 2021-04-22 DOI: 10.1080/00031305.2021.1901782
Kevin Kunzmann, Michael J Grayling, Kim May Lee, David S Robertson, Kaspar Rufibach, James M S Wason
{"title":"A Review of Bayesian Perspectives on Sample Size Derivation for Confirmatory Trials.","authors":"Kevin Kunzmann, Michael J Grayling, Kim May Lee, David S Robertson, Kaspar Rufibach, James M S Wason","doi":"10.1080/00031305.2021.1901782","DOIUrl":"10.1080/00031305.2021.1901782","url":null,"abstract":"<p><p>Sample size derivation is a crucial element of planning any confirmatory trial. The required sample size is typically derived based on constraints on the maximal acceptable Type I error rate and minimal desired power. Power depends on the unknown true effect and tends to be calculated either for the smallest relevant effect or a likely point alternative. The former might be problematic if the minimal relevant effect is close to the null, thus requiring an excessively large sample size, while the latter is dubious since it does not account for the a priori uncertainty about the likely alternative effect. A Bayesian perspective on sample size derivation for a frequentist trial can reconcile arguments about the relative a priori plausibility of alternative effects with ideas based on the relevance of effect sizes. Many suggestions as to how such \"hybrid\" approaches could be implemented in practice have been put forward. However, key quantities are often defined in subtly different ways in the literature. Starting from the traditional entirely frequentist approach to sample size derivation, we derive consistent definitions for the most commonly used hybrid quantities and highlight connections, before discussing and demonstrating their use in sample size derivation for clinical trials.</p>","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":" ","pages":"424-432"},"PeriodicalIF":1.8,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7612172/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39652616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On Causal Inferences for Personalized Medicine: How Hidden Causal Assumptions Led to Erroneous Causal Claims About the D-Value. 论个体化医疗的因果推论:隐藏的因果假设如何导致关于d值的错误因果主张。
IF 1.8 4区 数学
American Statistician Pub Date : 2020-01-01 Epub Date: 2019-05-20 DOI: 10.1080/00031305.2019.1575771
Sander Greenland, Michael P Fay, Erica H Brittain, Joanna H Shih, Dean A Follmann, Erin E Gabriel, James M Robins
{"title":"On Causal Inferences for Personalized Medicine: How Hidden Causal Assumptions Led to Erroneous Causal Claims About the D-Value.","authors":"Sander Greenland,&nbsp;Michael P Fay,&nbsp;Erica H Brittain,&nbsp;Joanna H Shih,&nbsp;Dean A Follmann,&nbsp;Erin E Gabriel,&nbsp;James M Robins","doi":"10.1080/00031305.2019.1575771","DOIUrl":"https://doi.org/10.1080/00031305.2019.1575771","url":null,"abstract":"<p><p>Personalized medicine asks if a new treatment will help a particular patient, rather than if it improves the average response in a population. Without a causal model to distinguish these questions, interpretational mistakes arise. These mistakes are seen in an article by Demidenko [2016] that recommends the \"D-value,\" which is the probability that a randomly chosen person from the new-treatment group has a higher value for the outcome than a randomly chosen person from the control-treatment group. The abstract states \"The D-value has a clear interpretation as the proportion of patients who get worse after the treatment\" with similar assertions appearing later. We show these statements are incorrect because they require assumptions about the potential outcomes which are neither testable in randomized experiments nor plausible in general. The D-value will <i>not</i> equal the proportion of patients who get worse after treatment if (as expected) those outcomes are correlated. Independence of potential outcomes is unrealistic and eliminates <i>any</i> personalized treatment effects; with dependence, the D-value can even imply treatment is better than control <i>even though most patients are harmed by the treatment</i>. Thus, D-values are misleading for personalized medicine. To prevent misunderstandings, we advise incorporating causal models into basic statistics education.</p>","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"74 3","pages":"243-248"},"PeriodicalIF":1.8,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/00031305.2019.1575771","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38853141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信