Divergence versus decision P‐values: A distinction worth making in theory and keeping in practice: Or, how divergence P‐values measure evidence even when decision P‐values do not

IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY
S. Greenland
{"title":"Divergence versus decision P‐values: A distinction worth making in theory and keeping in practice: Or, how divergence P‐values measure evidence even when decision P‐values do not","authors":"S. Greenland","doi":"10.1111/sjos.12625","DOIUrl":null,"url":null,"abstract":"There are two distinct definitions of “P‐value” for evaluating a proposed hypothesis or model for the process generating an observed dataset. The original definition starts with a measure of the divergence of the dataset from what was expected under the model, such as a sum of squares or a deviance statistic. A P‐value is then the ordinal location of the measure in a reference distribution computed from the model and the data, and is treated as a unit‐scaled index of compatibility between the data and the model. In the other definition, a P‐value is a random variable on the unit interval whose realizations can be compared to a cutoff α to generate a decision rule with known error rates under the model and specific alternatives. It is commonly assumed that realizations of such decision P‐values always correspond to divergence P‐values. But this need not be so: Decision P‐values can violate intuitive single‐sample coherence criteria where divergence P‐values do not. It is thus argued that divergence and decision P‐values should be carefully distinguished in teaching, and that divergence P‐values are the relevant choice when the analysis goal is to summarize evidence rather than implement a decision rule.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":"50 1","pages":"54 - 88"},"PeriodicalIF":0.8000,"publicationDate":"2022-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scandinavian Journal of Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1111/sjos.12625","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 6

Abstract

There are two distinct definitions of “P‐value” for evaluating a proposed hypothesis or model for the process generating an observed dataset. The original definition starts with a measure of the divergence of the dataset from what was expected under the model, such as a sum of squares or a deviance statistic. A P‐value is then the ordinal location of the measure in a reference distribution computed from the model and the data, and is treated as a unit‐scaled index of compatibility between the data and the model. In the other definition, a P‐value is a random variable on the unit interval whose realizations can be compared to a cutoff α to generate a decision rule with known error rates under the model and specific alternatives. It is commonly assumed that realizations of such decision P‐values always correspond to divergence P‐values. But this need not be so: Decision P‐values can violate intuitive single‐sample coherence criteria where divergence P‐values do not. It is thus argued that divergence and decision P‐values should be carefully distinguished in teaching, and that divergence P‐values are the relevant choice when the analysis goal is to summarize evidence rather than implement a decision rule.
分歧与决策P值:一个值得在理论和实践中做出的区分;或者,分歧P值如何衡量证据,即使决策P值没有
“P值”有两种不同的定义,用于评估生成观测数据集的过程中提出的假设或模型。最初的定义从数据集偏离模型预期的程度开始,比如平方和或偏差统计量。然后,P值是根据模型和数据计算的参考分布中度量的顺序位置,并被视为数据和模型之间的单位比例兼容性指数。在另一种定义中,P值是单位区间上的随机变量,其实现可以与截断α相比较,以生成在模型和特定替代方案下具有已知错误率的决策规则。通常假设这种决策P值的实现总是对应于发散P值。但事实并非如此:决策P值可能违反直观的单样本相干性准则,而散度P值则不会。因此,在教学中应该仔细区分发散P值和决策P值,当分析目标是总结证据而不是执行决策规则时,发散P值是相关的选择。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Scandinavian Journal of Statistics
Scandinavian Journal of Statistics 数学-统计学与概率论
CiteScore
1.80
自引率
0.00%
发文量
61
审稿时长
6-12 weeks
期刊介绍: The Scandinavian Journal of Statistics is internationally recognised as one of the leading statistical journals in the world. It was founded in 1974 by four Scandinavian statistical societies. Today more than eighty per cent of the manuscripts are submitted from outside Scandinavia. It is an international journal devoted to reporting significant and innovative original contributions to statistical methodology, both theory and applications. The journal specializes in statistical modelling showing particular appreciation of the underlying substantive research problems. The emergence of specialized methods for analysing longitudinal and spatial data is just one example of an area of important methodological development in which the Scandinavian Journal of Statistics has a particular niche.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信