均值的连续分层推断

Jacob V. Spertus, Mayuri Sridhar, Philip B. Stark
{"title":"均值的连续分层推断","authors":"Jacob V. Spertus, Mayuri Sridhar, Philip B. Stark","doi":"arxiv-2409.06680","DOIUrl":null,"url":null,"abstract":"We develop conservative tests for the mean of a bounded population using data\nfrom a stratified sample. The sample may be drawn sequentially, with or without\nreplacement. The tests are \"anytime valid,\" allowing optional stopping and\ncontinuation in each stratum. We call this combination of properties\nsequential, finite-sample, nonparametric validity. The methods express a\nhypothesis about the population mean as a union of intersection hypotheses\ndescribing within-stratum means. They test each intersection hypothesis using\nindependent test supermartingales (TSMs) combined across strata by\nmultiplication. The $P$-value of the global null hypothesis is then the maximum\n$P$-value of any intersection hypothesis in the union. This approach has three\nprimary moving parts: (i) the rule for deciding which stratum to draw from next\nto test each intersection null, given the sample so far; (ii) the form of the\nTSM for each null in each stratum; and (iii) the method of combining evidence\nacross strata. These choices interact. We examine the performance of a variety\nof rules with differing computational complexity. Approximately optimal methods\nhave a prohibitive computational cost, while naive rules may be inconsistent --\nthey will never reject for some alternative populations, no matter how large\nthe sample. We present a method that is statistically comparable to optimal\nmethods in examples where optimal methods are computable, but computationally\ntractable for arbitrarily many strata. In numerical examples its expected\nsample size is substantially smaller than that of previous methods.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"42 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sequential stratified inference for the mean\",\"authors\":\"Jacob V. Spertus, Mayuri Sridhar, Philip B. Stark\",\"doi\":\"arxiv-2409.06680\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We develop conservative tests for the mean of a bounded population using data\\nfrom a stratified sample. The sample may be drawn sequentially, with or without\\nreplacement. The tests are \\\"anytime valid,\\\" allowing optional stopping and\\ncontinuation in each stratum. We call this combination of properties\\nsequential, finite-sample, nonparametric validity. The methods express a\\nhypothesis about the population mean as a union of intersection hypotheses\\ndescribing within-stratum means. They test each intersection hypothesis using\\nindependent test supermartingales (TSMs) combined across strata by\\nmultiplication. The $P$-value of the global null hypothesis is then the maximum\\n$P$-value of any intersection hypothesis in the union. This approach has three\\nprimary moving parts: (i) the rule for deciding which stratum to draw from next\\nto test each intersection null, given the sample so far; (ii) the form of the\\nTSM for each null in each stratum; and (iii) the method of combining evidence\\nacross strata. These choices interact. We examine the performance of a variety\\nof rules with differing computational complexity. Approximately optimal methods\\nhave a prohibitive computational cost, while naive rules may be inconsistent --\\nthey will never reject for some alternative populations, no matter how large\\nthe sample. We present a method that is statistically comparable to optimal\\nmethods in examples where optimal methods are computable, but computationally\\ntractable for arbitrarily many strata. In numerical examples its expected\\nsample size is substantially smaller than that of previous methods.\",\"PeriodicalId\":501425,\"journal\":{\"name\":\"arXiv - STAT - Methodology\",\"volume\":\"42 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Methodology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.06680\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

我们利用分层抽样的数据,对有界群体的均值进行保守检验。样本可以连续抽取,也可以不替换。检验是 "随时有效 "的,允许在每个分层中选择停止和继续。我们把这种特性组合称为序列、有限样本、非参数有效性。这些方法将关于总体均值的假设表达为描述层内均值的交叉假设的结合。它们使用通过乘法跨层组合的独立检验超马尔廷公式(TSM)来检验每个交集假设。然后,全局零假设的 P$ 值就是联盟中任何交叉假设的最大 P$ 值。这种方法有三个主要的活动部分:(i) 根据迄今为止的样本情况,决定下一步从哪个层抽取样本来检验每个交叉零假设的规则;(ii) 每个层中每个零假设的全局矩阵形式;(iii) 组合跨层证据的方法。这些选择是相互影响的。我们研究了计算复杂度不同的各种规则的性能。近似最优方法的计算成本过高,而天真规则则可能不一致--无论样本量有多大,它们都不会拒绝某些替代人群。在最优方法可计算的例子中,我们提出了一种在统计上可与最优方法相媲美的方法,但对于任意多的分层来说,这种方法在计算上非常困难。在数值例子中,它的预期样本量大大小于以前的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sequential stratified inference for the mean
We develop conservative tests for the mean of a bounded population using data from a stratified sample. The sample may be drawn sequentially, with or without replacement. The tests are "anytime valid," allowing optional stopping and continuation in each stratum. We call this combination of properties sequential, finite-sample, nonparametric validity. The methods express a hypothesis about the population mean as a union of intersection hypotheses describing within-stratum means. They test each intersection hypothesis using independent test supermartingales (TSMs) combined across strata by multiplication. The $P$-value of the global null hypothesis is then the maximum $P$-value of any intersection hypothesis in the union. This approach has three primary moving parts: (i) the rule for deciding which stratum to draw from next to test each intersection null, given the sample so far; (ii) the form of the TSM for each null in each stratum; and (iii) the method of combining evidence across strata. These choices interact. We examine the performance of a variety of rules with differing computational complexity. Approximately optimal methods have a prohibitive computational cost, while naive rules may be inconsistent -- they will never reject for some alternative populations, no matter how large the sample. We present a method that is statistically comparable to optimal methods in examples where optimal methods are computable, but computationally tractable for arbitrarily many strata. In numerical examples its expected sample size is substantially smaller than that of previous methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信