{"title":"席林对评论的回应","authors":"J. Bartroff, G. Lorden, Lijia Wang","doi":"10.1080/00031305.2023.2205455","DOIUrl":null,"url":null,"abstract":"We appreciate the recent paper of Schilling and Stanley (2022, hereafter SS) on confidence intervals for the hypergeometric being brought to our attention, which we were not aware of while preparing our paper (Bartroff, Lorden, and Wang 2022, hereafter BLW) on that subject. Although there are commonalities between the two approaches, there are some important distinctions that we highlight here. Following those papers’ notations, below we denote the confidence intervals for the hypergeometric success parameter based on sample size n and population size N by LCO for SS, and C∗ for BLW. In the numerical examples below, LCO (github.com/mfschilling/ HGCIs) and C∗ (github.com/bartroff792/hyper) were computed using the respective authors’ publicly available R code, running on the same computer. Computational time. LCO and C∗ differ drastically in the amount of time required to compute them. Figure 1 shows the computational time of LCO and C∗ for α = 0.05, N = 200, 400, . . . , 1000, and n = N/2. For example, for N = 1000 the computational time of LCO exceeds 100 min whereas C∗ requires roughly 1/10th of a second (0.002 min). In further numerical comparisons not included here, we found this relationship to be common for moderate to large values of the sample and population sizes, n and N. This may be due to the algorithm for computing LCO which calls for searching among all acceptance functions of minimal span (SS, p. 37). Provable optimality. SS contains two proofs, one in the Appendix of a basic result about the hypergeometric parameters, and one in the main text of the paper’s only theorem (SS, p. 33) which is a general result that size-optimal hypergeometric acceptance sets are inverted to yield size-optimal confidence “intervals.” However, not all inverted acceptance sets will yield proper intervals, and in practice one often ends up with noninterval confidence sets, for example, intervals with “gaps.” This occurs when the endpoint sequences of the acceptance intervals being inverted are non-monotonic, or themselves have gaps. SS address this by modifying their proposal in this situation to mimic a method of Schilling and Doi (2014) developed for the Binomial distribution. SS (pp. 36–37) write, Where there is a need to resolve a gap, in which case the minimal span acceptance function that causes the gap is replaced with the one having the","PeriodicalId":342642,"journal":{"name":"The American Statistician","volume":"188 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Response to Comment by Schilling\",\"authors\":\"J. Bartroff, G. Lorden, Lijia Wang\",\"doi\":\"10.1080/00031305.2023.2205455\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We appreciate the recent paper of Schilling and Stanley (2022, hereafter SS) on confidence intervals for the hypergeometric being brought to our attention, which we were not aware of while preparing our paper (Bartroff, Lorden, and Wang 2022, hereafter BLW) on that subject. Although there are commonalities between the two approaches, there are some important distinctions that we highlight here. Following those papers’ notations, below we denote the confidence intervals for the hypergeometric success parameter based on sample size n and population size N by LCO for SS, and C∗ for BLW. In the numerical examples below, LCO (github.com/mfschilling/ HGCIs) and C∗ (github.com/bartroff792/hyper) were computed using the respective authors’ publicly available R code, running on the same computer. Computational time. LCO and C∗ differ drastically in the amount of time required to compute them. Figure 1 shows the computational time of LCO and C∗ for α = 0.05, N = 200, 400, . . . , 1000, and n = N/2. For example, for N = 1000 the computational time of LCO exceeds 100 min whereas C∗ requires roughly 1/10th of a second (0.002 min). In further numerical comparisons not included here, we found this relationship to be common for moderate to large values of the sample and population sizes, n and N. This may be due to the algorithm for computing LCO which calls for searching among all acceptance functions of minimal span (SS, p. 37). Provable optimality. SS contains two proofs, one in the Appendix of a basic result about the hypergeometric parameters, and one in the main text of the paper’s only theorem (SS, p. 33) which is a general result that size-optimal hypergeometric acceptance sets are inverted to yield size-optimal confidence “intervals.” However, not all inverted acceptance sets will yield proper intervals, and in practice one often ends up with noninterval confidence sets, for example, intervals with “gaps.” This occurs when the endpoint sequences of the acceptance intervals being inverted are non-monotonic, or themselves have gaps. SS address this by modifying their proposal in this situation to mimic a method of Schilling and Doi (2014) developed for the Binomial distribution. SS (pp. 36–37) write, Where there is a need to resolve a gap, in which case the minimal span acceptance function that causes the gap is replaced with the one having the\",\"PeriodicalId\":342642,\"journal\":{\"name\":\"The American Statistician\",\"volume\":\"188 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The American Statistician\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/00031305.2023.2205455\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The American Statistician","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/00031305.2023.2205455","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
我们感谢最近Schilling和Stanley(2022,下文简称SS)关于超几何置信区间的论文引起了我们的注意,这是我们在准备关于该主题的论文(Bartroff, Lorden, and Wang 2022,下文简称BLW)时没有意识到的。尽管这两种方法之间存在共性,但我们在这里强调一些重要的区别。根据这些论文的注释,下面我们用LCO表示基于样本大小n和总体大小n的超几何成功参数的置信区间,LCO表示SS, C *表示BLW。在下面的数值示例中,LCO (github.com/mfschilling/ HGCIs)和C * (github.com/bartroff792/hyper)是使用各自作者在同一台计算机上运行的公开可用的R代码计算的。计算时间。LCO和C *在计算它们所需的时间上差别很大。图1显示了当α = 0.05, N = 200,400,…时LCO和C *的计算时间。, 1000, n = n /2。例如,当N = 1000时,LCO的计算时间超过100分钟,而C *大约需要1/10秒(0.002分钟)。在本文未包括的进一步数值比较中,我们发现这种关系对于样本和总体大小(n和n)的中大值是常见的。这可能是由于计算LCO的算法要求在最小跨度的所有可接受函数中进行搜索(SS,第37页)。可证明的最优。SS包含两个证明,一个在关于超几何参数的一个基本结果的附录中,另一个在本文唯一定理(SS,第33页)的正文中,该定理是大小最优的超几何可接受集被反转以产生大小最优的置信“区间”的一般结果。然而,并不是所有的反向接受集都会产生合适的区间,在实践中,人们经常会得到非区间置信集,例如,具有“间隙”的区间。当被反转的接受区间的端点序列是非单调的,或者其本身有间隙时,就会发生这种情况。SS通过修改他们在这种情况下的建议来解决这个问题,以模仿Schilling和Doi(2014)为二项分布开发的方法。SS(第36-37页)写道,当需要解决缺口时,在这种情况下,导致缺口的最小跨度接受函数被具有
We appreciate the recent paper of Schilling and Stanley (2022, hereafter SS) on confidence intervals for the hypergeometric being brought to our attention, which we were not aware of while preparing our paper (Bartroff, Lorden, and Wang 2022, hereafter BLW) on that subject. Although there are commonalities between the two approaches, there are some important distinctions that we highlight here. Following those papers’ notations, below we denote the confidence intervals for the hypergeometric success parameter based on sample size n and population size N by LCO for SS, and C∗ for BLW. In the numerical examples below, LCO (github.com/mfschilling/ HGCIs) and C∗ (github.com/bartroff792/hyper) were computed using the respective authors’ publicly available R code, running on the same computer. Computational time. LCO and C∗ differ drastically in the amount of time required to compute them. Figure 1 shows the computational time of LCO and C∗ for α = 0.05, N = 200, 400, . . . , 1000, and n = N/2. For example, for N = 1000 the computational time of LCO exceeds 100 min whereas C∗ requires roughly 1/10th of a second (0.002 min). In further numerical comparisons not included here, we found this relationship to be common for moderate to large values of the sample and population sizes, n and N. This may be due to the algorithm for computing LCO which calls for searching among all acceptance functions of minimal span (SS, p. 37). Provable optimality. SS contains two proofs, one in the Appendix of a basic result about the hypergeometric parameters, and one in the main text of the paper’s only theorem (SS, p. 33) which is a general result that size-optimal hypergeometric acceptance sets are inverted to yield size-optimal confidence “intervals.” However, not all inverted acceptance sets will yield proper intervals, and in practice one often ends up with noninterval confidence sets, for example, intervals with “gaps.” This occurs when the endpoint sequences of the acceptance intervals being inverted are non-monotonic, or themselves have gaps. SS address this by modifying their proposal in this situation to mimic a method of Schilling and Doi (2014) developed for the Binomial distribution. SS (pp. 36–37) write, Where there is a need to resolve a gap, in which case the minimal span acceptance function that causes the gap is replaced with the one having the