{"title":"基于部分观测到的二进制数据的两个比例之差的精确置信区间","authors":"Chongxiu Yu, Weizhen Wang, Zhongzhan Zhang","doi":"10.1002/sta4.631","DOIUrl":null,"url":null,"abstract":"In a matched pairs experiment, two binary variables are typically observed on all subjects in the experiment. However, when one of the variables is missing on some subjects, we have so called the partially observed binary data that consist of two parts: a multinomial from the subjects with a pair of observed variables and two independent binomials from the subjects with only one observed variable. The goal of this paper is to construct exact confidence intervals for the difference of two (success) proportions of the two binary variables. We first derive a new test by combining two score tests for the two parts of the data and invert it to an asymptotic confidence interval. Since asymptotic intervals do not achieve the nominal level, this interval and three other existing intervals are improved to be exact by the general <math altimg=\"urn:x-wiley:sta4:media:sta4631:sta4631-math-0001\" display=\"inline\" location=\"graphic/sta4631-math-0001.png\">\n<semantics>\n<mrow>\n<mi>h</mi>\n</mrow>\n$$ h $$</annotation>\n</semantics></math>-function method. We compare the infimum coverage probability and average interval length of these intervals and recommend the exact intervals that are improved from the newly proposed interval. Two real data sets are used to illustrate the intervals.","PeriodicalId":56159,"journal":{"name":"Stat","volume":"10 9","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2023-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exact confidence intervals for the difference of two proportions based on partially observed binary data\",\"authors\":\"Chongxiu Yu, Weizhen Wang, Zhongzhan Zhang\",\"doi\":\"10.1002/sta4.631\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In a matched pairs experiment, two binary variables are typically observed on all subjects in the experiment. However, when one of the variables is missing on some subjects, we have so called the partially observed binary data that consist of two parts: a multinomial from the subjects with a pair of observed variables and two independent binomials from the subjects with only one observed variable. The goal of this paper is to construct exact confidence intervals for the difference of two (success) proportions of the two binary variables. We first derive a new test by combining two score tests for the two parts of the data and invert it to an asymptotic confidence interval. Since asymptotic intervals do not achieve the nominal level, this interval and three other existing intervals are improved to be exact by the general <math altimg=\\\"urn:x-wiley:sta4:media:sta4631:sta4631-math-0001\\\" display=\\\"inline\\\" location=\\\"graphic/sta4631-math-0001.png\\\">\\n<semantics>\\n<mrow>\\n<mi>h</mi>\\n</mrow>\\n$$ h $$</annotation>\\n</semantics></math>-function method. We compare the infimum coverage probability and average interval length of these intervals and recommend the exact intervals that are improved from the newly proposed interval. Two real data sets are used to illustrate the intervals.\",\"PeriodicalId\":56159,\"journal\":{\"name\":\"Stat\",\"volume\":\"10 9\",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2023-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stat\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1002/sta4.631\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stat","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1002/sta4.631","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
摘要
在配对实验中,通常在所有实验对象上观察到两个二元变量。然而,当某些受试者缺少其中一个变量时,我们称之为部分观察到的二元数据,它由两部分组成:一个来自受试者的多项数据,其中包含一对观察到的变量;两个来自受试者的独立二项式数据,仅包含一个观察到的变量。本文的目的是为两个二元变量的两个(成功)比例之差构造精确的置信区间。我们首先通过结合数据的两个部分的两个分数检验来推导一个新的检验,并将其反演为一个渐近置信区间。由于渐近区间没有达到标称水平,因此该区间和其他三个现有区间通过一般h $$ h $$ -函数方法进行改进以达到精确。我们比较了这些区间的最小覆盖概率和平均区间长度,并推荐了从新提出的区间改进的精确区间。使用两个真实数据集来说明区间。
Exact confidence intervals for the difference of two proportions based on partially observed binary data
In a matched pairs experiment, two binary variables are typically observed on all subjects in the experiment. However, when one of the variables is missing on some subjects, we have so called the partially observed binary data that consist of two parts: a multinomial from the subjects with a pair of observed variables and two independent binomials from the subjects with only one observed variable. The goal of this paper is to construct exact confidence intervals for the difference of two (success) proportions of the two binary variables. We first derive a new test by combining two score tests for the two parts of the data and invert it to an asymptotic confidence interval. Since asymptotic intervals do not achieve the nominal level, this interval and three other existing intervals are improved to be exact by the general -function method. We compare the infimum coverage probability and average interval length of these intervals and recommend the exact intervals that are improved from the newly proposed interval. Two real data sets are used to illustrate the intervals.
StatDecision Sciences-Statistics, Probability and Uncertainty
CiteScore
1.10
自引率
0.00%
发文量
85
期刊介绍:
Stat is an innovative electronic journal for the rapid publication of novel and topical research results, publishing compact articles of the highest quality in all areas of statistical endeavour. Its purpose is to provide a means of rapid sharing of important new theoretical, methodological and applied research. Stat is a joint venture between the International Statistical Institute and Wiley-Blackwell.
Stat is characterised by:
• Speed - a high-quality review process that aims to reach a decision within 20 days of submission.
• Concision - a maximum article length of 10 pages of text, not including references.
• Supporting materials - inclusion of electronic supporting materials including graphs, video, software, data and images.
• Scope - addresses all areas of statistics and interdisciplinary areas.
Stat is a scientific journal for the international community of statisticians and researchers and practitioners in allied quantitative disciplines.