{"title":"The Variance of the Number of Effects in an Epidemiological Cohort - The Role of Dose Uncertainty","authors":"G. Miller","doi":"10.2174/1874297100801010048","DOIUrl":null,"url":null,"abstract":"Two basic formulas, for the mean and variance of the number of effects in an epidemiological cohort, are de- rived. The formula for variance shows \"extra-binomial variation\" or \"overdispersion\" when there is correlated uncertainty of the probability of an effect. The formulas were validated by a numerical Monte Carlo study. The method of including \"epistemic\" uncertainty discussed by Hofer (E. Hofer, Health Physics, 2007) is generalized to include separately uncer- tainty from a Bayesian posterior distribution when the prior is known, and uncertainty of the prior. In this note, two basic formulas, for the mean and vari- ance of the number of effects in an epidemiological cohort, are derived. It is conventionally assumed that the number of effects has either a Poisson or binomial distribution (1). The formula for the variance given here reduces to the binomial result in the case of no correlations of the probability of an effect, but shows \"extra-binomial variation\" or \"overdisper- sion\" when there are correlations. In practice, the variance could be calculated using the method advocated by Hofer (2), where, for each individual in an epidemiological cohort, some number j = 1…M of al- ternate realizations of the dose and hence the probability of an effect, taking into account possible correlations, are gen- erated using Monte Carlo. This method is generalized here to include separately uncertainty from a Bayesian posterior distribution when the prior is known, and uncertainties caused by lack of knowledge of the prior. This is because, within a linear dose-effect response model, the average num- ber of effects is proportional to the posterior-average- collec- tive dose, and the important uncertainty is that of the poste- rior-average-collective dose caused by lack of knowledge of the prior.","PeriodicalId":87834,"journal":{"name":"The open epidemiology journal","volume":"1 1","pages":"48-52"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The open epidemiology journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1874297100801010048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Two basic formulas, for the mean and variance of the number of effects in an epidemiological cohort, are de- rived. The formula for variance shows "extra-binomial variation" or "overdispersion" when there is correlated uncertainty of the probability of an effect. The formulas were validated by a numerical Monte Carlo study. The method of including "epistemic" uncertainty discussed by Hofer (E. Hofer, Health Physics, 2007) is generalized to include separately uncer- tainty from a Bayesian posterior distribution when the prior is known, and uncertainty of the prior. In this note, two basic formulas, for the mean and vari- ance of the number of effects in an epidemiological cohort, are derived. It is conventionally assumed that the number of effects has either a Poisson or binomial distribution (1). The formula for the variance given here reduces to the binomial result in the case of no correlations of the probability of an effect, but shows "extra-binomial variation" or "overdisper- sion" when there are correlations. In practice, the variance could be calculated using the method advocated by Hofer (2), where, for each individual in an epidemiological cohort, some number j = 1…M of al- ternate realizations of the dose and hence the probability of an effect, taking into account possible correlations, are gen- erated using Monte Carlo. This method is generalized here to include separately uncertainty from a Bayesian posterior distribution when the prior is known, and uncertainties caused by lack of knowledge of the prior. This is because, within a linear dose-effect response model, the average num- ber of effects is proportional to the posterior-average- collec- tive dose, and the important uncertainty is that of the poste- rior-average-collective dose caused by lack of knowledge of the prior.
推导了流行病学队列中效应数的均值和方差的两个基本公式。当影响的概率存在相关的不确定性时,方差公式显示出“额外二项变异”或“过度分散”。通过蒙特卡罗数值模拟验证了公式的正确性。Hofer (E. Hofer, Health Physics, 2007)讨论的包含“认知”不确定性的方法被推广为分别包括先验已知时贝叶斯后验分布的不确定性和先验的不确定性。在本文中,推导了流行病学队列中效应数的均值和方差的两个基本公式。通常假设效应的数量具有泊松分布或二项分布(1)。在效应的概率没有相关性的情况下,这里给出的方差公式简化为二项结果,但在存在相关性时显示“超二项变化”或“过分散”。在实践中,方差可以使用Hofer(2)提倡的方法来计算,其中,对于流行病学队列中的每个个体,剂量的交替实现的一些数字j = 1…M,因此考虑到可能的相关性,使用蒙特卡罗生成效应的概率。这种方法在这里进行了推广,包括先验已知时贝叶斯后验分布的不确定性,以及由于缺乏先验知识而引起的不确定性。这是因为,在线性剂量效应响应模型中,效应的平均数目与后平均集体剂量成正比,而重要的不确定性是由于缺乏对前平均集体剂量的了解而导致的后平均集体剂量的不确定性。