作者对Mishra等人在2021年6月11日皇家统计学会关于COVID-19传播的专题会议第二届会议上讨论的“英国地方当局的COVID-19模型”的答复

IF 1.5 3区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of the Royal Statistical Society Series A-Statistics in Society Pub Date : 2022-12-08 DOI:10.1111/rssa.12977

Swapnil Mishra, James A. Scott, Daniel J. Laydon, Harrison Zhu, Neil M. Ferguson, Samir Bhatt, Seth Flaxman, Axel Gandy

{"title":"作者对Mishra等人在2021年6月11日皇家统计学会关于COVID-19传播的专题会议第二届会议上讨论的“英国地方当局的COVID-19模型”的答复","authors":"Swapnil Mishra, James A. Scott, Daniel J. Laydon, Harrison Zhu, Neil M. Ferguson, Samir Bhatt, Seth Flaxman, Axel Gandy","doi":"10.1111/rssa.12977","DOIUrl":null,"url":null,"abstract":"We are very grateful for the interesting and constructive comments about the papers discussed in this meeting. We will address the points pertaining to our paper.Professor Gibson and Professor Nason stress that our model does not have NPIs or other information as covariates. We agree that ideally, one would like to do this. However, this conflicts with another goal of our model—to provide estimates on a very regular basis. Given the rapid decision-making and implementation of new measures, that varied substantially across the United Kingdom, often without exact precedent, it would have meant frequent adjustment of the model and collection and verification of, for example, NPIs for almost 400 areas on a daily basis, making it almost an implausible task without substantial time-commitment.Professor Gibson also remarks that our paper does not show prior-predictive checks to validate the model—we have omitted more detailed model checks due to space constraints. Our R package Epidemia has a full suite to do model checks and we have used them to verify and tune our models. We do agree with Professor Gibson that individual-based models can be more useful than aggregate models. However, constraints around data availability and compute time makes running individual models on daily basis an unattainable task. Our main objective behind the framework was to have something that can be updated on an almost daily basis, so we opted for simplicity.Professor Nason raises the point that there are instances of our estimates of <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mi>R</mi>\n </mrow>\n <mrow>\n <mi>t</mi>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {R}_t $$</annotation>\n </semantics></math> projections are not overlapping with estimates of <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mi>R</mi>\n </mrow>\n <mrow>\n <mi>t</mi>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {R}_t $$</annotation>\n </semantics></math> by Teh et al. in the same time period and how health officials should react to this. As Professor Nason, points out, it is not surprising that with models with different assumptions, sometimes conflicting estimates arise. This may actually be an advantage. If it is well understood how models differ this gives a more varied understanding of the epidemic in these places—for example, one model which relies more on spatial correlation would assume that a local outbreak will start spreading whereas another model with less spatial correlation, such as ours, might not predict this. Model uncertainty can be evident from results from different models.As Professor Nason points out, we could indeed have chosen other models for the reproduction number than a weekly random walk. We experimented with some of these, for example splines, but in the end, we settled on the random walk, as it proved to be more stable than other approaches, and as it does not project any trends in changes in <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mi>R</mi>\n </mrow>\n <mrow>\n <mi>t</mi>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {R}_t $$</annotation>\n </semantics></math> forward in time, which could have lead to unrealistic values of <math>\n <semantics>\n <mrow>\n <msub>\n <mrow>\n <mi>R</mi>\n </mrow>\n <mrow>\n <mi>t</mi>\n </mrow>\n </msub>\n </mrow>\n <annotation>$$ {R}_t $$</annotation>\n </semantics></math>, particularly around change points of the epidemic.Dr Jewell raises issues around not having spatial relationships encoded among local areas and using a fixed generation distribution throughout the period. We agree that an ideal model should account for spatial dependence in a realistic way. By not incorporating it we gained at least two advantages. First, easier computational tractability, which allowed us to update the model more frequently and thus produce more current updates. Secondly, and maybe more importantly, it allowed a local ‘accountability’—projections were based mainly on developments within that region, and thus the estimates cannot simply be discounted in one area by saying it only depends on the developments in neighbouring regions. During this epidemic, there were instances of local outbreaks or local reductions that did not, or not quickly, affect neighbouring areas. We experimented with mobility data ourselves, but we chose not to use it, as, in our experience, it tended to large swings in the estimates that were not necessarily reflected in the case data later on, and as mobility data has been, at least when we developed the model, only been available with substantial delays. As far as the use of predetermined generation distribution is concerned, we agree a model that can account for changing generation distribution would be ideal. However, this would again require much more computational expensive frameworks, limiting their use for daily updates. Additionally, identifiability of the parameters would be a major concern—for which we would at least require detailed incidence data not just raw daily case counts.Overall, we believe that our model has been a useful tool to several decision-makers during the epidemic. To be prepared for possible future epidemics or a future pandemic, we agree with the discussants that it would be beneficial to develop a detailed and adaptable models that would be able to follow the rapidly changing data availability throughout an epidemic and to address the rapidly changing questions that arise (e.g. effect of vaccinations, age-dependence, variants, <math>\n <semantics>\n <mrow>\n <mi>…</mi>\n <mo>)</mo>\n </mrow>\n <annotation>$$ \\dots \\Big) $$</annotation>\n </semantics></math>. These models will most likely require advances in computational methods to be helpful for the rapid decisions needed during an epidemic.","PeriodicalId":49983,"journal":{"name":"Journal of the Royal Statistical Society Series A-Statistics in Society","volume":"185 S1","pages":"S110-S111"},"PeriodicalIF":1.5000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssa.12977","citationCount":"0","resultStr":"{\"title\":\"Authors' reply to the discussion of ‘A COVID-19 Model for Local Authorities of the United Kingdom’ by Mishra et al. in Session 2 of the Royal Statistical Society's Special Topic Meeting on COVID-19 transmission: 11 June 2021\",\"authors\":\"Swapnil Mishra, James A. Scott, Daniel J. Laydon, Harrison Zhu, Neil M. Ferguson, Samir Bhatt, Seth Flaxman, Axel Gandy\",\"doi\":\"10.1111/rssa.12977\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We are very grateful for the interesting and constructive comments about the papers discussed in this meeting. We will address the points pertaining to our paper.Professor Gibson and Professor Nason stress that our model does not have NPIs or other information as covariates. We agree that ideally, one would like to do this. However, this conflicts with another goal of our model—to provide estimates on a very regular basis. Given the rapid decision-making and implementation of new measures, that varied substantially across the United Kingdom, often without exact precedent, it would have meant frequent adjustment of the model and collection and verification of, for example, NPIs for almost 400 areas on a daily basis, making it almost an implausible task without substantial time-commitment.Professor Gibson also remarks that our paper does not show prior-predictive checks to validate the model—we have omitted more detailed model checks due to space constraints. Our R package Epidemia has a full suite to do model checks and we have used them to verify and tune our models. We do agree with Professor Gibson that individual-based models can be more useful than aggregate models. However, constraints around data availability and compute time makes running individual models on daily basis an unattainable task. Our main objective behind the framework was to have something that can be updated on an almost daily basis, so we opted for simplicity.Professor Nason raises the point that there are instances of our estimates of <math>\\n <semantics>\\n <mrow>\\n <msub>\\n <mrow>\\n <mi>R</mi>\\n </mrow>\\n <mrow>\\n <mi>t</mi>\\n </mrow>\\n </msub>\\n </mrow>\\n <annotation>$$ {R}_t $$</annotation>\\n </semantics></math> projections are not overlapping with estimates of <math>\\n <semantics>\\n <mrow>\\n <msub>\\n <mrow>\\n <mi>R</mi>\\n </mrow>\\n <mrow>\\n <mi>t</mi>\\n </mrow>\\n </msub>\\n </mrow>\\n <annotation>$$ {R}_t $$</annotation>\\n </semantics></math> by Teh et al. in the same time period and how health officials should react to this. As Professor Nason, points out, it is not surprising that with models with different assumptions, sometimes conflicting estimates arise. This may actually be an advantage. If it is well understood how models differ this gives a more varied understanding of the epidemic in these places—for example, one model which relies more on spatial correlation would assume that a local outbreak will start spreading whereas another model with less spatial correlation, such as ours, might not predict this. Model uncertainty can be evident from results from different models.As Professor Nason points out, we could indeed have chosen other models for the reproduction number than a weekly random walk. We experimented with some of these, for example splines, but in the end, we settled on the random walk, as it proved to be more stable than other approaches, and as it does not project any trends in changes in <math>\\n <semantics>\\n <mrow>\\n <msub>\\n <mrow>\\n <mi>R</mi>\\n </mrow>\\n <mrow>\\n <mi>t</mi>\\n </mrow>\\n </msub>\\n </mrow>\\n <annotation>$$ {R}_t $$</annotation>\\n </semantics></math> forward in time, which could have lead to unrealistic values of <math>\\n <semantics>\\n <mrow>\\n <msub>\\n <mrow>\\n <mi>R</mi>\\n </mrow>\\n <mrow>\\n <mi>t</mi>\\n </mrow>\\n </msub>\\n </mrow>\\n <annotation>$$ {R}_t $$</annotation>\\n </semantics></math>, particularly around change points of the epidemic.Dr Jewell raises issues around not having spatial relationships encoded among local areas and using a fixed generation distribution throughout the period. We agree that an ideal model should account for spatial dependence in a realistic way. By not incorporating it we gained at least two advantages. First, easier computational tractability, which allowed us to update the model more frequently and thus produce more current updates. Secondly, and maybe more importantly, it allowed a local ‘accountability’—projections were based mainly on developments within that region, and thus the estimates cannot simply be discounted in one area by saying it only depends on the developments in neighbouring regions. During this epidemic, there were instances of local outbreaks or local reductions that did not, or not quickly, affect neighbouring areas. We experimented with mobility data ourselves, but we chose not to use it, as, in our experience, it tended to large swings in the estimates that were not necessarily reflected in the case data later on, and as mobility data has been, at least when we developed the model, only been available with substantial delays. As far as the use of predetermined generation distribution is concerned, we agree a model that can account for changing generation distribution would be ideal. However, this would again require much more computational expensive frameworks, limiting their use for daily updates. Additionally, identifiability of the parameters would be a major concern—for which we would at least require detailed incidence data not just raw daily case counts.Overall, we believe that our model has been a useful tool to several decision-makers during the epidemic. To be prepared for possible future epidemics or a future pandemic, we agree with the discussants that it would be beneficial to develop a detailed and adaptable models that would be able to follow the rapidly changing data availability throughout an epidemic and to address the rapidly changing questions that arise (e.g. effect of vaccinations, age-dependence, variants, <math>\\n <semantics>\\n <mrow>\\n <mi>…</mi>\\n <mo>)</mo>\\n </mrow>\\n <annotation>$$ \\\\dots \\\\Big) $$</annotation>\\n </semantics></math>. These models will most likely require advances in computational methods to be helpful for the rapid decisions needed during an epidemic.\",\"PeriodicalId\":49983,\"journal\":{\"name\":\"Journal of the Royal Statistical Society Series A-Statistics in Society\",\"volume\":\"185 S1\",\"pages\":\"S110-S111\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2022-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssa.12977\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Royal Statistical Society Series A-Statistics in Society\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/rssa.12977\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"SOCIAL SCIENCES, MATHEMATICAL METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Royal Statistical Society Series A-Statistics in Society","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/rssa.12977","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SOCIAL SCIENCES, MATHEMATICAL METHODS","Score":null,"Total":0}

引用次数: 0

摘要

我们自己试验了机动性数据，但我们选择不使用它，因为，根据我们的经验，它倾向于在估计中有很大的波动，而这并不一定反映在后来的案例数据中，而且由于机动性数据，至少当我们开发模型时，只能在很大的延迟下使用。就使用预定的发电分布而言，我们同意一个可以考虑变化的发电分布的模型是理想的。然而，这将再次需要计算成本更高的框架，限制了它们在日常更新中的使用。此外，参数的可识别性将是一个主要问题——为此，我们至少需要详细的发病率数据，而不仅仅是原始的每日病例数。总的来说，我们认为，在疫情期间，我们的模型对一些决策者来说是一个有用的工具。为了为未来可能出现的流行病或未来的大流行做好准备，我们同意讨论者的观点，即开发一种详细和适应性强的模型将是有益的，这种模型将能够在整个流行病期间跟踪快速变化的可用数据，并解决出现的快速变化的问题(例如，疫苗接种的影响、年龄依赖性、变异、…)$$ \dots \Big) $$。这些模型很可能需要在计算方法方面取得进展，以便有助于在流行病期间所需的快速决策。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Authors' reply to the discussion of ‘A COVID-19 Model for Local Authorities of the United Kingdom’ by Mishra et al. in Session 2 of the Royal Statistical Society's Special Topic Meeting on COVID-19 transmission: 11 June 2021

We are very grateful for the interesting and constructive comments about the papers discussed in this meeting. We will address the points pertaining to our paper.

Professor Gibson and Professor Nason stress that our model does not have NPIs or other information as covariates. We agree that ideally, one would like to do this. However, this conflicts with another goal of our model—to provide estimates on a very regular basis. Given the rapid decision-making and implementation of new measures, that varied substantially across the United Kingdom, often without exact precedent, it would have meant frequent adjustment of the model and collection and verification of, for example, NPIs for almost 400 areas on a daily basis, making it almost an implausible task without substantial time-commitment.

Professor Gibson also remarks that our paper does not show prior-predictive checks to validate the model—we have omitted more detailed model checks due to space constraints. Our R package Epidemia has a full suite to do model checks and we have used them to verify and tune our models. We do agree with Professor Gibson that individual-based models can be more useful than aggregate models. However, constraints around data availability and compute time makes running individual models on daily basis an unattainable task. Our main objective behind the framework was to have something that can be updated on an almost daily basis, so we opted for simplicity.

Professor Nason raises the point that there are instances of our estimates of $R_{t}$ projections are not overlapping with estimates of $R_{t}$ by Teh et al. in the same time period and how health officials should react to this. As Professor Nason, points out, it is not surprising that with models with different assumptions, sometimes conflicting estimates arise. This may actually be an advantage. If it is well understood how models differ this gives a more varied understanding of the epidemic in these places—for example, one model which relies more on spatial correlation would assume that a local outbreak will start spreading whereas another model with less spatial correlation, such as ours, might not predict this. Model uncertainty can be evident from results from different models.

As Professor Nason points out, we could indeed have chosen other models for the reproduction number than a weekly random walk. We experimented with some of these, for example splines, but in the end, we settled on the random walk, as it proved to be more stable than other approaches, and as it does not project any trends in changes in $R_{t}$ forward in time, which could have lead to unrealistic values of $R_{t}$ , particularly around change points of the epidemic.

Dr Jewell raises issues around not having spatial relationships encoded among local areas and using a fixed generation distribution throughout the period. We agree that an ideal model should account for spatial dependence in a realistic way. By not incorporating it we gained at least two advantages. First, easier computational tractability, which allowed us to update the model more frequently and thus produce more current updates. Secondly, and maybe more importantly, it allowed a local ‘accountability’—projections were based mainly on developments within that region, and thus the estimates cannot simply be discounted in one area by saying it only depends on the developments in neighbouring regions. During this epidemic, there were instances of local outbreaks or local reductions that did not, or not quickly, affect neighbouring areas. We experimented with mobility data ourselves, but we chose not to use it, as, in our experience, it tended to large swings in the estimates that were not necessarily reflected in the case data later on, and as mobility data has been, at least when we developed the model, only been available with substantial delays. As far as the use of predetermined generation distribution is concerned, we agree a model that can account for changing generation distribution would be ideal. However, this would again require much more computational expensive frameworks, limiting their use for daily updates. Additionally, identifiability of the parameters would be a major concern—for which we would at least require detailed incidence data not just raw daily case counts.

Overall, we believe that our model has been a useful tool to several decision-makers during the epidemic. To be prepared for possible future epidemics or a future pandemic, we agree with the discussants that it would be beneficial to develop a detailed and adaptable models that would be able to follow the rapidly changing data availability throughout an epidemic and to address the rapidly changing questions that arise (e.g. effect of vaccinations, age-dependence, variants, $\dots)$ . These models will most likely require advances in computational methods to be helpful for the rapid decisions needed during an epidemic.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of the Royal Statistical Society Series A-Statistics in Society 数学-统计学与概率论

CiteScore

2.90

自引率

5.00%

发文量

136

审稿时长

>12 weeks

期刊介绍： Series A (Statistics in Society) publishes high quality papers that demonstrate how statistical thinking, design and analyses play a vital role in all walks of life and benefit society in general. There is no restriction on subject-matter: any interesting, topical and revelatory applications of statistics are welcome. For example, important applications of statistical and related data science methodology in medicine, business and commerce, industry, economics and finance, education and teaching, physical and biomedical sciences, the environment, the law, government and politics, demography, psychology, sociology and sport all fall within the journal''s remit. The journal is therefore aimed at a wide statistical audience and at professional statisticians in particular. Its emphasis is on well-written and clearly reasoned quantitative approaches to problems in the real world rather than the exposition of technical detail. Thus, although the methodological basis of papers must be sound and adequately explained, methodology per se should not be the main focus of a Series A paper. Of particular interest are papers on topical or contentious statistical issues, papers which give reviews or exposés of current statistical concerns and papers which demonstrate how appropriate statistical thinking has contributed to our understanding of important substantive questions. Historical, professional and biographical contributions are also welcome, as are discussions of methods of data collection and of ethical issues, provided that all such papers have substantial statistical relevance.