Jasmina Panovska-Griffiths' discussion contribution to papers in Session 3 of the Royal Statistical Society's special topic meeting on COVID-19 transmission: 11 June 2021

IF 1.6 3区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of the Royal Statistical Society Series A-Statistics in Society Pub Date : 2022-12-02 DOI:10.1111/rssa.12982

Jasmina Panovska-Griffiths

{"title":"Jasmina Panovska-Griffiths' discussion contribution to papers in Session 3 of the Royal Statistical Society's special topic meeting on COVID-19 transmission: 11 June 2021","authors":"Jasmina Panovska-Griffiths","doi":"10.1111/rssa.12982","DOIUrl":null,"url":null,"abstract":"The effective reproduction number R has been a headline epidemic metric since the onset of the COVID-19 pandemic. R measures the number of secondary infections arising from an existing infection. At the onset of a new disease, in a fully susceptible population, R is the basic reproduction number <math>\n <semantics>\n <mrow>\n <msub>\n <mi>R</mi>\n <mn>0</mn>\n </msub>\n </mrow>\n <annotation>$$ {R}_0 $$</annotation>\n </semantics></math> that describes the number of secondary infections stemming from an initial infected case. As the epidemic progresses, R reflects the number of secondary infections generated in a population comprising susceptible, exposed, infected, recovered and immune individuals. The growth rate r, another widely used epidemic metric throughout the COVID-19 pandemic, represents the rate at which the epidemic is growing during the exponential phase of epidemic growth. While R can be thought as reflective of the level of transmission, r can be thought as reflective of the transmission speed.Since the onset of the SARS-CoV-2 epidemic, epidemiological models, calibrated against data on infections, hospital admissions and occupancy and deaths related to COVID-19, have been widely used to generate outcome metrics such as R or r. These have been used to inform the status of the epidemic with R increasing above 1, and analogously, r above 0, suggesting that the epidemic is growing exponentially with the emerging virus spreading fast. Tracking whether R and r are crossing these thresholds can inform if the epidemic is in a growing or shrinking phase, or the impact of imposed control measures at the time. Generating such metrics across an ensemble of models—which may be different in methodology or on the data they use to fit against—allows a consensus value of R and r to be derived. The consensus value represents an average outcome across models, and taking a combination of models rather than one model derivative value, allows for uncertainty to be accounted for in the epidemic metrics.Generating consensus epidemic metrics from models, while useful in informing the epidemic status, has three challenges related to:Challenge 1: Understanding how to interpret R and r across different modelsChallenge 2: Understanding how R and r are statistically correlated within and across different modelsChallenge 3: Understanding whether R and r are the most reliable metrics as the epidemic progresses and different interventions are employedOn the first challenge, although R and r describe broadly similar model outcomes, their exact definition depends on the model structure. For example, in agent-based models (ABMs) R can be directly counted, in population-based SEIR-type models R is the largest eigenvalue of the generation matrix and hence represents a more abstract concept, while in non-mechanistic models R is typically calculated from the incidence level and the generation time of the circulating variant. In fact within models, r and R are related via the generation time of the epidemic: the longer the generation time and higher the epidemic growth rate, higher the value of R. Appreciating how R and r are generated and related within models is thus important when we compare these and use them as independent variables within a statistical model used to generate a consensus value.On the second challenge, we need to be clear on how related are the models and the data being used within models in the ensemble that is generating the consensus value. For example, when combining different models, some of which use different data streams or have different model structures, how does the combined estimate relatively weight their contribution? Or when generating a combined probability distribution, should we use the mean or the median as an average measure and what are the most relevant representative ranges to use: 10th to 90th, 5th or 95th or 25th to 75th percentiles? Finally, when combining model outcomes, what meta-analysis statistical approach do we use, for example, fixed or random effects models? It is important to be clear whether the generated consensus R and r are from statistically correlated model outcomes or data—if so the consensus value generated would be strongly weighted to those and statistical methods should be employed to prevent this and get a true representative consensus value.Finally, on the third challenge, should we in future consider additional metrics, and specifically account for hospitalisation rate in combination to growth rate (via r) and transmissibility of the different variants (via R)? And separately, should the gold-standard epidemic metrics be different if we are considering a number of local/regional epidemics that merge to produce a large national epidemic, compared to having a slower growing but geographically large epidemic? And should the consensus epidemic metrics be age-, settings- or variant-specific?It is important to address these, and similar challenges in generating epidemic metrics, as we continue utilising mathematical models to study and inform the epidemic status. There is ongoing work in my modelling team that is assessing how to robustly account for uncertainty—in developing models, in setting model parameters and in combining model outcomes—when generating a consensus epidemic metric such as R. Learning from the enormous amount of modelling development done at speed during the pandemic, and recognising the aspects that have flourished and can be readily used and those that need to be developed further and adapted to other pathogens, is crucial to be better prepared if another pandemic were to occur.","PeriodicalId":49983,"journal":{"name":"Journal of the Royal Statistical Society Series A-Statistics in Society","volume":"185 S1","pages":"S150-S151"},"PeriodicalIF":1.6000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssa.12982","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Royal Statistical Society Series A-Statistics in Society","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/rssa.12982","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SOCIAL SCIENCES, MATHEMATICAL METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

The effective reproduction number R has been a headline epidemic metric since the onset of the COVID-19 pandemic. R measures the number of secondary infections arising from an existing infection. At the onset of a new disease, in a fully susceptible population, R is the basic reproduction number $R_{0}$ that describes the number of secondary infections stemming from an initial infected case. As the epidemic progresses, R reflects the number of secondary infections generated in a population comprising susceptible, exposed, infected, recovered and immune individuals. The growth rate r, another widely used epidemic metric throughout the COVID-19 pandemic, represents the rate at which the epidemic is growing during the exponential phase of epidemic growth. While R can be thought as reflective of the level of transmission, r can be thought as reflective of the transmission speed.

Since the onset of the SARS-CoV-2 epidemic, epidemiological models, calibrated against data on infections, hospital admissions and occupancy and deaths related to COVID-19, have been widely used to generate outcome metrics such as R or r. These have been used to inform the status of the epidemic with R increasing above 1, and analogously, r above 0, suggesting that the epidemic is growing exponentially with the emerging virus spreading fast. Tracking whether R and r are crossing these thresholds can inform if the epidemic is in a growing or shrinking phase, or the impact of imposed control measures at the time. Generating such metrics across an ensemble of models—which may be different in methodology or on the data they use to fit against—allows a consensus value of R and r to be derived. The consensus value represents an average outcome across models, and taking a combination of models rather than one model derivative value, allows for uncertainty to be accounted for in the epidemic metrics.

Generating consensus epidemic metrics from models, while useful in informing the epidemic status, has three challenges related to:

Challenge 1: Understanding how to interpret R and r across different models

Challenge 2: Understanding how R and r are statistically correlated within and across different models

Challenge 3: Understanding whether R and r are the most reliable metrics as the epidemic progresses and different interventions are employed

On the first challenge, although R and r describe broadly similar model outcomes, their exact definition depends on the model structure. For example, in agent-based models (ABMs) R can be directly counted, in population-based SEIR-type models R is the largest eigenvalue of the generation matrix and hence represents a more abstract concept, while in non-mechanistic models R is typically calculated from the incidence level and the generation time of the circulating variant. In fact within models, r and R are related via the generation time of the epidemic: the longer the generation time and higher the epidemic growth rate, higher the value of R. Appreciating how R and r are generated and related within models is thus important when we compare these and use them as independent variables within a statistical model used to generate a consensus value.

On the second challenge, we need to be clear on how related are the models and the data being used within models in the ensemble that is generating the consensus value. For example, when combining different models, some of which use different data streams or have different model structures, how does the combined estimate relatively weight their contribution? Or when generating a combined probability distribution, should we use the mean or the median as an average measure and what are the most relevant representative ranges to use: 10th to 90th, 5th or 95th or 25th to 75th percentiles? Finally, when combining model outcomes, what meta-analysis statistical approach do we use, for example, fixed or random effects models? It is important to be clear whether the generated consensus R and r are from statistically correlated model outcomes or data—if so the consensus value generated would be strongly weighted to those and statistical methods should be employed to prevent this and get a true representative consensus value.

Finally, on the third challenge, should we in future consider additional metrics, and specifically account for hospitalisation rate in combination to growth rate (via r) and transmissibility of the different variants (via R)? And separately, should the gold-standard epidemic metrics be different if we are considering a number of local/regional epidemics that merge to produce a large national epidemic, compared to having a slower growing but geographically large epidemic? And should the consensus epidemic metrics be age-, settings- or variant-specific?

It is important to address these, and similar challenges in generating epidemic metrics, as we continue utilising mathematical models to study and inform the epidemic status. There is ongoing work in my modelling team that is assessing how to robustly account for uncertainty—in developing models, in setting model parameters and in combining model outcomes—when generating a consensus epidemic metric such as R. Learning from the enormous amount of modelling development done at speed during the pandemic, and recognising the aspects that have flourished and can be readily used and those that need to be developed further and adapted to other pathogens, is crucial to be better prepared if another pandemic were to occur.

查看原文本刊更多论文

Jasmina Panovska-Griffiths在2021年6月11日皇家统计学会关于COVID-19传播的专题会议第三次会议上对论文的讨论贡献

最后，关于第三个挑战，我们将来是否应该考虑其他指标，并具体考虑住院率与增长率(通过r)和不同变异的传播率(通过r)的结合?另外，如果我们考虑到一些地方/区域流行病合并形成一个全国性的大流行病，与一个增长较慢但地理范围较大的流行病相比，黄金标准流行病指标是否应该有所不同?共识的流行病指标应该是针对年龄、环境还是变异的?在我们继续利用数学模型研究和通报流行病状况的同时，必须解决这些问题，以及在制定流行病指标方面面临的类似挑战。我的建模团队正在进行的工作是评估如何在开发模型、设置模型参数和结合模型结果时，在生成共识流行病度量标准(如r)时，稳健地考虑不确定性。从大流行期间快速完成的大量建模开发中学习，并认识到哪些方面已经蓬勃发展并可以随时使用，哪些方面需要进一步开发并适应其他病原体。如果发生另一场大流行，做好准备至关重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of the Royal Statistical Society Series A-Statistics in Society 数学-统计学与概率论

CiteScore

2.90

自引率

5.00%

发文量

136

审稿时长

>12 weeks

期刊介绍： Series A (Statistics in Society) publishes high quality papers that demonstrate how statistical thinking, design and analyses play a vital role in all walks of life and benefit society in general. There is no restriction on subject-matter: any interesting, topical and revelatory applications of statistics are welcome. For example, important applications of statistical and related data science methodology in medicine, business and commerce, industry, economics and finance, education and teaching, physical and biomedical sciences, the environment, the law, government and politics, demography, psychology, sociology and sport all fall within the journal''s remit. The journal is therefore aimed at a wide statistical audience and at professional statisticians in particular. Its emphasis is on well-written and clearly reasoned quantitative approaches to problems in the real world rather than the exposition of technical detail. Thus, although the methodological basis of papers must be sound and adequately explained, methodology per se should not be the main focus of a Series A paper. Of particular interest are papers on topical or contentious statistical issues, papers which give reviews or exposés of current statistical concerns and papers which demonstrate how appropriate statistical thinking has contributed to our understanding of important substantive questions. Historical, professional and biographical contributions are also welcome, as are discussions of methods of data collection and of ethical issues, provided that all such papers have substantial statistical relevance.