Jasmina Panovska-Griffiths' discussion contribution to papers in Session 3 of the Royal Statistical Society's special topic meeting on COVID-19 transmission: 11 June 2021
IF 1.6 3区 数学Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS
{"title":"Jasmina Panovska-Griffiths' discussion contribution to papers in Session 3 of the Royal Statistical Society's special topic meeting on COVID-19 transmission: 11 June 2021","authors":"Jasmina Panovska-Griffiths","doi":"10.1111/rssa.12982","DOIUrl":null,"url":null,"abstract":"<p>The effective reproduction number <i>R</i> has been a headline epidemic metric since the onset of the COVID-19 pandemic. <i>R</i> measures the number of secondary infections arising from an existing infection. At the onset of a new disease, in a fully susceptible population, <i>R</i> is the basic reproduction number <math>\n <semantics>\n <mrow>\n <msub>\n <mi>R</mi>\n <mn>0</mn>\n </msub>\n </mrow>\n <annotation>$$ {R}_0 $$</annotation>\n </semantics></math> that describes the number of secondary infections stemming from an initial infected case. As the epidemic progresses, <i>R</i> reflects the number of secondary infections generated in a population comprising susceptible, exposed, infected, recovered and immune individuals. The growth rate <i>r</i>, another widely used epidemic metric throughout the COVID-19 pandemic, represents the rate at which the epidemic is growing during the exponential phase of epidemic growth. While <i>R</i> can be thought as reflective of the level of transmission, <i>r</i> can be thought as reflective of the transmission speed.</p><p>Since the onset of the SARS-CoV-2 epidemic, epidemiological models, calibrated against data on infections, hospital admissions and occupancy and deaths related to COVID-19, have been widely used to generate outcome metrics such as <i>R</i> or <i>r</i>. These have been used to inform the status of the epidemic with <i>R</i> increasing above 1, and analogously, <i>r</i> above 0, suggesting that the epidemic is growing exponentially with the emerging virus spreading fast. Tracking whether <i>R</i> and <i>r</i> are crossing these thresholds can inform if the epidemic is in a growing or shrinking phase, or the impact of imposed control measures at the time. Generating such metrics across an ensemble of models—which may be different in methodology or on the data they use to fit against—allows a consensus value of <i>R</i> and <i>r</i> to be derived. The consensus value represents an average outcome across models, and taking a combination of models rather than one model derivative value, allows for uncertainty to be accounted for in the epidemic metrics.</p><p>Generating consensus epidemic metrics from models, while useful in informing the epidemic status, has three challenges related to:</p><p>Challenge 1: Understanding how to interpret <i>R</i> and <i>r</i> across different models</p><p>Challenge 2: Understanding how <i>R</i> and <i>r</i> are statistically correlated within and across different models</p><p>Challenge 3: Understanding whether <i>R</i> and <i>r</i> are the most reliable metrics as the epidemic progresses and different interventions are employed</p><p>On the first challenge, although <i>R</i> and <i>r</i> describe broadly similar model outcomes, their exact definition depends on the model structure. For example, in agent-based models (ABMs) <i>R</i> can be directly counted, in population-based SEIR-type models <i>R</i> is the largest eigenvalue of the generation matrix and hence represents a more abstract concept, while in non-mechanistic models <i>R</i> is typically calculated from the incidence level and the generation time of the circulating variant. In fact within models, <i>r</i> and <i>R</i> are related via the generation time of the epidemic: the longer the generation time and higher the epidemic growth rate, higher the value of <i>R</i>. Appreciating how <i>R</i> and <i>r</i> are generated and related within models is thus important when we compare these and use them as independent variables within a statistical model used to generate a consensus value.</p><p>On the second challenge, we need to be clear on how related are the models and the data being used within models in the ensemble that is generating the consensus value. For example, when combining different models, some of which use different data streams or have different model structures, how does the combined estimate relatively weight their contribution? Or when generating a combined probability distribution, should we use the mean or the median as an average measure and what are the most relevant representative ranges to use: 10th to 90th, 5th or 95th or 25th to 75th percentiles? Finally, when combining model outcomes, what meta-analysis statistical approach do we use, for example, fixed or random effects models? It is important to be clear whether the generated consensus <i>R</i> and <i>r</i> are from statistically correlated model outcomes or data—if so the consensus value generated would be strongly weighted to those and statistical methods should be employed to prevent this and get a true representative consensus value.</p><p>Finally, on the third challenge, should we in future consider additional metrics, and specifically account for hospitalisation rate in combination to growth rate (via <i>r</i>) and transmissibility of the different variants (via <i>R</i>)? And separately, should the gold-standard epidemic metrics be different if we are considering a number of local/regional epidemics that merge to produce a large national epidemic, compared to having a slower growing but geographically large epidemic? And should the consensus epidemic metrics be age-, settings- or variant-specific?</p><p>It is important to address these, and similar challenges in generating epidemic metrics, as we continue utilising mathematical models to study and inform the epidemic status. There is ongoing work in my modelling team that is assessing how to robustly account for uncertainty—in developing models, in setting model parameters and in combining model outcomes—when generating a consensus epidemic metric such as <i>R</i>. Learning from the enormous amount of modelling development done at speed during the pandemic, and recognising the aspects that have flourished and can be readily used and those that need to be developed further and adapted to other pathogens, is crucial to be better prepared if another pandemic were to occur.</p>","PeriodicalId":49983,"journal":{"name":"Journal of the Royal Statistical Society Series A-Statistics in Society","volume":"185 S1","pages":"S150-S151"},"PeriodicalIF":1.6000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://rss.onlinelibrary.wiley.com/doi/epdf/10.1111/rssa.12982","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Royal Statistical Society Series A-Statistics in Society","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/rssa.12982","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SOCIAL SCIENCES, MATHEMATICAL METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The effective reproduction number R has been a headline epidemic metric since the onset of the COVID-19 pandemic. R measures the number of secondary infections arising from an existing infection. At the onset of a new disease, in a fully susceptible population, R is the basic reproduction number that describes the number of secondary infections stemming from an initial infected case. As the epidemic progresses, R reflects the number of secondary infections generated in a population comprising susceptible, exposed, infected, recovered and immune individuals. The growth rate r, another widely used epidemic metric throughout the COVID-19 pandemic, represents the rate at which the epidemic is growing during the exponential phase of epidemic growth. While R can be thought as reflective of the level of transmission, r can be thought as reflective of the transmission speed.
Since the onset of the SARS-CoV-2 epidemic, epidemiological models, calibrated against data on infections, hospital admissions and occupancy and deaths related to COVID-19, have been widely used to generate outcome metrics such as R or r. These have been used to inform the status of the epidemic with R increasing above 1, and analogously, r above 0, suggesting that the epidemic is growing exponentially with the emerging virus spreading fast. Tracking whether R and r are crossing these thresholds can inform if the epidemic is in a growing or shrinking phase, or the impact of imposed control measures at the time. Generating such metrics across an ensemble of models—which may be different in methodology or on the data they use to fit against—allows a consensus value of R and r to be derived. The consensus value represents an average outcome across models, and taking a combination of models rather than one model derivative value, allows for uncertainty to be accounted for in the epidemic metrics.
Generating consensus epidemic metrics from models, while useful in informing the epidemic status, has three challenges related to:
Challenge 1: Understanding how to interpret R and r across different models
Challenge 2: Understanding how R and r are statistically correlated within and across different models
Challenge 3: Understanding whether R and r are the most reliable metrics as the epidemic progresses and different interventions are employed
On the first challenge, although R and r describe broadly similar model outcomes, their exact definition depends on the model structure. For example, in agent-based models (ABMs) R can be directly counted, in population-based SEIR-type models R is the largest eigenvalue of the generation matrix and hence represents a more abstract concept, while in non-mechanistic models R is typically calculated from the incidence level and the generation time of the circulating variant. In fact within models, r and R are related via the generation time of the epidemic: the longer the generation time and higher the epidemic growth rate, higher the value of R. Appreciating how R and r are generated and related within models is thus important when we compare these and use them as independent variables within a statistical model used to generate a consensus value.
On the second challenge, we need to be clear on how related are the models and the data being used within models in the ensemble that is generating the consensus value. For example, when combining different models, some of which use different data streams or have different model structures, how does the combined estimate relatively weight their contribution? Or when generating a combined probability distribution, should we use the mean or the median as an average measure and what are the most relevant representative ranges to use: 10th to 90th, 5th or 95th or 25th to 75th percentiles? Finally, when combining model outcomes, what meta-analysis statistical approach do we use, for example, fixed or random effects models? It is important to be clear whether the generated consensus R and r are from statistically correlated model outcomes or data—if so the consensus value generated would be strongly weighted to those and statistical methods should be employed to prevent this and get a true representative consensus value.
Finally, on the third challenge, should we in future consider additional metrics, and specifically account for hospitalisation rate in combination to growth rate (via r) and transmissibility of the different variants (via R)? And separately, should the gold-standard epidemic metrics be different if we are considering a number of local/regional epidemics that merge to produce a large national epidemic, compared to having a slower growing but geographically large epidemic? And should the consensus epidemic metrics be age-, settings- or variant-specific?
It is important to address these, and similar challenges in generating epidemic metrics, as we continue utilising mathematical models to study and inform the epidemic status. There is ongoing work in my modelling team that is assessing how to robustly account for uncertainty—in developing models, in setting model parameters and in combining model outcomes—when generating a consensus epidemic metric such as R. Learning from the enormous amount of modelling development done at speed during the pandemic, and recognising the aspects that have flourished and can be readily used and those that need to be developed further and adapted to other pathogens, is crucial to be better prepared if another pandemic were to occur.
期刊介绍:
Series A (Statistics in Society) publishes high quality papers that demonstrate how statistical thinking, design and analyses play a vital role in all walks of life and benefit society in general. There is no restriction on subject-matter: any interesting, topical and revelatory applications of statistics are welcome. For example, important applications of statistical and related data science methodology in medicine, business and commerce, industry, economics and finance, education and teaching, physical and biomedical sciences, the environment, the law, government and politics, demography, psychology, sociology and sport all fall within the journal''s remit. The journal is therefore aimed at a wide statistical audience and at professional statisticians in particular. Its emphasis is on well-written and clearly reasoned quantitative approaches to problems in the real world rather than the exposition of technical detail. Thus, although the methodological basis of papers must be sound and adequately explained, methodology per se should not be the main focus of a Series A paper. Of particular interest are papers on topical or contentious statistical issues, papers which give reviews or exposés of current statistical concerns and papers which demonstrate how appropriate statistical thinking has contributed to our understanding of important substantive questions. Historical, professional and biographical contributions are also welcome, as are discussions of methods of data collection and of ethical issues, provided that all such papers have substantial statistical relevance.