{"title":"Rethinking statistical approaches for serological data analysis for viral surveillance","authors":"Morgan P. Kain , Jonathan H. Epstein , Noam Ross","doi":"10.1016/j.jviromet.2025.115149","DOIUrl":null,"url":null,"abstract":"<div><div>A robust serological surveillance system for zoonotic pathogens is imperative for both early detection and advancing knowledge of emerging diseases. A statistical analysis plan that is aligned to research and epidemiological goals requires a purposeful choice among alternative methods for differentiating seronegative from seropositive samples, estimating seroprevalence, and estimating risk factors associated with seropositivity. The common standard deviation-based cutoff (e.g., 3sd) approach is simple to implement and understand, but fails to appropriately propagate uncertainty in serostatus assignments to any risk factor analysis. Methods such as Gaussian mixture models, which jointly estimate serostatus, risk factors, and their uncertainty, can alleviate the dichotomy created by the cutoff approach. Yet, because of a lack of empirical guidance of method performance, it remains difficult to choose a robust analysis method for a given serological dataset. Here we examine the performance of both cutoff and clustering approaches using simulated datasets that represent the epidemiological, biological, and immunological data generation process. We focus on understudied pathogens for which validated serological assays do not exist, as is common in emerging viruses in wildlife. We quantify coverage (the proportion of time 95 % confidence intervals contain the true value) and bias (systematic differences between true values and model point estimates) of model estimates for individual serostatus assignments, population seroprevalence, and regression coefficients for serostatus risk factors. In nearly all scenarios, Bayesian mixture models provide the highest coverage and lowest bias. Only with very low seroprevalence (∼ < 3 %) and large differences in signal between seronegative and seropositive individuals will a cutoff provide low bias and near-nominal coverage. Given poor coverage of risk factor regression coefficients, we advise against using a cutoff approach for quantifying determinants of seropositivity.</div></div>","PeriodicalId":17663,"journal":{"name":"Journal of virological methods","volume":"335 ","pages":"Article 115149"},"PeriodicalIF":2.2000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of virological methods","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166093425000424","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
A robust serological surveillance system for zoonotic pathogens is imperative for both early detection and advancing knowledge of emerging diseases. A statistical analysis plan that is aligned to research and epidemiological goals requires a purposeful choice among alternative methods for differentiating seronegative from seropositive samples, estimating seroprevalence, and estimating risk factors associated with seropositivity. The common standard deviation-based cutoff (e.g., 3sd) approach is simple to implement and understand, but fails to appropriately propagate uncertainty in serostatus assignments to any risk factor analysis. Methods such as Gaussian mixture models, which jointly estimate serostatus, risk factors, and their uncertainty, can alleviate the dichotomy created by the cutoff approach. Yet, because of a lack of empirical guidance of method performance, it remains difficult to choose a robust analysis method for a given serological dataset. Here we examine the performance of both cutoff and clustering approaches using simulated datasets that represent the epidemiological, biological, and immunological data generation process. We focus on understudied pathogens for which validated serological assays do not exist, as is common in emerging viruses in wildlife. We quantify coverage (the proportion of time 95 % confidence intervals contain the true value) and bias (systematic differences between true values and model point estimates) of model estimates for individual serostatus assignments, population seroprevalence, and regression coefficients for serostatus risk factors. In nearly all scenarios, Bayesian mixture models provide the highest coverage and lowest bias. Only with very low seroprevalence (∼ < 3 %) and large differences in signal between seronegative and seropositive individuals will a cutoff provide low bias and near-nominal coverage. Given poor coverage of risk factor regression coefficients, we advise against using a cutoff approach for quantifying determinants of seropositivity.
期刊介绍:
The Journal of Virological Methods focuses on original, high quality research papers that describe novel and comprehensively tested methods which enhance human, animal, plant, bacterial or environmental virology and prions research and discovery.
The methods may include, but not limited to, the study of:
Viral components and morphology-
Virus isolation, propagation and development of viral vectors-
Viral pathogenesis, oncogenesis, vaccines and antivirals-
Virus replication, host-pathogen interactions and responses-
Virus transmission, prevention, control and treatment-
Viral metagenomics and virome-
Virus ecology, adaption and evolution-
Applied virology such as nanotechnology-
Viral diagnosis with novelty and comprehensive evaluation.
We seek articles, systematic reviews, meta-analyses and laboratory protocols that include comprehensive technical details with statistical confirmations that provide validations against current best practice, international standards or quality assurance programs and which advance knowledge in virology leading to improved medical, veterinary or agricultural practices and management.