Statistical SciencePub Date : 2022-05-01Epub Date: 2022-05-16DOI: 10.1214/22-sts853
Lorenzo Cappello, Jaehee Kim, Sifan Liu, Julia A Palacios
{"title":"Statistical Challenges in Tracking the Evolution of SARS-CoV-2.","authors":"Lorenzo Cappello, Jaehee Kim, Sifan Liu, Julia A Palacios","doi":"10.1214/22-sts853","DOIUrl":"10.1214/22-sts853","url":null,"abstract":"<p><p>Genomic surveillance of SARS-CoV-2 has been instrumental in tracking the spread and evolution of the virus during the pandemic. The availability of SARS-CoV-2 molecular sequences isolated from infected individuals, coupled with phylodynamic methods, have provided insights into the origin of the virus, its evolutionary rate, the timing of introductions, the patterns of transmission, and the rise of novel variants that have spread through populations. Despite enormous global efforts of governments, laboratories, and researchers to collect and sequence molecular data, many challenges remain in analyzing and interpreting the data collected. Here, we describe the models and methods currently used to monitor the spread of SARS-CoV-2, discuss long-standing and new statistical challenges, and propose a method for tracking the rise of novel variants during the epidemic.</p>","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"37 2","pages":"162-182"},"PeriodicalIF":3.9,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9409356/pdf/nihms-1829142.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9532515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
George Nicholson, Marta Blangiardo, Mark Briers, Peter J Diggle, Tor Erlend Fjelde, Hong Ge, Robert J B Goudie, Radka Jersakova, Ruairidh E King, Brieuc C L Lehmann, Ann-Marie Mallon, Tullia Padellini, Yee Whye Teh, Chris Holmes, Sylvia Richardson
{"title":"Interoperability of statistical models in pandemic preparedness: principles and reality.","authors":"George Nicholson, Marta Blangiardo, Mark Briers, Peter J Diggle, Tor Erlend Fjelde, Hong Ge, Robert J B Goudie, Radka Jersakova, Ruairidh E King, Brieuc C L Lehmann, Ann-Marie Mallon, Tullia Padellini, Yee Whye Teh, Chris Holmes, Sylvia Richardson","doi":"10.1214/22-STS854","DOIUrl":"10.1214/22-STS854","url":null,"abstract":"<p><p>We present <i>interoperability</i> as a guiding framework for statistical modelling to assist policy makers asking multiple questions using diverse datasets in the face of an evolving pandemic response. Interoperability provides an important set of principles for future pandemic preparedness, through the joint design and deployment of adaptable systems of statistical models for disease surveillance using probabilistic reasoning. We illustrate this through case studies for inferring and characterising spatial-temporal prevalence and reproduction numbers of SARS-CoV-2 infections in England.</p>","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"37 2","pages":"183-206"},"PeriodicalIF":3.9,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7612804/pdf/EMS144307.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10852221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Lessons Learned from the COVID-19 Pandemic: A Statistician’s Reflection","authors":"Xihong Lin","doi":"10.1214/22-sts860","DOIUrl":"https://doi.org/10.1214/22-sts860","url":null,"abstract":"In this article, I will discuss my experience as a statistician involved in COVID-19 research in multiple capacities in the last two years, especially in the early phase of the pandemic. I will reflect on the challenges and the lessons I have learned in pandemic research regarding data collection and access, epidemic modeling and data analysis, open science and real time dissemination of research findings, implementation science, media and public communication, and partnerships between academia, government, industry and civil society. I will also make several recommendations on navigating the next stage of the pandemic and preparing for future pandemics.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42229218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data, Science, and Global Disasters","authors":"John M. Chambers","doi":"10.1214/22-sts858","DOIUrl":"https://doi.org/10.1214/22-sts858","url":null,"abstract":"The spread and impact of COVID-19 have disrupted human activities and energized a response of scientific activity on a remarkable, nearly unprecedented scale. This has somewhat distracted attention from a broad range of less immediate but fundamentally more serious global threats resulting from human actions. These can be collectively labelled the anthropocene disasters. Science cannot itself prevent or mitigate them. To do so requires a global policy resolve not currently existing. When and if that resolve emerges, science will be essential for guiding action. This science will be radically data-intensive, global and inclusive. Teams will be required that include the best and most motivated individuals from all relevant scientific disciplines, plus members knowledgable about implementing likely policy recommendations. Such participants must be attracted to join and then properly supported and rewarded–not likely with current academic structures. Some insights can be gained from the recent experience with COVID-19 and the much less recent example of research at Bell Labs.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"1 1","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66088792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Being a Public Health Statistician During a Global Pandemic","authors":"B. Mukherjee","doi":"10.1214/22-sts859","DOIUrl":"https://doi.org/10.1214/22-sts859","url":null,"abstract":"In this perspective, I first share some key lessons learned from the experience of modeling the transmission dynamics of SARS-CoV-2 in India since the beginning of the COVID-19 pandemic in 2020. Second, I discuss some interesting open problems related to COVID-19 where statisticians have a lot to contribute to in the coming years. Finally, I emphasize the need for having integrated and resilient public health data systems: good data coupled with good models are at the heart of effective policymaking.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42709497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Conversation with Ross Prentice","authors":"L. Hsu, C. Kooperberg","doi":"10.1214/21-sts829","DOIUrl":"https://doi.org/10.1214/21-sts829","url":null,"abstract":"Ross L. Prentice received his B.Sc. from the University of Waterloo and his Ph.D. from the University of Toronto. He joined the University of Washington (UW) and the Fred Hutchinson Cancer Research Center (the Hutch) in 1974, and is currently Professor of Biostatistics at these institutions. He was Senior Vice President at the Hutch, and Director of its Public Health Sciences Division, for more than 25 years. Dr. Prentice’s expertise and research interests are in the fields of biostatistics, epidemiology, and disease prevention. He played a central role in the conception, design, and implementation of the Women’s Health Initiative. In statistical and medical literature he has over 500 scientific papers, including more than 40 with 500 or more citations. His substantial contributions to the theory of population and clinical research include the use of surrogate endpoints and case-cohort designs and other areas such as survival analysis, nutritional epidemiology, genetic epidemiology, biomarkers, and measurement error. Dr. Prentice is recognized for his mentoring of students and junior colleagues, and for his generous collaborations. Dr. Prentice has received numerous awards for his work, including an honorary doctorate in mathematics from the University of Waterloo, the Mantel Award for Lifetime Contributions to Statistics in Epidemiology from the American Statistical Association, the Mortimer Spiegelman Award from the American Public Health Association, the Committee of Presidents of Statistical Societies Presidents’ Award and RA Fisher Award, the Marvin Zelen Leadership Award for Outstanding Achievement in Statistical Science from Harvard University, the American Association of Cancer Research/American Cancer Society Award for Research Excellence in Cancer Epidemiology and Prevention, and the American Association for Cancer Research Team Science Award. He was elected to the Institute of Medicine/National Academy of Medicine in 1990. The Ross L. Prentice Endowed Professorship of Biostatistical Collaboration was created at the UW in 2005 and has been awarded every year since its inception. The interior space of the Public Health Sciences building at the Hutch has been named the Ross L. Prentice Atrium. In his spare time, Ross enjoys sports including water skiing, golf, running, and spending time with his wife, Didi, and with his daughters, sons-in-law, and grandchildren. He ran daily from when he was in his 20s until his knees objected about 10 years ago. This interview took place with Li Hsu and Charles Kooperberg via Zoom in December 2020.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"1 1","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41588952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Baddeley, Tilman M. Davies, S. Rakshit, Gopalan M. Nair, Greg McSwiggan
{"title":"Diffusion Smoothing for Spatial Point Patterns","authors":"A. Baddeley, Tilman M. Davies, S. Rakshit, Gopalan M. Nair, Greg McSwiggan","doi":"10.1214/21-sts825","DOIUrl":"https://doi.org/10.1214/21-sts825","url":null,"abstract":"Traditional kernel methods for estimating the spatially-varying density of points in a spatial point pattern may exhibit unrealistic artefacts, in addition to the familiar problems of bias and overor undersmoothing. Performance can be improved by using diffusion smoothing, in which the smoothing kernel is the heat kernel on the spatial domain. This paper develops diffusion smoothing into a practical statistical methodology for two-dimensional spatial point pattern data. We clarify the advantages and disadvantages of diffusion smoothing over Gaussian kernel smoothing. Adaptive smoothing, where the smoothing bandwidth is spatially-varying, can be performed by adopting a spatiallyvarying diffusion rate: this avoids technical problems with adaptive Gaussian smoothing and has substantially better performance. We introduce a new form of adaptive smoothing using lagged arrival times, which has good performance and improved robustness. Applications in archaeology and epidemiology are demonstrated. The methods are implemented in open-source R code. AMS 2000 subject classifications: Primary 62G07; secondary 62M30.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47305929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Statistical Dependence: Beyond Pearson’s ρ","authors":"D. Tjøstheim, Håkon Otneim, Bård Støve","doi":"10.1214/21-sts823","DOIUrl":"https://doi.org/10.1214/21-sts823","url":null,"abstract":"Pearson’s ρ is the most used measure of statistical dependence. It gives a complete characterization of dependence in the Gaussian case, and it also works well in some non-Gaussian situations. It is well known, however, that it has a number of shortcomings; in particular for heavy tailed distributions and in nonlinear situations, where it may produce misleading, and even disastrous results. In recent years a number of alternatives have been proposed. In this paper, we will survey these developments, especially results obtained in the last couple of decades. Among measures discussed are the copula, distribution-based measures, the distance covariance, the HSIC measure popular in machine learning, and finally the local Gaussian correlation, which is a local version of Pearson’s ρ. Throughout we put the emphasis on conceptual developments and a comparison of these. We point out relevant references to technical details as well as comparative empirical and simulated experiments. There is a broad selection of references under each topic treated.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49302811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Some Perspectives on Inference in High Dimensions","authors":"H. Battey, D. Cox","doi":"10.1214/21-sts824","DOIUrl":"https://doi.org/10.1214/21-sts824","url":null,"abstract":"With very large amounts of data, important aspects of statistical analysis may appear largely descriptive in that the role of probability sometimes seems limited or totally absent. The main emphasis of the present paper lies on contexts where formulation in terms of a probabilistic model is feasible and fruitful but to be at all realistic large numbers of unknown parameters need consideration. Then many of the standard approaches to statistical analysis, for instance direct application of the method of maximum likelihood, or the use of flat priors, often encounter difficulties. After a brief discussion of broad conceptual issues, we provide some new perspectives on aspects of high-dimensional statistical theory, emphasizing a number of open problems.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46293804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}