{"title":"A Legacy of EM Algorithms","authors":"Kenneth Lange, Hua Zhou","doi":"10.1111/insr.12526","DOIUrl":"10.1111/insr.12526","url":null,"abstract":"<div>\u0000 \u0000 <p>Nan Laird has an enormous and growing impact on computational statistics. Her paper with Dempster and Rubin on the expectation-maximisation (EM) algorithm is the second most cited paper in statistics. Her papers and book on longitudinal modelling are nearly as impressive. In this brief survey, we revisit the derivation of some of her most useful algorithms from the perspective of the minorisation-maximisation (MM) principle. The MM principle generalises the EM principle and frees it from the shackles of missing data and conditional expectations. Instead, the focus shifts to the construction of surrogate functions via standard mathematical inequalities. The MM principle can deliver a classical EM algorithm with less fuss or an entirely new algorithm with a faster rate of convergence. In any case, the MM principle enriches our understanding of the EM principle and suggests new algorithms of considerable potential in high-dimensional settings where standard algorithms such as Newton's method and Fisher scoring falter.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"90 S1","pages":"S52-S66"},"PeriodicalIF":2.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9550131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Katarzyna Reluga, María-José Lombardía, Stefan Sperlich
{"title":"Simultaneous inference for linear mixed model parameters with an application to small area estimation","authors":"Katarzyna Reluga, María-José Lombardía, Stefan Sperlich","doi":"10.1111/insr.12519","DOIUrl":"10.1111/insr.12519","url":null,"abstract":"<p>Over the past decades, linear mixed models have attracted considerable attention in various fields of applied statistics. They are popular whenever clustered, hierarchical or longitudinal data are investigated. Nonetheless, statistical tools for valid simultaneous inference for mixed parameters are rare. This is surprising because one often faces inferential problems beyond the pointwise examination of fixed or mixed parameters. For example, there is an interest in a comparative analysis of cluster-level parameters or subject-specific estimates in studies with repeated measurements. We discuss methods for simultaneous inference assuming a linear mixed model. Specifically, we develop simultaneous prediction intervals as well as multiple testing procedures for mixed parameters. They are useful for joint considerations or comparisons of cluster-level parameters. We employ a consistent bootstrap approximation of the distribution of max-type statistic to construct our tools. The numerical performance of the developed methodology is studied in simulation experiments and illustrated in a data example on household incomes in small areas.</p>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"91 2","pages":"193-217"},"PeriodicalIF":2.0,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/insr.12519","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49301545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Computational Perspective on Projection Pursuit in High Dimensions: Feasible or Infeasible Feature Extraction","authors":"Chunming Zhang, Jimin Ye, Xiaomei Wang","doi":"10.1111/insr.12517","DOIUrl":"10.1111/insr.12517","url":null,"abstract":"<p>Finding a suitable representation of multivariate data is fundamental in many scientific disciplines. Projection pursuit (\u0000<math>\u0000 <mtext>PP</mtext></math>) aims to extract interesting ‘non-Gaussian’ features from multivariate data, and tends to be computationally intensive even when applied to data of low dimension. In high-dimensional settings, a recent work (Bickel et al., 2018) on \u0000<math>\u0000 <mtext>PP</mtext></math> addresses asymptotic characterization and conjectures of the feasible projections as the dimension grows with sample size. To gain practical utility of and learn theoretical insights into \u0000<math>\u0000 <mtext>PP</mtext></math> in an integral way, data analytic tools needed to evaluate the behaviour of \u0000<math>\u0000 <mtext>PP</mtext></math> in high dimensions become increasingly desirable but are less explored in the literature. This paper focuses on developing computationally fast and effective approaches central to finite sample studies for (i) visualizing the feasibility of \u0000<math>\u0000 <mtext>PP</mtext></math> in extracting features from high-dimensional data, as compared with alternative methods like \u0000<math>\u0000 <mtext>PCA</mtext></math> and \u0000<math>\u0000 <mtext>ICA</mtext></math>, and (ii) assessing the plausibility of \u0000<math>\u0000 <mtext>PP</mtext></math> in cases where asymptotic studies are lacking or unavailable, with the goal of better understanding the practicality, limitation and challenge of \u0000<math>\u0000 <mtext>PP</mtext></math> in the analysis of large data sets.</p>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"91 1","pages":"140-161"},"PeriodicalIF":2.0,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/insr.12517","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46111599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Calibration Techniques Encompassing Survey Sampling, Missing Data Analysis and Causal Inference","authors":"Shixiao Zhang, Peisong Han, Changbao Wu","doi":"10.1111/insr.12518","DOIUrl":"10.1111/insr.12518","url":null,"abstract":"<div>\u0000 \u0000 <p>We provide a critical review on calibration methods developed in three different areas: survey sampling, missing data analysis and causal inference. We highlight the connections and variations of calibration techniques used in missing data analysis and causal inference to conventional calibration weighting and estimation in survey sampling and provide a common framework through model-calibration and empirical likelihood to unify different calibration methods proposed in recent literature. The goal is to demonstrate the success and effectiveness of calibration methods in achieving some highly desired properties for missing data analysis and causal inference.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"91 2","pages":"165-192"},"PeriodicalIF":2.0,"publicationDate":"2022-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44699065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Administrative Records for Survey Methodology Edited by Asaph Young Chun, Michael D. Larsen, Gabriele Durrant, Jerome P. ReiterJohn Wiley and Sons, 2021, 384 pages, $128.95 (hardcover) ISBN: 978-1-1192-7204-5","authors":"Reijo Sund","doi":"10.1111/insr.12516","DOIUrl":"10.1111/insr.12516","url":null,"abstract":"","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"90 2","pages":"415-417"},"PeriodicalIF":2.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42392630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Extreme Value Theory with Applications to Natural Hazards: From Statistical Theory to Industrial Practice Edited by Nicolas Bousquet and Pietro BernardaraSpringer Cham, 2021, xxii + 481 pages, $199.99 ISBN: 978-3-030-74941-5","authors":"Fabrizio Durante","doi":"10.1111/insr.12513","DOIUrl":"10.1111/insr.12513","url":null,"abstract":"","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"90 2","pages":"411-412"},"PeriodicalIF":2.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46430119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Are You All Normal? It Depends!","authors":"Wanfang Chen, Marc G. Genton","doi":"10.1111/insr.12512","DOIUrl":"10.1111/insr.12512","url":null,"abstract":"<div>\u0000 \u0000 <p>The assumption of normality has underlain much of the development of statistics, including spatial statistics, and many tests have been proposed. In this work, we focus on the multivariate setting and first review the recent advances in multivariate normality tests for i.i.d. data, with emphasis on the skewness and kurtosis approaches. We show through simulation studies that some of these tests cannot be used directly for testing normality of spatial data. We further review briefly the few existing univariate tests under dependence (time or space), and then propose a new multivariate normality test for spatial data by accounting for the spatial dependence. The new test utilises the union-intersection principle to decompose the null hypothesis into intersections of univariate normality hypotheses for projection data, and it rejects the multivariate normality if any individual hypothesis is rejected. The individual hypotheses for univariate normality are conducted using a Jarque–Bera type test statistic that accounts for the spatial dependence in the data. We also show in simulation studies that the new test has a good control of the type I error and a high empirical power, especially for large sample sizes. We further illustrate our test on bivariate wind data over the Arabian Peninsula.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"91 1","pages":"114-139"},"PeriodicalIF":2.0,"publicationDate":"2022-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48771273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
James H. McVittie, Ana F. Best, David B. Wolfson, David A. Stephens, Julian Wolfson, David L. Buckeridge, Shahinaz M. Gadalla
{"title":"Survival Modelling for Data From Combined Cohorts: Opening the Door to Meta Survival Analyses and Survival Analysis Using Electronic Health Records","authors":"James H. McVittie, Ana F. Best, David B. Wolfson, David A. Stephens, Julian Wolfson, David L. Buckeridge, Shahinaz M. Gadalla","doi":"10.1111/insr.12510","DOIUrl":"10.1111/insr.12510","url":null,"abstract":"<div>\u0000 \u0000 <p>Non-parametric estimation of the survival function using observed failure time data depends on the underlying data generating mechanism, including the ways in which the data may be censored and/or truncated. For data arising from a single source or collected from a single cohort, a wide range of estimators have been proposed and compared in the literature. Often, however, it may be possible, and indeed advantageous, to combine and then analyse survival data that have been collected under different study designs. We review non-parametric survival analysis for data obtained by combining the most common types of cohort. We have two main goals: (i) to clarify the differences in the model assumptions and (ii) to provide a single lens through which some of the proposed estimators may be viewed. Our discussion is relevant to the meta-analysis of survival data obtained from different types of study, and to the modern era of electronic health records.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"91 1","pages":"72-87"},"PeriodicalIF":2.0,"publicationDate":"2022-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/insr.12510","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9490735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}