{"title":"Robust reflections","authors":"David Andrews, Chris Field","doi":"10.1002/cjs.11709","DOIUrl":"10.1002/cjs.11709","url":null,"abstract":"<p>Two senior statisticians/data scientists reflect on the challenges arising from the analysis of increasingly complex data using robustness. They include some thoughts on the types of robust analysis that will be needed in the future, while cognizant of our very limited ability to successfully predict the future.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11709","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44229400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Canadian Statistical Sciences Institute 2003–2022","authors":"Mary Thompson, Nancy Reid, Don Estep","doi":"10.1002/cjs.11716","DOIUrl":"10.1002/cjs.11716","url":null,"abstract":"<p>This article describes the founding and growth of the Canadian Statistical Sciences Institute (CANSSI), starting from its early roots and continuing through to establishment as a mature research enterprise. The goal is to present a historical record of events and activities that were important in the development of CANSSI.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11716","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42193083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Pseudo empirical likelihood inference for nonprobability survey samples","authors":"Yilin Chen, Pengfei Li, J. N. K. Rao, Changbao Wu","doi":"10.1002/cjs.11708","DOIUrl":"10.1002/cjs.11708","url":null,"abstract":"<p>In this article, we first provide an overview of two major developments on complex survey data analysis: the empirical likelihood methods and statistical inference with nonprobability survey samples. We highlight the important research contributions to the field of survey sampling in general and the two topics in particular by Canadian survey statisticians. We then propose new inferential procedures for analyzing nonprobability survey samples through the pseudo empirical likelihood approach. The proposed methods lead to point estimators asymptotically equivalent to those discussed in the recent literature but with more desirable features on confidence intervals such as range-respecting and data-driven orientation. Results from a simulation study demonstrate the superiority of the proposed methods in dealing with binary response variables.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11708","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48329998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Radu V. Craiu, Paul Gustafson, Jeffrey S. Rosenthal
{"title":"Reflections on Bayesian inference and Markov chain Monte Carlo","authors":"Radu V. Craiu, Paul Gustafson, Jeffrey S. Rosenthal","doi":"10.1002/cjs.11707","DOIUrl":"10.1002/cjs.11707","url":null,"abstract":"<p>Bayesian inference and Markov chain Monte Carlo methods are vigorous areas of statistical research. Here we reflect on some recent developments and future directions in these fields.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11707","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42447477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Distributed estimation with empirical likelihood","authors":"Qianqian Liu, Zhouping Li","doi":"10.1002/cjs.11706","DOIUrl":"10.1002/cjs.11706","url":null,"abstract":"<p>With the development of science and technology, massive datasets stored in multiple machines are increasingly prevalent. It is known that traditional statistical methods may be infeasible for analyzing large datasets owing to excessive computing time, memory limitations, communication costs, and privacy concerns. This article develops divide-and-conquer empirical likelihood (DEL) and divide-and-conquer exponentially tilted empirical likelihood (DETEL) methods for the distributed computing setting. We investigate the theoretical properties of the DEL and DETEL estimators. In particular, we derive upper bounds for the mean squared errors of the DEL and DETEL estimators, and, under some mild conditions, we prove the consistency and the asymptotic normality of the proposed estimators. Simulation studies and a real data analysis are carried out to demonstrate the finite-sample performance of the proposed methods.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41412417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reducing bias due to misclassified exposures using instrumental variables","authors":"Christopher Manuel, Samiran Sinha, Suojin Wang","doi":"10.1002/cjs.11705","DOIUrl":"10.1002/cjs.11705","url":null,"abstract":"<p>Exposures are often misclassified in observational studies. Any analysis that does not make proper adjustments for misclassification may result in biased estimates of model parameters, resulting in distorted inference. Settings where a multicategory exposure variable has more than two nominal categories or where no validation data are available to assess misclassification probabilities are common in practice but seldom considered in the literature. This article presents a novel method of analyzing cohort data with a misclassified, multicategory exposure variable and a binary response variable that uses instrumental variables in lieu of a validation dataset. First, a sufficient condition is obtained for model identifiability. Then, methods for model estimation and inference are proposed after adopting a sufficient condition for identifiability. We consider a variational Bayesian inference procedure aided by automatic differentiation along with Markov chain Monte Carlo-based computation. Operating characteristics of the proposed methods are assessed through simulation studies. For the purpose of illustration, the proposed Bayesian methods are applied to the U.S. breast cancer mortality data sampled from the Surveillance Epidemiology and End Results database, where reported treatment therapy is the misclassified multicategory exposure variable.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48218452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Likelihood identifiability and parameter estimation with nonignorable missing data","authors":"Siming Zheng, Juan Zhang, Yong Zhou","doi":"10.1002/cjs.11704","DOIUrl":"10.1002/cjs.11704","url":null,"abstract":"<p>We identify sufficient conditions to resolve the identification problem under nonignorable missingness, especially the identifiability of the observed likelihood when some of the covariate values are missing not at random, or, simultaneously, the response is also missing not at random. It is more difficult to tackle these cases than the nonignorable nonresponse case, and, to the best of our knowledge, the simultaneously missing case has never been discussed before. Under these conditions, we propose some parameter estimation methods. As an illustration, when some of the covariate values are missing not at random, we adopt a semiparametric logistic model with a tilting parameter to model the missingness mechanism and use an imputed estimating equation based on the generalized method of moments to estimate the parameters of interest and the tilting parameter simultaneously. This approach avoids the requirement for other independent surveys or a validation sample to estimate the unknown tilting parameter. The asymptotic properties of our proposed estimators are derived, and the proofs can be modified to show that our methods of estimation, which are based on inverse probability weighting, augmented inverse probability weighting, and estimating equation projection, have the same asymptotic efficiency when the tilting parameter is either known or unknown but estimated by some other method. In simulation studies, we compare our methods with various alternative approaches and find that our methods are more robust and effective.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47145609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dynamic treatment regimes with interference","authors":"Cong Jiang, Michael P. Wallace, Mary E. Thompson","doi":"10.1002/cjs.11702","DOIUrl":"10.1002/cjs.11702","url":null,"abstract":"<p>Precision medicine describes health care where patient-level data are used to inform treatment decisions. Within this framework, dynamic treatment regimes (DTRs) are sequences of decision rules that take individual patient information as input data and then output treatment recommendations. DTR estimation from observational data typically relies on the assumption of no interference: i.e., the outcome of one individual is unaffected by the treatment assignment of others. However, in many social network contexts, such as friendship or family networks, and for many health concerns, such as infectious diseases, this assumption is questionable. We investigate the DTR estimation method of dynamic weighted ordinary least squares (dWOLS), which boasts of easy implementation and the so-called double-robustness property, but relies on the assumption of no interference. We define a network propensity function and build on it to establish an implementation of dWOLS that remains doubly robust under interference associated with network links. The method's properties are demonstrated via simulation and applied to data from the Population Assessment of Tobacco and Health (PATH) study to investigate cigarette dependence within two-person household networks.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47847365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Missing data analysis with sufficient dimension reduction","authors":"Siming Zheng, Alan T. K. Wan, Yong Zhou","doi":"10.1002/cjs.11700","DOIUrl":"10.1002/cjs.11700","url":null,"abstract":"<p>This article develops a two-step procedure for estimating the unknown parameters in a model that contains a fixed but large number of covariates, more moment conditions than unknown parameters, and responses that are missing at random. We propose a sufficient dimension reduction method to be implemented in the first step and prove that the method is asymptotically valid. In the second step, we apply three well-known missing data handling mechanisms together with the generalized method of moments to the reduced-dimensional subspace to obtain estimates of unknown parameters. We investigate the theoretical properties of the proposed methods, including the effects of dimension reduction on the asymptotic distributions of the estimators. Our results refute a claim in an earlier study that dimension reduction yields the same asymptotic distributions of estimators as when the reduced-dimensional structure is the true structure. We illustrate our method by way of a simulation study and a real clinical trial data example.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44037737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Composite bias-reduced \u0000 \u0000 \u0000 \u0000 L\u0000 \u0000 \u0000 p\u0000 \u0000 \u0000 -quantile-based estimators of extreme quantiles and expectiles","authors":"Gilles Stupfler, Antoine Usseglio-Carleve","doi":"10.1002/cjs.11703","DOIUrl":"10.1002/cjs.11703","url":null,"abstract":"<p>Quantiles are a fundamental concept in extreme value theory. They can be obtained from a minimization framework using an asymmetric absolute error loss criterion. The companion notion of expectiles, based on asymmetric squared rather than asymmetric absolute error loss minimization, has received substantial attention from the fields of actuarial science, finance, and econometrics over the last decade. Quantiles and expectiles can be embedded in a common framework of <math>\u0000 <mrow>\u0000 <msup>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 </msup>\u0000 </mrow></math>-quantiles, whose extreme value properties have been explored very recently. Although this generalized notion of quantiles has shown potential for the estimation of extreme quantiles and expectiles, available estimators remain quite difficult to use: they suffer from substantial bias, and the question of the choice of the tuning parameter <math>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow></math> remains open. In this article, we work in a context of heavy tails and construct composite bias-reduced estimators of extreme quantiles and expectiles based on <math>\u0000 <mrow>\u0000 <msup>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 </msup>\u0000 </mrow></math>-quantiles. We provide a discussion of the data-driven choice of <math>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow></math> and of the anchor <math>\u0000 <mrow>\u0000 <msup>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 </mrow>\u0000 </msup>\u0000 </mrow></math>-quantile level in practice. The proposed methodology is compared with existing approaches on simulated data and real data.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.6,"publicationDate":"2022-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48049797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}