Aya A. Mitani , Allison W. Kurian , Amar K. Das , Manisha Desai
{"title":"Navigating choices when applying multiple imputation in the presence of multi-level categorical interaction effects","authors":"Aya A. Mitani , Allison W. Kurian , Amar K. Das , Manisha Desai","doi":"10.1016/j.stamet.2015.06.001","DOIUrl":"10.1016/j.stamet.2015.06.001","url":null,"abstract":"<div><p>Multiple imputation (MI) is an appealing option for handling missing data. When implementing MI, however, users need to make important decisions to obtain estimates with good statistical properties. One such decision involves the choice of imputation model–the joint modeling (JM) versus fully conditional specification (FCS) approach. Another involves the choice of method to handle interactions. These include imputing the interaction term as any other variable (active imputation), or imputing the main effects and then deriving the interaction (passive imputation). Our study investigates the best approach to perform MI in the presence of interaction effects involving two categorical variables. Such effects warrant special attention as they involve multiple correlated parameters that are handled differently under JM and FCS modeling. Through an extensive simulation study, we compared active, passive and an improved passive approach under FCS, as JM precludes passive imputation. We additionally compared JM and FCS techniques using active imputation. Performance between active and passive imputation was comparable. The improved passive approach proved superior to the other two particularly when the number of parameters corresponding to the interaction was large. JM without rounding and FCS using active imputation were also mostly comparable, with JM outperforming FCS when the number of parameters was large. In a direct comparison of JM active and FCS improved passive, the latter was the clear winner. We recommend improved passive imputation under FCS along with sensitivity analyses to handle multi-level interaction terms.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"27 ","pages":"Pages 82-99"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.06.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Directionally collapsible parameterizations of multivariate binary distributions","authors":"Tamás Rudas","doi":"10.1016/j.stamet.2015.07.001","DOIUrl":"https://doi.org/10.1016/j.stamet.2015.07.001","url":null,"abstract":"<div><p>Odds ratios and log-linear parameters are not collapsible, which means that including a variable into the analysis or omitting one from it, may change the strength of association among the remaining variables. Even the direction of association may be reversed, a fact that is often discussed under the name of Simpson’s paradox. A parameter of association is directionally collapsible, if this reversal cannot occur. The paper investigates the existence of parameters of association which are directionally collapsible. It is shown, that subject to two simple assumptions, no parameter of association, which depends only on the conditional distributions, like the odds ratio does, can be directionally collapsible. The main result is that every directionally collapsible parameter of association gives the same direction of association as a linear contrast of the cell probabilities does. The implication for dealing with Simpson’s paradox is that there is exactly one way to associate direction with the association in any table, so that the paradox never occurs.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"27 ","pages":"Pages 132-145"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.07.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92142578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A study of moments and likelihood estimators of the odd Weibull distribution","authors":"Kahadawala Cooray","doi":"10.1016/j.stamet.2015.03.003","DOIUrl":"10.1016/j.stamet.2015.03.003","url":null,"abstract":"<div><p><span>The odd Weibull distribution is a three-parameter generalization of the Weibull and the inverse Weibull distributions having rich density and hazard shapes for modeling lifetime data. This paper explored the odd </span>Weibull parameter<span><span> regions having finite moments and examined the relation to some well-known distributions based on skewness and kurtosis functions. The existence of </span>maximum likelihood estimators<span> have shown with complete data for any sample size. The proof for the uniqueness of these estimators is given only when the absolute value of the second shape parameter is between zero and one. Furthermore, elements of the Fisher information matrix are obtained based on complete data using a single integral representation which have shown to exist for any parameter values. The performance of the odd Weibull distribution over various density and hazard shapes is compared with generalized gamma distribution using two different test statistics. Finally, analysis of two data sets has been performed for illustrative purposes.</span></span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"26 ","pages":"Pages 72-83"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.03.003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55092922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Testing for the cointegration rank in threshold cointegrated systems with multiple cointegrating relationships","authors":"Jaya Krishnakumar, David Neto","doi":"10.1016/j.stamet.2015.04.001","DOIUrl":"10.1016/j.stamet.2015.04.001","url":null,"abstract":"<div><p>This paper is concerned with the estimation and inference for a threshold VECM with more than one cointegrating relation thus extending the literature in this context which has mostly considered only one cointegrating relationship so far. We then go on to develop an appropriate test for the number of cointegrating relationships in a threshold VECM. Asymptotic distributions of our test statistics are derived and tabulated. Finite sample performance of the proposed testing procedure is investigated.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"26 ","pages":"Pages 84-102"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.04.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55092930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Differentiated logdensity approximants","authors":"Serge B. Provost , Hyung-Tae Ha","doi":"10.1016/j.stamet.2015.02.005","DOIUrl":"10.1016/j.stamet.2015.02.005","url":null,"abstract":"<div><p><span>A moment-based density approximation technique whereby the derivative of the logarithm of a density approximant is expressed as a rational function is introduced in this paper. Guidelines for the selection of the polynomial orders of the numerator<span> and denominator are proposed. The coefficients are then determined by solving a system of linear equations. The resulting density approximation, referred to as a differentiated logdensity approximant or </span></span><span><math><mi>DLA</mi></math></span><span>, satisfies a differential equation whose explicit solution is provided. It is shown that a unique solution exists when a polynomial is utilized in lieu of a rational function. The proposed methodology is successfully applied to two test statistics and several distributions. It is also explained that the same moment-matching technique can yield density estimates on the basis of sample moments. An example involving a widely analyzed data set illustrates this approach.</span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"26 ","pages":"Pages 61-71"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.02.005","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55092859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Nonconvex penalized ridge estimations for partially linear additive models in ultrahigh dimension","authors":"Mingqiu Wang","doi":"10.1016/j.stamet.2015.03.001","DOIUrl":"10.1016/j.stamet.2015.03.001","url":null,"abstract":"<div><p><span><span><span><span><span>Nonconvex penalties (such as the smoothly clipped absolute deviation penalty and the minimax<span> concave penalty) have some attractive properties including the unbiasedness, continuity and sparsity, and the ridge regression can deal with the </span></span>collinearity problem. Combining the strengths of nonconvex penalties and ridge regression (abbreviated as NPR), we study the oracle selection property of the NPR estimator for high-dimensional partially linear </span>additive models with highly </span>correlated predictors, where the dimensionality of </span>covariates </span><span><math><msub><mrow><mi>p</mi></mrow><mrow><mi>n</mi></mrow></msub></math></span> is allowed to increase exponentially with the sample size <span><math><mi>n</mi></math></span>. Simulation studies and a real data analysis are carried out to show the performance of the NPR method.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"26 ","pages":"Pages 1-15"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.03.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55092880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tomasz J. Kozubowski , Anna K. Panorska , Matthew L. Forister
{"title":"A discrete truncated Pareto distribution","authors":"Tomasz J. Kozubowski , Anna K. Panorska , Matthew L. Forister","doi":"10.1016/j.stamet.2015.04.002","DOIUrl":"10.1016/j.stamet.2015.04.002","url":null,"abstract":"<div><p>We propose a new discrete distribution with finite support, which generalizes truncated Pareto and beta distributions<span> as well as uniform and Benford’s laws. Although our focus is on basic properties and stochastic representations, we also consider parameter estimation and include an illustration from ecology showing potential applications of this new stochastic model.</span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"26 ","pages":"Pages 135-150"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.04.002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55092946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sequential negative binomial problems and statistical ecology: A selected review with new directions","authors":"Nitis Mukhopadhyay , Swarnali Banerjee","doi":"10.1016/j.stamet.2015.02.006","DOIUrl":"10.1016/j.stamet.2015.02.006","url":null,"abstract":"<div><p><span>Count data is abundant in entomology, more broadly, in statistical ecology. In 1949, Frank Anscombe pioneered the role of negative binomial (NB) modeling while working with insect count data. Since then, the spectrum of available research methods has grown immensely in more than past sixty years in involving count data modeled by a </span>NB distribution. NB distribution also finds extensive use in agriculture, insect infestation, soil and weed science, etc. In this paper we have used a real dataset on potato beetle infestation (Beall, 1939) to illustrate smooth data collection under various sequential inferential procedures to draw important and practical conclusions.</p><p>We begin by selectively reviewing a majority of influential research methods for a number of formulations and their executions in the context of fixed-sample-size inferential procedures (Section <span>2</span>). Subsequently, we elaborate on purely sequential and two-stage sampling methodologies for data collection (Sections <span>3 Sequential inferential problems: tests of hypotheses</span>, <span>4 Sequential inferential problems: estimation</span>). In Section <span>5</span>, we summarize some major results with their interpretations including large-sample first- and second-order properties as appropriate. The illustrations of all the sequential inferential procedures on the real dataset gives interesting insights (Section <span>6</span>). We also propose a number of selected directions for future research of substantial nature (Section <span>7</span>). Finally, our own R codes are provided in the <span>Appendix</span>.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"26 ","pages":"Pages 34-60"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.02.006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55092869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mohammad Nourmohammadi, Mohammad Jafari Jozani, Brad C. Johnson
{"title":"Distribution-free tolerance intervals with nomination samples: Applications to mercury contamination in fish","authors":"Mohammad Nourmohammadi, Mohammad Jafari Jozani, Brad C. Johnson","doi":"10.1016/j.stamet.2015.03.002","DOIUrl":"10.1016/j.stamet.2015.03.002","url":null,"abstract":"<div><p><span>Tolerance intervals<span> are enclosure intervals which will cover a fixed portion of the population distribution with a specified confidence. These intervals are widely used in clinical, environmental, biological and industrial applications, including quality control and environmental monitoring, to help determine limits for detection or assessment monitoring. In many of these applications the measurement of the variable of interest is costly and/or destructive but a small number of sampling units can be ranked easily by using expert-opinion knowledge or inexpensive and easily obtained measurements from these units. In this paper, we construct tolerance intervals based on the expensive measurements that are obtained using randomized nomination sampling (RNS) with the help of inexpensive auxiliary information. We study the performance of our proposed RNS-based tolerance intervals based on the corresponding coverage probabilities and the necessary sample size for their existence with those based on </span></span>simple random sampling (SRS). The efficiency of the constructed RNS-based tolerance intervals compared to their SRS counterparts is discussed. We investigate the performance of RNS-based tolerance intervals for different values of the design parameters and various population shapes. We find the values of the design parameters which improve RNS over SRS. The RNS design in presence of ranking error is discussed and a new method for estimating ranking error probabilities is proposed. Theoretical results are augmented with numerical evaluations and a case study based on a fish mercury level dataset.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"26 ","pages":"Pages 16-33"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.03.002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55092888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Estimating the parameters of a seasonal Markov-modulated Poisson process","authors":"Armelle Guillou , Stéphane Loisel , Gilles Stupfler","doi":"10.1016/j.stamet.2015.04.003","DOIUrl":"10.1016/j.stamet.2015.04.003","url":null,"abstract":"<div><p>Motivated by seasonality and regime-switching features of some insurance claim counting processes, we study the statistical analysis of a Markov-modulated Poisson process featuring seasonality. We prove the strong consistency and the asymptotic normality<span> of a maximum split-time likelihood estimator of the parameters of this model, and present an algorithm to compute it in practice. The method is illustrated on a small simulation study and a real data analysis.</span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":"26 ","pages":"Pages 103-123"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.04.003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55092958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}