{"title":"Rank-based kernel estimation of the area under the ROC curve","authors":"Jingjing Yin, Yi Hao, Hani Samawi, Haresh Rochani","doi":"10.1016/j.stamet.2016.04.001","DOIUrl":"10.1016/j.stamet.2016.04.001","url":null,"abstract":"<div><p>In medical diagnostics, the ROC curve is the graph of sensitivity against 1-specificity as the diagnostic threshold runs through all possible values. The ROC curve and its associated summary indices are very useful for the evaluation of the discriminatory ability of biomarkers/diagnostic tests with continuous measurements. Among all summary indices, the area under the ROC curve (AUC) is the most popular diagnostic accuracy index, which has been extensively used by researchers for biomarker evaluation and selection. Sometimes, taking the actual measurements of a biomarker is difficult and expensive, whereas ranking them without actual measurements can be easy. In such cases, ranked set sampling based on judgment order statistics would provide more representative samples yielding more accurate estimation. In this study, Gaussian kernel is utilized to obtain a nonparametric estimate of the AUC. Asymptotic properties<span><span> of the AUC estimates are derived based on the theory of U-statistics. Intensive simulation is conducted to compare the estimates using ranked set samples versus </span>simple random samples. The simulation and theoretical derivation indicate that ranked set sampling is generally preferred with smaller variances and mean squared errors (MSE). The proposed method is illustrated via a real data analysis.</span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.04.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chantal Larose , Ofer Harel , Katarzyna Kordas , Dipak K. Dey
{"title":"Latent class analysis of incomplete data via an entropy-based criterion","authors":"Chantal Larose , Ofer Harel , Katarzyna Kordas , Dipak K. Dey","doi":"10.1016/j.stamet.2016.04.004","DOIUrl":"https://doi.org/10.1016/j.stamet.2016.04.004","url":null,"abstract":"<div><p><span>Latent class analysis is used to group categorical data into classes via a probability model. </span>Model selection criteria<span> then judge how well the model fits the data. When addressing incomplete data, the current methodology restricts the imputation to a single, pre-specified number of classes. We seek to develop an entropy-based model selection criterion that does not restrict the imputation to one number of clusters. Simulations show the new criterion performing well against the current standards of AIC<span> and BIC, while a family studies application demonstrates how the criterion provides more detailed and useful results than AIC and BIC.</span></span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.04.004","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137212272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bayesian optimal cluster designs","authors":"Satya Prakash Singh, Siuli Mukhopadhyay","doi":"10.1016/j.stamet.2016.02.002","DOIUrl":"10.1016/j.stamet.2016.02.002","url":null,"abstract":"<div><p><span><span>Designing cluster trials depends on the knowledge of the intracluster correlation coefficient. To overcome the issue of parameter dependence, </span>Bayesian designs are proposed for two level models with and without </span>covariates. These designs minimize the variance of the treatment contrast under certain cost constraints. A pseudo Bayesian design approach is advocated that integrates and averages the objective function over a prior distribution of the intracluster correlation coefficient. Theoretical results on the Bayesian criterion are noted when the intracluster correlation follows a uniform distribution. Two data sets based on educational surveys conducted in schools are used to illustrate the proposed methodology.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.02.002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A note on domains of attraction of the limit laws of intermediate order statistics under power normalization","authors":"H.M. Barakat, A.R. Omar","doi":"10.1016/j.stamet.2016.01.001","DOIUrl":"10.1016/j.stamet.2016.01.001","url":null,"abstract":"<div><p><span>In this paper we compare the domains of attraction of limit laws of intermediate order statistics under power normalization with those of limit laws of intermediate order statistics under linear normalization. As a result of this comparison, we obtain necessary and sufficient conditions for a </span>univariate distribution function to belong to the domain of attraction for each of the possible limit laws of intermediate order statistics under power normalization.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.01.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A new skew integer valued time series process","authors":"Marcelo Bourguignon , Klaus L.P. Vasconcellos","doi":"10.1016/j.stamet.2016.01.002","DOIUrl":"10.1016/j.stamet.2016.01.002","url":null,"abstract":"<div><p><span>In this paper, we introduce a stationary first-order integer-valued autoregressive process with geometric–Poisson marginals. The new process allows negative values for the series. Several properties of the process are established. The unknown parameters of the model are estimated using the Yule–Walker method and the </span>asymptotic properties of the estimator are considered. Some numerical results of the estimators are presented with a brief discussion. Possible application of the process is discussed through a real data example.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.01.002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ehssan Ghashim , Éric Marchand , William E. Strawderman
{"title":"On a better lower bound for the frequentist probability of coverage of Bayesian credible intervals in restricted parameter spaces","authors":"Ehssan Ghashim , Éric Marchand , William E. Strawderman","doi":"10.1016/j.stamet.2016.01.006","DOIUrl":"10.1016/j.stamet.2016.01.006","url":null,"abstract":"<div><p><span>For estimating a lower restricted parametric function in the framework of Marchand and Strawderman (2006), we show how </span><span><math><mrow><mo>(</mo><mn>1</mn><mo>−</mo><mi>α</mi><mo>)</mo></mrow><mo>×</mo><mn>100</mn><mi>%</mi></math></span><span><span> Bayesian credible intervals can be constructed so that the </span>frequentist<span> probability of coverage is no less than </span></span><span><math><mn>1</mn><mo>−</mo><mfrac><mrow><mn>3</mn><mi>α</mi></mrow><mrow><mn>2</mn></mrow></mfrac></math></span>. As in Marchand and Strawderman (2013), the findings are achieved through the specification of the <em>spending function</em> of the Bayes credible interval and apply to an “equal-tails” modification of the HPD procedure among others. Our results require a logconcave assumption for the distribution of a pivot, and apply to estimating a lower bounded normal mean with known variance, and to further examples include lower bounded scale parameters from Gamma, Weibull, and Fisher distributions, with the latter also applicable to random effects analysis of variance.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.01.006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On testing local hypotheses via local divergence","authors":"G. Avlogiaris , A. Micheas , K. Zografos","doi":"10.1016/j.stamet.2016.01.003","DOIUrl":"10.1016/j.stamet.2016.01.003","url":null,"abstract":"<div><p>The aim of this paper is to propose procedures that test statistical hypotheses locally, that is, assess the validity of a model in a specific domain of the data. In this context, the one and two sample problems will be discussed. The proposed tests are based on local divergences which are defined in such a way as to quantify the divergence between probability distributions locally, in a specific area of the joint domain of the underlined models. The theoretical results are exemplified using simulations and two real datasets.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.01.003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cathy W.S. Chen , Mike K.P. So , Jessica C. Li , Songsak Sriboonchitta
{"title":"Autoregressive conditional negative binomial model applied to over-dispersed time series of counts","authors":"Cathy W.S. Chen , Mike K.P. So , Jessica C. Li , Songsak Sriboonchitta","doi":"10.1016/j.stamet.2016.02.001","DOIUrl":"10.1016/j.stamet.2016.02.001","url":null,"abstract":"<div><p><span><span><span>Integer-valued time series analysis offers various applications in biomedical, financial, and environmental research. However, existing works usually assume no or constant over-dispersion. In this paper, we propose a new model for time series of counts, the autoregressive conditional </span>negative binomial model that has a time-varying conditional autoregressive mean function and heteroskedasticity. The location and scale parameters of the </span>negative binomial distribution are flexible in the proposed set-up, inducing dynamic over-dispersion. We adopt </span>Bayesian<span><span> methods with a Markov chain Monte Carlo sampling scheme to estimate model parameters and utilize deviance information criterion for model comparison. We conduct simulations to investigate the estimation performance of this sampling scheme for the proposed negative binomial model. To demonstrate the proposed approach in modelling time-varying over-dispersion, we consider two types of criminal incidents recorded by New South Wales (NSW) Police Force in Australia. We also fit the autoregressive conditional </span>Poisson model to these two datasets. Our results demonstrate that the proposed negative binomial model is preferable to the Poisson model.</span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.02.001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Estimation of the stress–strength reliability for the two-parameter bathtub-shaped lifetime distribution based on upper record values","authors":"Bahman Tarvirdizade, Mohammad Ahmadpour","doi":"10.1016/j.stamet.2016.01.005","DOIUrl":"10.1016/j.stamet.2016.01.005","url":null,"abstract":"<div><p>In this paper, the estimation of the stress–strength reliability <span><math><mo>Pr</mo><mrow><mo>(</mo><mi>X</mi><mo>></mo><mi>Y</mi><mo>)</mo></mrow></math></span> based on upper record values is considered when <span><math><mi>X</mi></math></span> and <span><math><mi>Y</mi></math></span><span><span><span> are independent random variables from a two-parameter bathtub-shaped lifetime distribution with the same shape but different scale parameters. The </span>maximum likelihood estimator<span><span> (MLE), the approximate Bayes estimator and the </span>exact confidence intervals of stress–strength reliability are obtained when the shape parameter is known. When the shape parameter is unknown, we obtain the MLE, the </span></span>asymptotic confidence interval<span> and some bootstrap confidence intervals<span> of stress–strength reliability. In this case, we also apply the Gibbs sampling technique to study the Bayesian estimation of stress–strength reliability and the corresponding credible interval. A Monte Carlo simulation study is conducted to investigate and compare the performance of the different proposed methods in this paper. Finally, analysis of a real data set is presented for illustrative purposes.</span></span></span></p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2016.01.005","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A family of association measures for 2×2 contingency tables based on the ϕ-divergence","authors":"Michael Espendiller, Maria Kateri","doi":"10.1016/j.stamet.2015.12.002","DOIUrl":"10.1016/j.stamet.2015.12.002","url":null,"abstract":"<div><p>The odds ratio is the predominant measure of association in <span><math><mn>2</mn><mo>×</mo><mn>2</mn></math></span> contingency tables, which, for inferential purposes, is usually considered on the log-scale. Under an information theoretic set-up, it is connected to the Kullback–Leibler divergence. Considering a generalized family of divergences, the <span><math><mi>ϕ</mi></math></span> divergence, alternative association measures are derived for <span><math><mn>2</mn><mo>×</mo><mn>2</mn></math></span><span> contingency tables. Their properties are studied and asymptotic inference is developed. For some members of this family, the estimated association measures remain finite in the presence of a sampling zero while for a subset of these members the estimators of these measures have finite variance as well. Special attention is given to the power divergence, which is a parametric family. The role of its parameter </span><span><math><mi>λ</mi></math></span><span><span><span>, in terms of the asymptotic confidence intervals’ coverage probability and average relative length, is further discussed. In special </span>probability table structures, for which the performance of the </span>asymptotic confidence intervals for the classical log odds ratio is poor, the measure corresponding to </span><span><math><mi>λ</mi><mo>=</mo><mn>1</mn><mo>/</mo><mn>3</mn></math></span> is suggested as an alternative.</p></div>","PeriodicalId":48877,"journal":{"name":"Statistical Methodology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.stamet.2015.12.002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"55093214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}