Australian & New Zealand Journal of Statistics最新文献

筛选
英文 中文
Telling Stories with Data: With Application in R. By Rohan Alexander. CRC Press. 2023. 622 pages. AU$129.60 (hardback). ISBN: 978-1-0321-3477-2. 用数据讲故事:在r语言中的应用CRC出版社。2023。622页。非盟(精装)129.60美元。ISBN: 978-1-0321-3477-2。
IF 0.8 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-09-30 DOI: 10.1111/anzs.12428
Emi Tanaka
{"title":"Telling Stories with Data: With Application in R. By Rohan Alexander. CRC Press. 2023. 622 pages. AU$129.60 (hardback). ISBN: 978-1-0321-3477-2.","authors":"Emi Tanaka","doi":"10.1111/anzs.12428","DOIUrl":"https://doi.org/10.1111/anzs.12428","url":null,"abstract":"","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 4","pages":"467-470"},"PeriodicalIF":0.8,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142869185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Full Bayesian analysis of triple seasonal autoregressive models 三季节自回归模型的全贝叶斯分析
IF 0.8 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-09-27 DOI: 10.1111/anzs.12427
Ayman A. Amin
{"title":"Full Bayesian analysis of triple seasonal autoregressive models","authors":"Ayman A. Amin","doi":"10.1111/anzs.12427","DOIUrl":"https://doi.org/10.1111/anzs.12427","url":null,"abstract":"<div>\u0000 \u0000 <p>Seasonal autoregressive (SAR) time series models have been extended to fit time series exhibiting multiple seasonalities. However, hardly any research in Bayesian literature has been done on modelling multiple seasonalities. In this article, we propose a full Bayesian analysis of triple SAR (TSAR) models for time series with triple seasonality, considering identification, estimation and prediction for these TSAR models. In this Bayesian analysis of TSAR models, we assume the model errors to be normally distributed and the model order to be a random variable with a known maximum value, and we employ the g prior for the model coefficients and variance. Accordingly, we first derive the posterior mass function of the TSAR order in closed form, which then enables us to identify the best order of TSAR model as the order value with the highest posterior probability. In addition, we derive the conditional posteriors to be a multivariate normal for the TSAR coefficients and to be an inverse gamma for the TSAR variance; also, we derive the conditional predictive distribution to be a multivariate normal for future observations. Since these derived conditional distributions are in closed forms, we introduce the Gibbs sampler to present the Bayesian analysis of TSAR models and to easily produce multiple-step-ahead predictions. Using <span>Julia</span> programming language, we conduct an extensive simulation study, aiming to evaluate the accuracy of our proposed full Bayesian analysis for TSAR models. In addition, we apply our work on time series to hourly electricity load in some European countries.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 4","pages":"389-416"},"PeriodicalIF":0.8,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142869120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Examining collinearities 检查共线性
IF 0.8 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-08-29 DOI: 10.1111/anzs.12425
Zillur R. Shabuz, Paul H. Garthwaite
{"title":"Examining collinearities","authors":"Zillur R. Shabuz,&nbsp;Paul H. Garthwaite","doi":"10.1111/anzs.12425","DOIUrl":"10.1111/anzs.12425","url":null,"abstract":"<div>\u0000 \u0000 <p>The cos-max method is a little-known method of identifying collinearities. It is based on the cos-max transformation, which makes minimal adjustment to a set of vectors to create orthogonal components with a one-to-one correspondence between the original vectors and the components. The aim of the transformation is that each vector should be close to the orthogonal component with which it is paired. Vectors involved in a collinearity must be adjusted substantially in order to create orthogonal components, while other vectors will typically be adjusted far less. The cos-max method uses the size of adjustments to identify collinearities. It gives a coherent relationship between collinear sets of variables and variance inflation factors (VIFs) and identifies collinear sets using more information than traditional methods. In this paper we describe these features of the method and examine its performance in examples, comparing it with alternative methods. In each example, the collinearities identified by the cos-max method only contained variables with high VIFs and contained all variables with high VIFs. The collinearities identified by other methods did not have such a close link to VIFs. Also, the collinearities identified by the cos-max method were as simple as or simpler than those given by other methods, with less overlap between collinearities in the variables that they contained.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 3","pages":"367-388"},"PeriodicalIF":0.8,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exact samples sizes for clinical trials subject to size and power constraints 受规模和功率限制的临床试验的精确样本量
IF 0.8 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-08-29 DOI: 10.1111/anzs.12424
Chris J. Lloyd
{"title":"Exact samples sizes for clinical trials subject to size and power constraints","authors":"Chris J. Lloyd","doi":"10.1111/anzs.12424","DOIUrl":"10.1111/anzs.12424","url":null,"abstract":"<p>This paper first describes the difficulties in providing the required sample sizes for clinical trials that guarantee type 1 and type 2 error control. The required sample sizes obviously depend on the test employed, and in this study we use the so-called <i>E</i>-test, which is known to have extremely favourable size properties and higher power than alternatives. To compute exact powers for this test in real time is not currently feasible, so a corpus of pre-computed exact powers (and sizes) was created, covering sample sizes up to 500. When there are no solutions within the corpus, a novel extrapolation technique is used. Exact size can be computed after the sample sizes have been extracted; however, for the <i>E</i>-test the exact size is virtually always very close to the nominal target. All the code has been converted into an <span>R-package</span>, which is available on CRAN and illustrated.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 3","pages":"297-305"},"PeriodicalIF":0.8,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12424","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian analysis of multivariate mixed longitudinal ordinal and continuous data 多变量混合纵向序数和连续数据的贝叶斯分析
IF 0.8 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-08-13 DOI: 10.1111/anzs.12421
Xiao Zhang
{"title":"Bayesian analysis of multivariate mixed longitudinal ordinal and continuous data","authors":"Xiao Zhang","doi":"10.1111/anzs.12421","DOIUrl":"10.1111/anzs.12421","url":null,"abstract":"<p>Multivariate longitudinal ordinal and continuous data exist in many scientific fields. However, it is a rigorous task to jointly analyse them due to the complicated correlated structures of those mixed data and the lack of a multivariate distribution. The multivariate probit model, assuming there is a multivariate normal latent variable for each multivariate ordinal data, becomes a natural modeling choice for longitudinal ordinal data especially for jointly analysing with longitudinal continuous data. However, the identifiable multivariate probit model requires the variances of the latent normal variables to be fixed at 1, thus the joint covariance matrix of the latent variables and the continuous multivariate normal variables is restricted at some of the diagonal elements. This constrains to develop both the classical and Bayesian methods to analyse mixed ordinal and continuous data. In this investigation, we proposed three Markov chain Monte Carlo (MCMC) methods: Metropolis–Hastings within Gibbs algorithm based on the identifiable model, and a Gibbs sampling algorithm and parameter-expanded data augmentation based on the constructed non-identifiable model. Through simulation studies and a real data application, we illustrated the performance of these three methods and provided an observation of using non-identifiable model to develop MCMC sampling methods.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 3","pages":"325-346"},"PeriodicalIF":0.8,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12421","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributional modelling of positively skewed data via the flexible Weibull extension distribution 通过灵活的威布尔扩展分布建立正倾斜数据的分布模型
IF 0.8 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-08-11 DOI: 10.1111/anzs.12423
Freddy Hernández-Barajas, Olga Usuga-Manco, Carmen Patino-Rodríguez, Fernando Marmolejo-Ramos
{"title":"Distributional modelling of positively skewed data via the flexible Weibull extension distribution","authors":"Freddy Hernández-Barajas,&nbsp;Olga Usuga-Manco,&nbsp;Carmen Patino-Rodríguez,&nbsp;Fernando Marmolejo-Ramos","doi":"10.1111/anzs.12423","DOIUrl":"10.1111/anzs.12423","url":null,"abstract":"<p>The time until an event occurs is often known to have a skewed distribution. To model this, a statistical distribution called the two-parameter flexible Weibull extension (FWE) has been proposed. In this paper, the FWE distribution is used to model datasets through the use of generalised additive models for location, scale and shape (GAMLSS) distributional regression. GAMLSS is the only regression technique that can examine the effects of both categorical and numeric predictors on all the parameters of the distribution used to fit the dependent variable. To make it easier to use the FWE distribution through GAMLSS, the <span>RelDists</span> R package is proposed. A simulation study shows that FWE modelling through GAMLSS provides reliable parameter estimates even in the presence of factors that affect the distribution.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 3","pages":"306-324"},"PeriodicalIF":0.8,"publicationDate":"2024-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12423","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141935626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spline linear mixed-effects models for causal mediation analysis with longitudinal data 用于纵向数据因果中介分析的样条线性混合效应模型
IF 0.8 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-07-26 DOI: 10.1111/anzs.12422
Jeffrey M. Albert, Hongxu Zhu, Tanujit Dey, Jiayang Sun, Wojbor A. Woyczynski, Gregory Powers, Meeyoung Min
{"title":"Spline linear mixed-effects models for causal mediation analysis with longitudinal data","authors":"Jeffrey M. Albert,&nbsp;Hongxu Zhu,&nbsp;Tanujit Dey,&nbsp;Jiayang Sun,&nbsp;Wojbor A. Woyczynski,&nbsp;Gregory Powers,&nbsp;Meeyoung Min","doi":"10.1111/anzs.12422","DOIUrl":"10.1111/anzs.12422","url":null,"abstract":"<div>\u0000 \u0000 <p>Often, causal mediation analysis is of interest when both the mediator and the final outcome are repeatedly measured, but limited work has been done for this situation (as opposed to where only the mediator is repeatedly measured). Available methods are primarily based on parametric models and tend to be sensitive to model assumptions. This article presents semiparametric, continuous-time models to provide a flexible and robust approach to causal mediation analysis for longitudinal data, which allows these data to be unbalanced or irregular. Specifically, the method uses spline linear mixed-effects models for the mediator and for the final outcome, with a two-step approach to model-fitting in which a predicted mediator is used as a covariate in the final outcome model. The models allow flexible functions for both the mean and individual response functions for each outcome. We derive estimated natural direct and indirect effects as a function of time using an extended mediation formula and sequential ignorability assumption. In simulation studies, we compare properties of estimated direct and indirect effects, and a delta method estimate of the standard error of the latter, under alternative approaches for predicting the mediator. The approach is illustrated using harmonised data from two cohort studies to examine attention as a mediator of the effect of prenatal tobacco exposure on externalising behaviour in children.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 3","pages":"347-366"},"PeriodicalIF":0.8,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new robust covariance matrix estimation for high-dimensional microbiome data 用于高维微生物组数据的新型鲁棒协方差矩阵估算法
IF 1.1 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-05-28 DOI: 10.1111/anzs.12415
Jiyang Wang, Wanfeng Liang, Lijie Li, Yue Wu, Xiaoyan Ma
{"title":"A new robust covariance matrix estimation for high-dimensional microbiome data","authors":"Jiyang Wang,&nbsp;Wanfeng Liang,&nbsp;Lijie Li,&nbsp;Yue Wu,&nbsp;Xiaoyan Ma","doi":"10.1111/anzs.12415","DOIUrl":"10.1111/anzs.12415","url":null,"abstract":"&lt;div&gt;\u0000 \u0000 &lt;p&gt;Microbiome data typically lie in a high-dimensional simplex. One of the key questions in metagenomic analysis is to exploit the covariance structure for this kind of data. In this paper, a framework called approximate-estimate-threshold (AET) is developed for the robust basis covariance estimation for high-dimensional microbiome data. To be specific, we first construct a proxy matrix &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;Γ&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ boldsymbol{Gamma} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;, which is almost indistinguishable from the real basis covariance matrix &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;∑&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ boldsymbol{Sigma} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;. Then, any estimator &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mover&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;Γ&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mo&gt;^&lt;/mo&gt;\u0000 &lt;/mover&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ hat{boldsymbol{Gamma}} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; satisfying some conditions can be used to estimate &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;Γ&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ boldsymbol{Gamma} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;. Finally, we impose a thresholding step on &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mover&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;Γ&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mo&gt;^&lt;/mo&gt;\u0000 &lt;/mover&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ hat{boldsymbol{Gamma}} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; to obtain the final estimator &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mover&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;∑&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mo&gt;^&lt;/mo&gt;\u0000 &lt;/mover&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ hat{boldsymbol{Sigma}} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;. In particular, this paper applies a Huber-type estimator &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mover&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;Γ&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mo&gt;^&lt;/mo&gt;\u0000 &lt;/mover&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ hat{boldsymbol{Gamma}} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;, and achieves robustness by only requiring the boundedness of 2+&lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;ϵ&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;a","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 2","pages":"281-295"},"PeriodicalIF":1.1,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141190779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing multiple dispersion effects from unreplicated order-of-addition experiments 从不可重复的加阶实验中测试多重分散效应
IF 1.1 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-05-23 DOI: 10.1111/anzs.12416
Shin-Fu Tsai, Shan-Syue He
{"title":"Testing multiple dispersion effects from unreplicated order-of-addition experiments","authors":"Shin-Fu Tsai,&nbsp;Shan-Syue He","doi":"10.1111/anzs.12416","DOIUrl":"10.1111/anzs.12416","url":null,"abstract":"<p>Optimal addition orders of several components can be determined systematically to address order-of-addition problems when active location and dispersion effects are both taken into account. Based on the concept of fiducial generalised pivotal quantities, a new testing procedure is proposed in this paper to identify active dispersion effects from unreplicated order-of-addition experiments. Because the proposed method is free of all nuisance parameters indexed by the requirement set, it is capable of testing multiple dispersion effects. Simulation results show that the proposed method can maintain the empirical sizes close to the nominal level. A paint viscosity study is used to show that the proposed method can be practical. In addition, testable requirement sets are characterised when an order-of-addition orthogonal array is used to design an experiment.</p>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 2","pages":"228-248"},"PeriodicalIF":1.1,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/anzs.12416","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141104106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A calibrated data-driven approach for small area estimation using big data 利用大数据进行小面积估算的校准数据驱动方法
IF 1.1 4区 数学
Australian & New Zealand Journal of Statistics Pub Date : 2024-05-14 DOI: 10.1111/anzs.12414
Siu-Ming Tam, Shaila Sharmeen
{"title":"A calibrated data-driven approach for small area estimation using big data","authors":"Siu-Ming Tam,&nbsp;Shaila Sharmeen","doi":"10.1111/anzs.12414","DOIUrl":"10.1111/anzs.12414","url":null,"abstract":"<div>\u0000 \u0000 <p>Where the response variable in a big dataset is consistent with the variable of interest for small area estimation, the big data by itself can provide the estimates for small areas. These estimates are often subject to the coverage and measurement error bias inherited from the big data. However, if a probability survey of the same variable of interest is available, the survey data can be used as a training dataset to develop an algorithm to impute for the data missed by the big data and adjust for measurement errors. In this paper, we outline a methodology for such imputations based on an <i>k</i>-nearest neighbours (kNN) algorithm calibrated to an asymptotically design-unbiased estimate of the national total, and illustrate the use of a training dataset to estimate the imputation bias and the “fixed-<i>k</i> asymptotic” bootstrap to estimate the variance of the small area hybrid estimator. We illustrate the methodology of this paper using a public-use dataset and use it to compare the accuracy and precision of our hybrid estimator with the Fay–Harriot (FH) estimator. Finally, we also examine numerically the accuracy and precision of the FH estimator when the auxiliary variables used in the linking models are subject to undercoverage errors.</p>\u0000 </div>","PeriodicalId":55428,"journal":{"name":"Australian & New Zealand Journal of Statistics","volume":"66 2","pages":"125-145"},"PeriodicalIF":1.1,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信