{"title":"Nonparametric estimation of isotropic covariance function","authors":"Yiming Wang, Sujit K. Ghosh","doi":"10.1080/10485252.2022.2146111","DOIUrl":"https://doi.org/10.1080/10485252.2022.2146111","url":null,"abstract":"A nonparametric model using a sequence of Bernstein polynomials is constructed to approximate arbitrary isotropic covariance functions valid in and related approximation properties are investigated using the popular norm and norms. A computationally efficient sieve maximum likelihood (sML) estimation is then developed to nonparametrically estimate the unknown isotropic covariance function valid in . Consistency of the proposed sieve ML estimator is established under increasing domain regime. The proposed methodology is compared numerically with couple of existing nonparametric as well as with commonly used parametric methods. Numerical results based on simulated data show that our approach outperforms the parametric methods in reducing bias due to model misspecification and also the nonparametric methods in terms of having significantly lower values of expected and norms. Application to precipitation data is illustrated to showcase a real case study. Additional technical details and numerical illustrations are also made available.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"19 1","pages":"198 - 237"},"PeriodicalIF":1.2,"publicationDate":"2022-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76341515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A high-dimensional test for the k-sample Behrens–Fisher problem","authors":"Daojiang He, Huijun Shi, Kai Xu, M. Cao","doi":"10.1080/10485252.2022.2147172","DOIUrl":"https://doi.org/10.1080/10485252.2022.2147172","url":null,"abstract":"In this paper, the problem of testing the equality of the mean vectors of k populations with possibly unknown and unequal covariance matrices is investigated in high-dimensional settings. The null distributions of most existing tests are asymptotically normal which inevitably imposes strong conditions on covariance matrices. However, we assume here only mild additional conditions on the proposed test, which offers much flexibility in practical applications. Additionally, the Welch–Satterthwaite -approximation we adopted can automatically mimic the shape of the null distribution of the proposed test statistic, while the normal approximation cannot achieve the adaptivity. Finally, an extensive simulation study shows that the proposed test has better performance on both size and power compared with existing methods.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"29 1","pages":"239 - 265"},"PeriodicalIF":1.2,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75115241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust oracle estimation and uncertainty quantification for possibly sparse quantiles","authors":"E. Belitser, P. Serra, Alexandra G. J. Vegelien","doi":"10.1080/10485252.2023.2226779","DOIUrl":"https://doi.org/10.1080/10485252.2023.2226779","url":null,"abstract":"A general many quantiles + noise model is studied in the robust formulation (allowing non-normal, non-independent observations), where the identifiability requirement for the noise is formulated in terms of quantiles rather than the traditional zero expectation assumption. We propose a penalization method based on the quantile loss function with appropriately chosen penalty function making inference on possibly sparse high-dimensional quantile vector. We apply a local approach to address the optimality by comparing procedures to the oracle sparsity structure. We establish that the proposed procedure mimics the oracle in the problems of estimation and uncertainty quantification (under the so called EBR condition). Adaptive minimax results over sparsity scale follow from our local results.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"458 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78296795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Minimax estimation in multi-task regression under low-rank structures","authors":"Kwan-Young Bak, J. Koo","doi":"10.1080/10485252.2022.2146110","DOIUrl":"https://doi.org/10.1080/10485252.2022.2146110","url":null,"abstract":"This study investigates the minimaxity of a multi-task nonparametric regression problem. We formulate a simultaneous function estimation problem based on information pooling across multiple experiments under a low-dimensional structure. A nonparametric reduced rank regression estimator based on the nuclear norm penalisation scheme is proposed to incorporate the low-dimensional structure in the estimation process. A rank of a set of functions is defined in terms of their Fourier coefficients to formally characterise the dependence structure among functions. Minimax upper and lower bounds are established under various asymptotic scenarios to examine the role of the low-rank structure in determining optimal rates of convergence. The results confirm that exploiting the low-rank structure can significantly improve the convergence rate for the simultaneous estimation of multiple functions. The results also imply that the proposed estimator is rate optimal in the minimax sense for the rank-constraint Sobolev class of vector-valued functions.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"57 1","pages":"122 - 144"},"PeriodicalIF":1.2,"publicationDate":"2022-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76856131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christophe Crambes, Chayma Daayeb, A. Gannoun, Yousri Henchiri
{"title":"Functional linear model with partially observed covariate and missing values in the response","authors":"Christophe Crambes, Chayma Daayeb, A. Gannoun, Yousri Henchiri","doi":"10.1080/10485252.2022.2142222","DOIUrl":"https://doi.org/10.1080/10485252.2022.2142222","url":null,"abstract":"Dealing with missing values is an important issue in data observation or data recording process. In this paper, we consider a functional linear regression model with partially observed covariate and missing values in the response. We use a reconstruction operator that aims at recovering the missing parts of the explanatory curves, then we are interested in regression imputation method of missing data on the response variable, using functional principal component regression to estimate the functional coefficient of the model. We study the asymptotic behaviour of the prediction error when missing data are replaced by the imputed values in the original dataset. The practical behaviour of the method is also studied on simulated data and a real dataset.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"36 1","pages":"172 - 197"},"PeriodicalIF":1.2,"publicationDate":"2022-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79794673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Inference on semi-parametric transformation model with a pairwise likelihood based on left-truncated and interval-censored data","authors":"Yichen Lou, Peijie Wang, Jianguo Sun","doi":"10.1080/10485252.2022.2138383","DOIUrl":"https://doi.org/10.1080/10485252.2022.2138383","url":null,"abstract":"Semi-parametric transformation models provide a general and flexible class of models for regression analysis of failure time data and many methods have been developed for their estimation. In particular, they include the proportional hazards and proportional odds models as special cases. In this paper, we discuss the situation where one observes left-truncated and interval-censored data, for which it does not seem to exist an established method. For the problem, in contrast to the commonly used conditional approach that may not be efficient, a pairwise pseudo-likelihood method is proposed to recover some missing information in the conditional method. The proposed estimators are proved to be consistent and asymptotically efficient and normal. A simulation study is conducted to assess the empirical performance of the method and suggests that it works well in practical situations. This method is illustrated by using a set of real data arising from an HIV/AIDS cohort study.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"122 1","pages":"38 - 55"},"PeriodicalIF":1.2,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79082730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Asymptotic and finite sample properties of Hill-type estimators in the presence of errors in observations","authors":"Mihyun Kim, P. Kokoszka","doi":"10.1080/10485252.2022.2136662","DOIUrl":"https://doi.org/10.1080/10485252.2022.2136662","url":null,"abstract":"We establish asymptotic and finite sample properties of the Hill and Harmonic Moment estimators applied to heavy-tailed data contaminated by errors. We formulate conditions on the errors and the number of upper order statistics under which these estimators continue to be asymptotically normal. We specify analogous conditions which must hold in finite samples for the confidence intervals derived from the asymptotic normal distribution to be reliable. In the large sample analysis, we specify conditions related to second-order regular variation and divergence rates for the number of upper order statistics, k, used to compute the estimators. In the finite sample analysis, we examine several data-driven methods of selecting k, and determine which of them are most suitable for confidence interval inference. The results of these investigations are applied to interarrival times of internet traffic anomalies, which are available only with a round-off error.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"76 1","pages":"1 - 18"},"PeriodicalIF":1.2,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91178412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Recursive kernel estimator in a semiparametric regression model","authors":"Emmanuel De Dieu Nkou","doi":"10.1080/10485252.2022.2130308","DOIUrl":"https://doi.org/10.1080/10485252.2022.2130308","url":null,"abstract":"Sliced inverse regression (SIR) is a recommended method to identify and estimate the central dimension reduction (CDR) subspace. CDR subspace is at the base to describe the conditional distribution of the response Y given a d-dimensional predictor vector X. To estimate this space, two versions are very popular: the slice version and the kernel version. A recursive method of the slice version has already been the subject of a systematic study. In this paper, we propose to study the kernel version. It's a recursive method based on a stochastic approximation algorithm of the kernel version. The asymptotic normality of the proposed estimator is also proved. A simulation study that not only shows the good numerical performance of the proposed estimate and which also allows to evaluate its performance with respect to existing methods is presented. A real dataset is also used to illustrate the approach.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"187 1","pages":"145 - 171"},"PeriodicalIF":1.2,"publicationDate":"2022-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75406537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Uniform convergence rates and automatic variable selection in nonparametric regression with functional and categorical covariates","authors":"Leonie Selk","doi":"10.1080/10485252.2023.2207673","DOIUrl":"https://doi.org/10.1080/10485252.2023.2207673","url":null,"abstract":"In Selk and Gertheiss (2022) a nonparametric prediction method for models with multiple functional and categorical covariates is introduced. The dependent variable can be categorical (binary or multi-class) or continuous, thus both classification and regression problems are considered. In the paper at hand the asymptotic properties of this method are developed. A uniform rate of convergence for the regression / classification estimator is given. Further it is shown that, asymptotically, a data-driven least squares cross-validation method can automatically remove irrelevant, noise variables.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"1 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2022-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88658073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Another look at halfspace depth: flag halfspaces with applications","authors":"Duvsan Pokorn'y, P. Laketa, Stanislav Nagy","doi":"10.1080/10485252.2023.2236721","DOIUrl":"https://doi.org/10.1080/10485252.2023.2236721","url":null,"abstract":"The halfspace depth is a well studied tool of nonparametric statistics in multivariate spaces, naturally inducing a multivariate generalisation of quantiles. The halfspace depth of a point with respect to a measure is defined as the infimum mass of closed halfspaces that contain the given point. In general, a closed halfspace that attains that infimum does not have to exist. We introduce a flag halfspace - an intermediary between a closed halfspace and its interior. We demonstrate that the halfspace depth can be equivalently formulated also in terms of flag halfspaces, and that there always exists a flag halfspace whose boundary passes through any given point $x$, and has mass exactly equal to the halfspace depth of $x$. Flag halfspaces allow us to derive theoretical results regarding the halfspace depth without the need to differentiate absolutely continuous measures from measures containing atoms, as was frequently done previously. The notion of flag halfspaces is used to state results on the dimensionality of the halfspace median set for random samples. We prove that under mild conditions, the dimension of the sample halfspace median set of $d$-variate data cannot be $d-1$, and that for $d=2$ the sample halfspace median set must be either a two-dimensional convex polygon, or a data point. The latter result guarantees that the computational algorithm for the sample halfspace median form the R package TukeyRegion is exact also in the case when the median set is less-than-full-dimensional in dimension $d=2$.","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"31 1","pages":""},"PeriodicalIF":1.2,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73149443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}