Shrijita Bhattacharya, Francois Kamper, J. Beirlant
{"title":"Outlier detection based on extreme value theory and applications","authors":"Shrijita Bhattacharya, Francois Kamper, J. Beirlant","doi":"10.1111/sjos.12665","DOIUrl":"https://doi.org/10.1111/sjos.12665","url":null,"abstract":"Whether an extreme observation is an outlier or not depends strongly on the corresponding tail behavior of the underlying distribution. We develop an automatic, data‐driven method rooted in the mathematical theory of extremes to identify observations that deviate from the intermediate and central characteristics. The proposed algorithm is an extension of a method previously proposed in the literature for the specific case of heavy tailed Pareto‐type distributions to all max‐domains of attraction. We propose some applications such as a tail‐adjusted boxplot which yields a more accurate representation of possible outliers, and the identification of outliers in a multivariate context through an analysis of associated random variables such as local outlier factors. Several examples and simulation results illustrate the finite sample behavior of the algorithm and its applications.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":"50 1","pages":"1466 - 1502"},"PeriodicalIF":1.0,"publicationDate":"2023-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46062864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Nonparametric adaptive estimation for Interacting particle systems","authors":"F. Comte, V. Genon-Catalot","doi":"10.1111/sjos.12661","DOIUrl":"https://doi.org/10.1111/sjos.12661","url":null,"abstract":". We consider a stochastic system of N interacting particles with constant di(cid:27)usion coe(cid:30)cient and drift linear in space, time-depending on two unknown deterministic functions. Our concern here is the nonparametric estimation of these functions from a continuous observation of the process on [0 , T ] for (cid:28)xed T and large N . We de(cid:28)ne two collections of projection estimators belonging to (cid:28)nite-dimensional subspaces of L 2 ([0 , T ]) . We study the L 2 -risks of these estimators, where the risk is de(cid:28)ned either by the expectation of an empirical norm or by the expectation of a deterministic norm. Afterwards, we propose a data-driven choice of the dimensions and study the risk of the adaptive estimators. The results are illustrated by numerical experiments on simulated data.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48367198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A robust model averaging approach for partially linear models with responses missing at random","authors":"Zhongqi Liang, Qihua Wang","doi":"10.1111/sjos.12659","DOIUrl":"https://doi.org/10.1111/sjos.12659","url":null,"abstract":"In this paper, with an assumed parametric model for the selection probability function, a robust model averaging estimation method is proposed for partially linear models with responses missing at random. The method is based on a weighted Mallows‐type criterion. The method is robust in the sense that the asymptotic optimality holds true as long as the true model of the selection probability function is some measurable function of its assumed model. The optimal weight vector for model averaging is obtained by minimizing the weighted Mallows‐type criterion. It is shown that the robust model averaging method achieves the lowest possible squared error asymptotically. Some simulation studies were conducted to evaluate the proposed method. An application to two real examples are provided as illustration.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49286763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Errata for “A framework for covariate balance using Bregman distances”","authors":"","doi":"10.1111/sjos.12657","DOIUrl":"https://doi.org/10.1111/sjos.12657","url":null,"abstract":"This aligns with efficiency bound targeted in the proof within the online supplement. Second, in equation (26), there is an errant qi included into the right-hand side of the second constraint that should be removed. Finally, the description of the hdCBPS in Section 5.2 requires clarification. The itemized entry should instead state “An augmented version of CBPS that extends (34) by using regularized regression techniques to find debiased estimates of the potential outcome means.” The new wording better reflects the hdCBPS method versus the original description.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49009937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient\u0000 t\u0000 0\u0000 ‐year risk regression using the logistic model","authors":"T. Martinussen, T. Scheike","doi":"10.1111/sjos.12658","DOIUrl":"https://doi.org/10.1111/sjos.12658","url":null,"abstract":"In some clinical studies patient survival beyond a specific point in time, t0$$ {t}_0 $$ , say, may be of special interest as it may for instance indicate patient cure. To analyze the t0$$ {t}_0 $$ ‐year risk for such patients may be accomplished using logistic regression with appropriate weights (IPWCC) that may further be augmented (AIPWCC) to improve efficiency. In this paper, we derive the most efficient estimator for this problem, which is different from the AIPWCC based on the full data efficient influence function. We first give the result for a survival endpoint and then generalize to the competing risk setting. The proposed estimators superior behavior is illustrated using simulations as well as applying it to some real data concerning the survival of blood and marrow transplanted patients.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.0,"publicationDate":"2023-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45028261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}