{"title":"Non-asymptotic robustness analysis of regression depth median","authors":"Yijun Zuo","doi":"10.1016/j.jmva.2023.105247","DOIUrl":null,"url":null,"abstract":"<div><p>The maximum depth estimator (aka depth median) (<span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span>) induced from regression depth (RD) of Rousseeuw and Hubert (1999) is one of the most prevailing estimators in regression. It possesses outstanding robustness similar to the univariate location counterpart. Indeed, <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> can, asymptotically, resist up to 33% contamination without breakdown, in contrast to the 0% for the traditional (least squares and least absolute deviations) estimators (see Van Aelst and Rousseeuw (2000)). The results from Van Aelst and Rousseeuw (2000) are pioneering, yet they are limited to regression-symmetric populations (with a strictly positive density), the <span><math><mi>ϵ</mi></math></span>-contamination, maximum-bias model, and in asymptotical sense. With a fixed finite-sample size practice, the most prevailing measure of robustness for estimators is the finite-sample breakdown point (FSBP) (Donoho and Huber, 1983). Despite many attempts made in the literature, only sporadic partial results on FSBP for <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> were obtained whereas an exact FSBP for <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> remained open in the last twenty-plus years. Furthermore, is the asymptotic breakdown value <span><math><mrow><mn>1</mn><mo>/</mo><mn>3</mn></mrow></math></span> (the limit of an increasing sequence of finite-sample breakdown values) relevant in the finite-sample practice? (Or what is the difference between the finite-sample and the limit breakdown values?). Such discussions are yet to be given in the literature. This article addresses the above issues, revealing an intrinsic connection between the regression depth of <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> and the newly obtained exact FSBP. It justifies the employment of <span><math><msubsup><mrow><mi>β</mi></mrow><mrow><mi>R</mi><mi>D</mi></mrow><mrow><mo>∗</mo></mrow></msubsup></math></span> as a robust alternative to the traditional estimators and demonstrates the necessity and the merit of using the FSBP in finite-sample real practice.</p></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"199 ","pages":"Article 105247"},"PeriodicalIF":1.4000,"publicationDate":"2023-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0047259X23000933/pdfft?md5=41b0163d4b47acc16c5399dda63160ea&pid=1-s2.0-S0047259X23000933-main.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Multivariate Analysis","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0047259X23000933","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 1
Abstract
The maximum depth estimator (aka depth median) () induced from regression depth (RD) of Rousseeuw and Hubert (1999) is one of the most prevailing estimators in regression. It possesses outstanding robustness similar to the univariate location counterpart. Indeed, can, asymptotically, resist up to 33% contamination without breakdown, in contrast to the 0% for the traditional (least squares and least absolute deviations) estimators (see Van Aelst and Rousseeuw (2000)). The results from Van Aelst and Rousseeuw (2000) are pioneering, yet they are limited to regression-symmetric populations (with a strictly positive density), the -contamination, maximum-bias model, and in asymptotical sense. With a fixed finite-sample size practice, the most prevailing measure of robustness for estimators is the finite-sample breakdown point (FSBP) (Donoho and Huber, 1983). Despite many attempts made in the literature, only sporadic partial results on FSBP for were obtained whereas an exact FSBP for remained open in the last twenty-plus years. Furthermore, is the asymptotic breakdown value (the limit of an increasing sequence of finite-sample breakdown values) relevant in the finite-sample practice? (Or what is the difference between the finite-sample and the limit breakdown values?). Such discussions are yet to be given in the literature. This article addresses the above issues, revealing an intrinsic connection between the regression depth of and the newly obtained exact FSBP. It justifies the employment of as a robust alternative to the traditional estimators and demonstrates the necessity and the merit of using the FSBP in finite-sample real practice.
期刊介绍:
Founded in 1971, the Journal of Multivariate Analysis (JMVA) is the central venue for the publication of new, relevant methodology and particularly innovative applications pertaining to the analysis and interpretation of multidimensional data.
The journal welcomes contributions to all aspects of multivariate data analysis and modeling, including cluster analysis, discriminant analysis, factor analysis, and multidimensional continuous or discrete distribution theory. Topics of current interest include, but are not limited to, inferential aspects of
Copula modeling
Functional data analysis
Graphical modeling
High-dimensional data analysis
Image analysis
Multivariate extreme-value theory
Sparse modeling
Spatial statistics.