Comparing Robust Versions of Distance Covariance: A Comment on the Biloop Approach

IF 1.8 3区数学 Q1 STATISTICS & PROBABILITY

International Statistical Review Pub Date : 2026-04-07 Epub Date: 2025-10-02 DOI:10.1111/insr.70012

Dominic Edelmann

{"title":"Comparing Robust Versions of Distance Covariance: A Comment on the Biloop Approach","authors":"Dominic Edelmann","doi":"10.1111/insr.70012","DOIUrl":null,"url":null,"abstract":"I commend the authors for their significant contributions to the field of distance correlation presented in this paper. This work marks the first thorough investigation of the robustness properties of distance correlation, distance covariance, and distance standard deviation in terms of influence functions and breakdown points. These robustness aspects have previously caused considerable confusion in the literature, and this work will serve as an important and clarifying reference. Additionally, the authors introduce a novel version of distance covariance based on an innovative biloop transformation. Thanks to its redescending influence function, this version of distance covariance is robust to bivariate outliers and shows promise for real-world applications.However, the paper does not fully capture the extensive literature on robust distance covariance measures. Criticism regarding the lack of robustness in classical distance covariance can be traced back to the 2009 discussion paper of Székely and Rizzo, in which Bruno Rémillard (2009) identifies the moment assumption as a weakness of distance covariance and suggests a rank transformation as a remedy. In the same discussion, Gretton et al. (2009) give a brief historic overview and present several robust distance covariance type statistics (Kankainen, 1995; Feuerverger, 1993).In this comment, I offer a concise, albeit non-comprehensive, overview of robust versions of distance covariance. Moreover, I extend the simulation studies presented in the main paper, giving further insights into the properties of the newly proposed biloop distance covariance.The literature offers at least three different approaches to defining extensions of distance covariance. Each approach gives rise to dependence measures with positive breakdown points.The simulations presented below extend those in the main paper to shed further light on the properties of biloop distance covariance. Since the results of the normal score distance covariance were very similar to the rank distance covariance, the former method has been omitted. Instead the distance covariance \n<math>\n <semantics>\n <mrow>\n <mspace></mspace>\n <msub>\n <mtext>dCov</mtext>\n <mi>d</mi>\n </msub>\n </mrow>\n <annotation>$$ \\kern0.1em {\\mathrm{dCov}}_d $$</annotation>\n </semantics></math> in Equation (2) employing the RBF distance (Equation 2) is included; this method is denoted as RBF distance covariance in the following. The hyperparameter \n<math>\n <semantics>\n <mrow>\n <mi>c</mi>\n </mrow>\n <annotation>$$ c $$</annotation>\n </semantics></math> for the biloop distance covariance was selected as in the main paper, while the bandwidth for the RBF distance covariance was set using the median heuristic (Fukumizu et al., 2009). A wider range of sample sizes (\n<math>\n <semantics>\n <mrow>\n <mi>n</mi>\n <mo>=</mo>\n <mn>50</mn>\n <mo>,</mo>\n <mo> </mo>\n <mn>100</mn>\n <mo>,</mo>\n <mo> </mo>\n <mn>200</mn>\n <mo>,</mo>\n <mo> </mo>\n <mn>400</mn>\n <mo>,</mo>\n <mspace></mspace>\n <mn>800</mn>\n <mo>,</mo>\n <mo> </mo>\n <mn>1</mn>\n <mo> </mo>\n <mn>600</mn>\n <mo>,</mo>\n <mspace></mspace>\n <mn>3</mn>\n <mo> </mo>\n <mn>200</mn>\n </mrow>\n <annotation>$$ n&amp;#x0003D;50,100,200,400,\\kern0.5em 800,1\\ 600,\\kern0.5em 3\\ 200 $$</annotation>\n </semantics></math>) than in the main paper was examined in each scenario. The bivariate data \n<math>\n <semantics>\n <mrow>\n <msup>\n <mfenced>\n <msub>\n <mi>X</mi>\n <mn>1</mn>\n </msub>\n <msub>\n <mi>Y</mi>\n <mn>1</mn>\n </msub>\n </mfenced>\n <mi>t</mi>\n </msup>\n <mo>,</mo>\n <msup>\n <mfenced>\n <msub>\n <mi>X</mi>\n <mn>2</mn>\n </msub>\n <msub>\n <mi>Y</mi>\n <mn>2</mn>\n </msub>\n </mfenced>\n <mi>t</mi>\n </msup>\n <mo>,</mo>\n <mo>…</mo>\n <mo>,</mo>\n <msup>\n <mfenced>\n <msub>\n <mi>X</mi>\n <mi>n</mi>\n </msub>\n <msub>\n <mi>Y</mi>\n <mi>n</mi>\n </msub>\n </mfenced>\n <mi>t</mi>\n </msup>\n </mrow>\n <annotation>$$ {\\left({X}_1,{Y}_1\\right)}&amp;#x0005E;t,{\\left({X}_2,{Y}_2\\right)}&amp;#x0005E;t,\\dots, {\\left({X}_n,{Y}_n\\right)}&amp;#x0005E;t $$</annotation>\n </semantics></math> are always independently generated. I focus on the problem of independence testing. The tests are based on \n<math>\n <semantics>\n <mrow>\n <mi>K</mi>\n <mo>=</mo>\n <mn>500</mn>\n </mrow>\n <annotation>$$ K&amp;#x0003D;500 $$</annotation>\n </semantics></math> permutations, empirical rejection rates are determined from \n<math>\n <semantics>\n <mrow>\n <msub>\n <mi>N</mi>\n <mi>sim</mi>\n </msub>\n <mo>=</mo>\n <mn>1</mn>\n <mo> </mo>\n <mn>000</mn>\n </mrow>\n <annotation>$$ {N}_{sim}&amp;#x0003D;1\\ 000 $$</annotation>\n </semantics></math> simulations and a nominal level of \n<math>\n <semantics>\n <mrow>\n <mi>α</mi>\n <mo>=</mo>\n <mn>0.05</mn>\n </mrow>\n <annotation>$$ \\alpha &amp;#x0003D;0.05 $$</annotation>\n </semantics></math> is used. All simulations were performed using the R package dcortools available on the Comprehensive R Archive Network (CRAN).From my perspective, distance covariance methods hold great promise for the analysis of high-dimensional data. When testing a large number of bivariate dependencies, distance covariance provides a computationally efficient omnibus independence test—crucial in settings where visual inspection is impossible and where classical correlation measures may miss nonlinear relationships. Likewise, because outliers can lurk undetected in high dimensions, it is often advisable to employ a robust variant that automatically down-weights or excludes extreme observations.Several robust variants of distance covariance have been proposed; among them, the biloop distance correlation stands out because its redescending influence function substantially diminishes the effect of any observation whose marginal distance is far from the median. As demonstrated in this comment and the main paper, this property makes the biloop method robust to outliers, even when these outliers induce a monotone dependence. In scenarios with only univariate outliers or bivariate outliers that do not induce monotone dependence, its performance closely matches that of rank-based distance covariance. A possible caveat is that, by construction, the biloop distance covariance is virtually blind to certain very strong dependence patterns (see Figure 3). Thus, while the method is promising, we must carefully weigh its robustness benefits against potential losses in power for specific dependence structures. Targeted simulations and theoretical analyses are needed to precisely quantify this trade-off.","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"94 1","pages":"26-34"},"PeriodicalIF":1.8000,"publicationDate":"2026-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/insr.70012","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Statistical Review","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/insr.70012","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/10/2 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 0

Abstract

I commend the authors for their significant contributions to the field of distance correlation presented in this paper. This work marks the first thorough investigation of the robustness properties of distance correlation, distance covariance, and distance standard deviation in terms of influence functions and breakdown points. These robustness aspects have previously caused considerable confusion in the literature, and this work will serve as an important and clarifying reference. Additionally, the authors introduce a novel version of distance covariance based on an innovative biloop transformation. Thanks to its redescending influence function, this version of distance covariance is robust to bivariate outliers and shows promise for real-world applications.

However, the paper does not fully capture the extensive literature on robust distance covariance measures. Criticism regarding the lack of robustness in classical distance covariance can be traced back to the 2009 discussion paper of Székely and Rizzo, in which Bruno Rémillard (2009) identifies the moment assumption as a weakness of distance covariance and suggests a rank transformation as a remedy. In the same discussion, Gretton et al. (2009) give a brief historic overview and present several robust distance covariance type statistics (Kankainen, 1995; Feuerverger, 1993).

In this comment, I offer a concise, albeit non-comprehensive, overview of robust versions of distance covariance. Moreover, I extend the simulation studies presented in the main paper, giving further insights into the properties of the newly proposed biloop distance covariance.

The literature offers at least three different approaches to defining extensions of distance covariance. Each approach gives rise to dependence measures with positive breakdown points.

The simulations presented below extend those in the main paper to shed further light on the properties of biloop distance covariance. Since the results of the normal score distance covariance were very similar to the rank distance covariance, the former method has been omitted. Instead the distance covariance ${dCov}_{d}$ in Equation (2) employing the RBF distance (Equation 2) is included; this method is denoted as RBF distance covariance in the following. The hyperparameter $c$ for the biloop distance covariance was selected as in the main paper, while the bandwidth for the RBF distance covariance was set using the median heuristic (Fukumizu et al., 2009). A wider range of sample sizes ( $n = 50, 100, 200, 400, 800, 1 600, 3 200$ ) than in the main paper was examined in each scenario. The bivariate data ${(X_{1}, Y_{1})}^{t}, {(X_{2}, Y_{2})}^{t}, \dots, {(X_{n}, Y_{n})}^{t}$ are always independently generated. I focus on the problem of independence testing. The tests are based on $K = 500$ permutations, empirical rejection rates are determined from $N_{sim} = 1 000$ simulations and a nominal level of $α = 0.05$ is used. All simulations were performed using the R package dcortools available on the Comprehensive R Archive Network (CRAN).

From my perspective, distance covariance methods hold great promise for the analysis of high-dimensional data. When testing a large number of bivariate dependencies, distance covariance provides a computationally efficient omnibus independence test—crucial in settings where visual inspection is impossible and where classical correlation measures may miss nonlinear relationships. Likewise, because outliers can lurk undetected in high dimensions, it is often advisable to employ a robust variant that automatically down-weights or excludes extreme observations.

Several robust variants of distance covariance have been proposed; among them, the biloop distance correlation stands out because its redescending influence function substantially diminishes the effect of any observation whose marginal distance is far from the median. As demonstrated in this comment and the main paper, this property makes the biloop method robust to outliers, even when these outliers induce a monotone dependence. In scenarios with only univariate outliers or bivariate outliers that do not induce monotone dependence, its performance closely matches that of rank-based distance covariance. A possible caveat is that, by construction, the biloop distance covariance is virtually blind to certain very strong dependence patterns (see Figure 3). Thus, while the method is promising, we must carefully weigh its robustness benefits against potential losses in power for specific dependence structures. Targeted simulations and theoretical analyses are needed to precisely quantify this trade-off.

Abstract Image

查看原文本刊更多论文

比较距离协方差的鲁棒版本：对Biloop方法的评论

我赞扬作者在本文中对距离相关领域的重大贡献。这项工作标志着距离相关、距离协方差和距离标准差在影响函数和崩溃点方面的鲁棒性的第一次彻底调查。这些稳健性方面以前在文献中引起了相当大的混乱，这项工作将作为一个重要的和澄清参考。此外，作者还介绍了一种基于创新的双环变换的距离协方差的新版本。由于其重降影响函数，该版本的距离协方差对二元异常值具有鲁棒性，并有望用于实际应用。然而，本文并没有完全捕捉到关于鲁棒距离协方差测量的广泛文献。关于经典距离协方差缺乏稳健性的批评可以追溯到szsamkely和Rizzo在2009年的讨论论文中，Bruno r<s:1> millard（2009）认为矩假设是距离协方差的一个弱点，并建议用秩变换作为补救措施。在同样的讨论中，Gretton等人（2009）给出了一个简短的历史概述，并提出了几个稳健的距离协方差型统计（Kankainen, 1995; Feuerverger, 1993）。在这篇评论中，我提供了一个简洁的，尽管不全面的，距离协方差鲁棒版本的概述。此外，我扩展了在主论文中提出的仿真研究，进一步深入了解新提出的双环距离协方差的性质。文献提供了至少三种不同的方法来定义距离协方差的扩展。每种方法都会产生具有正击穿点的依赖度量。下面给出的模拟扩展了主论文中的模拟，进一步阐明了双环距离协方差的性质。由于正态得分距离协方差的结果与秩距离协方差非常相似，故省略了前一种方法。代入式(2)中采用RBF距离（式2）的距离协方差dCov d $$ \kern0.1em {\mathrm{dCov}}_d $$；该方法表示为RBF距离协方差。双环距离协方差的超参数c $$ c $$与主文中一样，而RBF距离协方差的带宽使用中值启发式设置（Fukumizu et al., 2009）。更大的样本量范围(n = 50、100、200、400、800、1 600, 3 200 $$ n&amp;#x0003D;50,100,200,400,\kern0.5em 800,1\ 600,\kern0.5em 3\ 200 $$)比在主要论文中审查了每个场景。二元数据X 1y 1t，X 2y 2t，…，X n Y nt $$ {\left({X}_1,{Y}_1\right)}&amp;#x0005E;t,{\left({X}_2,{Y}_2\right)}&amp;#x0005E;t,\dots, {\left({X}_n,{Y}_n\right)}&amp;#x0005E;t $$总是独立生成的。我关注的是独立性测试的问题。测试是基于K = 500 $$ K&amp;#x0003D;500 $$排列，经验拒绝率由N sim = 1 000 $$ {N}_{sim}&amp;#x0003D;1\ 000 $$模拟和名义水平α = 0.05确定使用$$ \alpha &amp;#x0003D;0.05 $$。所有的模拟都是使用综合R档案网络（CRAN）上提供的R软件包工具进行的。从我的角度来看，距离协方差方法对高维数据的分析有很大的希望。当测试大量的二元相关性时，距离协方差提供了一种计算效率高的综合独立性测试-在不可能进行视觉检查和经典相关度量可能错过非线性关系的情况下至关重要。同样，由于异常值可能在高维中潜伏而未被发现，因此通常建议使用一个健壮的变量来自动降低权重或排除极端观察值。已经提出了距离协方差的几个鲁棒变体；其中，双环距离相关尤为突出，因为它的重降影响函数大大降低了任何边缘距离远离中位数的观测值的影响。正如在这篇评论和主要论文中所证明的那样，这个性质使得双环方法对异常值具有鲁棒性，即使这些异常值引起单调依赖。在只有单变量异常值或双变量异常值且不产生单调依赖性的情况下，其性能与基于秩的距离协方差非常接近。一个可能的警告是，通过构造，双环距离协方差实际上对某些非常强的依赖模式是不可见的（参见图3）。因此，虽然该方法很有前途，但我们必须仔细权衡其鲁棒性优势与特定依赖结构的潜在功率损失。需要有针对性的模拟和理论分析来精确地量化这种权衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Statistical Review 数学-统计学与概率论

CiteScore

4.30

自引率

5.00%

发文量

审稿时长

>12 weeks

期刊介绍： International Statistical Review is the flagship journal of the International Statistical Institute (ISI) and of its family of Associations. It publishes papers of broad and general interest in statistics and probability. The term Review is to be interpreted broadly. The types of papers that are suitable for publication include (but are not limited to) the following: reviews/surveys of significant developments in theory, methodology, statistical computing and graphics, statistical education, and application areas; tutorials on important topics; expository papers on emerging areas of research or application; papers describing new developments and/or challenges in relevant areas; papers addressing foundational issues; papers on the history of statistics and probability; white papers on topics of importance to the profession or society; and historical assessment of seminal papers in the field and their impact.