Johan Medrano, Abderrahmane Kheddar, Annick Lesne, Sofiane Ramdani
{"title":"Radius selection using kernel density estimation for the computation of nonlinear measures","authors":"Johan Medrano, Abderrahmane Kheddar, Annick Lesne, Sofiane Ramdani","doi":"arxiv-2401.03891","DOIUrl":null,"url":null,"abstract":"When nonlinear measures are estimated from sampled temporal signals with\nfinite-length, a radius parameter must be carefully selected to avoid a poor\nestimation. These measures are generally derived from the correlation integral\nwhich quantifies the probability of finding neighbors, i.e. pair of points\nspaced by less than the radius parameter. While each nonlinear measure comes\nwith several specific empirical rules to select a radius value, we provide a\nsystematic selection method. We show that the optimal radius for nonlinear\nmeasures can be approximated by the optimal bandwidth of a Kernel Density\nEstimator (KDE) related to the correlation sum. The KDE framework provides\nnon-parametric tools to approximate a density function from finite samples\n(e.g. histograms) and optimal methods to select a smoothing parameter, the\nbandwidth (e.g. bin width in histograms). We use results from KDE to derive a\nclosed-form expression for the optimal radius. The latter is used to compute\nthe correlation dimension and to construct recurrence plots yielding an\nestimate of Kolmogorov-Sinai entropy. We assess our method through numerical\nexperiments on signals generated by nonlinear systems and experimental\nelectroencephalographic time series.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":"254 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2401.03891","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
When nonlinear measures are estimated from sampled temporal signals with
finite-length, a radius parameter must be carefully selected to avoid a poor
estimation. These measures are generally derived from the correlation integral
which quantifies the probability of finding neighbors, i.e. pair of points
spaced by less than the radius parameter. While each nonlinear measure comes
with several specific empirical rules to select a radius value, we provide a
systematic selection method. We show that the optimal radius for nonlinear
measures can be approximated by the optimal bandwidth of a Kernel Density
Estimator (KDE) related to the correlation sum. The KDE framework provides
non-parametric tools to approximate a density function from finite samples
(e.g. histograms) and optimal methods to select a smoothing parameter, the
bandwidth (e.g. bin width in histograms). We use results from KDE to derive a
closed-form expression for the optimal radius. The latter is used to compute
the correlation dimension and to construct recurrence plots yielding an
estimate of Kolmogorov-Sinai entropy. We assess our method through numerical
experiments on signals generated by nonlinear systems and experimental
electroencephalographic time series.
从无限长的采样时间信号中估计非线性度量时,必须仔细选择半径参数,以避免估计结果不佳。这些度量通常由相关积分推导而来,相关积分量化了找到邻近点(即间距小于半径参数的点对)的概率。虽然每种非线性度量都有几种特定的经验规则来选择半径值,但我们提供了一种系统的选择方法。我们证明,非线性度量的最佳半径可以用与相关性总和相关的核密度估计器(KDE)的最佳带宽来近似。KDE 框架提供了从有限样本(如直方图)近似密度函数的非参数工具,以及选择平滑参数--带宽(如直方图中的二进制宽度)的最优方法。我们利用 KDE 的结果推导出最优半径的封闭式表达式。后者用于计算相关维度和构建递归图,从而得出柯尔莫哥洛夫-西奈熵的估计值。我们通过对非线性系统产生的信号和脑电图时间序列进行数值实验来评估我们的方法。