{"title":"Gram-Charlier A Series Based Extended Rule-of-Thumb for Bandwidth Selection in Univariate Kernel Density Estimation","authors":"B. Dharmani","doi":"10.17713/ajs.v51i3.1204","DOIUrl":null,"url":null,"abstract":"Bandwidth parameter estimation in univariate Kernel Density Estimation has traditionally two approaches. Rule(s)-of-Thumb (ROT) achieve ‘quick and dirty’ estimations with some specific assumption for an unknown density. More accurate solve-the-equation-plug-in (STEPI) rules have almost no direct assumption for the unknown density but demand high computation. This article derives a balancing third approach. Extending an assumption of Gaussianity for the unknown density to be estimated in \\textit{normal reference} ROT (NRROT) to near Gaussianity, and then expressing the density using Gram-Charlier A (GCA) series to minimize the asymptotic mean integrated square error, it derives GCA series based Extended ROT (GCAExROT). The performance analysis using the simulated and the real datasets suggests to replace NRROT by a modified GCAExROT rule achieving a balancing performance by accuracy nearer to STEPI rules at computation nearer to NRROT, specifically at small samples.","PeriodicalId":51761,"journal":{"name":"Austrian Journal of Statistics","volume":"250 1","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Austrian Journal of Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17713/ajs.v51i3.1204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 2
Abstract
Bandwidth parameter estimation in univariate Kernel Density Estimation has traditionally two approaches. Rule(s)-of-Thumb (ROT) achieve ‘quick and dirty’ estimations with some specific assumption for an unknown density. More accurate solve-the-equation-plug-in (STEPI) rules have almost no direct assumption for the unknown density but demand high computation. This article derives a balancing third approach. Extending an assumption of Gaussianity for the unknown density to be estimated in \textit{normal reference} ROT (NRROT) to near Gaussianity, and then expressing the density using Gram-Charlier A (GCA) series to minimize the asymptotic mean integrated square error, it derives GCA series based Extended ROT (GCAExROT). The performance analysis using the simulated and the real datasets suggests to replace NRROT by a modified GCAExROT rule achieving a balancing performance by accuracy nearer to STEPI rules at computation nearer to NRROT, specifically at small samples.
单变量核密度估计中的带宽参数估计传统上有两种方法。经验法则(ROT)通过对未知密度的某些特定假设实现了“快速而肮脏”的估计。更精确的方程插入式求解(STEPI)规则几乎没有对未知密度的直接假设,但计算量大。本文提出了第三种平衡方法。将正\textit{态参考}ROT (NRROT)中待估计的未知密度的高斯性假设推广到接近高斯性,然后用Gram-Charlier A (GCA)级数表示密度,使渐近平均积分平方误差最小,导出了基于GCA级数的扩展ROT (GCAExROT)。使用模拟数据集和真实数据集进行性能分析,建议用改进的GCAExROT规则代替NRROT,在计算更接近NRROT的情况下,特别是在小样本情况下,通过更接近STEPI规则的精度来实现平衡性能。
期刊介绍:
The Austrian Journal of Statistics is an open-access journal (without any fees) with a long history and is published approximately quarterly by the Austrian Statistical Society. Its general objective is to promote and extend the use of statistical methods in all kind of theoretical and applied disciplines. The Austrian Journal of Statistics is indexed in many data bases, such as Scopus (by Elsevier), Web of Science - ESCI by Clarivate Analytics (formely Thompson & Reuters), DOAJ, Scimago, and many more. The current estimated impact factor (via Publish or Perish) is 0.775, see HERE, or even more indices HERE. Austrian Journal of Statistics ISNN number is 1026597X Original papers and review articles in English will be published in the Austrian Journal of Statistics if judged consistently with these general aims. All papers will be refereed. Special topics sections will appear from time to time. Each section will have as a theme a specialized area of statistical application, theory, or methodology. Technical notes or problems for considerations under Shorter Communications are also invited. A special section is reserved for book reviews.