Nonparametric estimation via partial derivatives.

IF 3.1 1区 数学 Q1 STATISTICS & PROBABILITY
Xiaowu Dai
{"title":"Nonparametric estimation via partial derivatives.","authors":"Xiaowu Dai","doi":"10.1093/jrsssb/qkae093","DOIUrl":null,"url":null,"abstract":"<p><p>Traditional nonparametric estimation methods often lead to a slow convergence rate in large dimensions and require unrealistically large dataset sizes for reliable conclusions. We develop an approach based on partial derivatives, either observed or estimated, to effectively estimate the function at near-parametric convergence rates. This novel approach and computational algorithm could lead to methods useful to practitioners in many areas of science and engineering. Our theoretical results reveal behaviour universal to this class of nonparametric estimation problems. We explore a general setting involving tensor product spaces and build upon the smoothing spline analysis of variance framework. For <i>d</i>-dimensional models under full interaction, the optimal rates with gradient information on <i>p</i> covariates are identical to those for the <math><mo>(</mo> <mi>d</mi> <mo>-</mo> <mi>p</mi> <mo>)</mo></math> -interaction models without gradients and, therefore, the models are immune to the <i>curse of interaction</i>. For additive models, the optimal rates using gradient information are <math><msqrt><mi>n</mi></msqrt> </math> , thus achieving the <i>parametric rate</i>. We demonstrate aspects of the theoretical results through synthetic and real data applications.</p>","PeriodicalId":49982,"journal":{"name":"Journal of the Royal Statistical Society Series B-Statistical Methodology","volume":"87 2","pages":"319-336"},"PeriodicalIF":3.1000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11985098/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Royal Statistical Society Series B-Statistical Methodology","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/jrsssb/qkae093","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

Abstract

Traditional nonparametric estimation methods often lead to a slow convergence rate in large dimensions and require unrealistically large dataset sizes for reliable conclusions. We develop an approach based on partial derivatives, either observed or estimated, to effectively estimate the function at near-parametric convergence rates. This novel approach and computational algorithm could lead to methods useful to practitioners in many areas of science and engineering. Our theoretical results reveal behaviour universal to this class of nonparametric estimation problems. We explore a general setting involving tensor product spaces and build upon the smoothing spline analysis of variance framework. For d-dimensional models under full interaction, the optimal rates with gradient information on p covariates are identical to those for the ( d - p ) -interaction models without gradients and, therefore, the models are immune to the curse of interaction. For additive models, the optimal rates using gradient information are n , thus achieving the parametric rate. We demonstrate aspects of the theoretical results through synthetic and real data applications.

通过偏导数的非参数估计。
传统的非参数估计方法往往导致在大维度上收敛速度慢,并且需要不切实际的大数据集才能得到可靠的结论。我们开发了一种基于偏导数的方法,无论是观测的还是估计的,以有效地估计函数在近参数收敛率。这种新颖的方法和计算算法可以为许多科学和工程领域的实践者提供有用的方法。我们的理论结果揭示了这类非参数估计问题的普遍行为。我们探索了一个涉及张量积空间的一般设置,并建立在平滑样条方差分析框架的基础上。对于充分相互作用的d维模型,具有p协变量梯度信息的最优速率与没有梯度的(d- p) -相互作用模型的最优速率相同,因此模型不受相互作用的诅咒。对于加性模型,使用梯度信息的最优速率为n,从而实现参数速率。我们通过综合和实际数据应用来论证理论结果的各个方面。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.80
自引率
0.00%
发文量
83
审稿时长
>12 weeks
期刊介绍: Series B (Statistical Methodology) aims to publish high quality papers on the methodological aspects of statistics and data science more broadly. The objective of papers should be to contribute to the understanding of statistical methodology and/or to develop and improve statistical methods; any mathematical theory should be directed towards these aims. The kinds of contribution considered include descriptions of new methods of collecting or analysing data, with the underlying theory, an indication of the scope of application and preferably a real example. Also considered are comparisons, critical evaluations and new applications of existing methods, contributions to probability theory which have a clear practical bearing (including the formulation and analysis of stochastic models), statistical computation or simulation where original methodology is involved and original contributions to the foundations of statistical science. Reviews of methodological techniques are also considered. A paper, even if correct and well presented, is likely to be rejected if it only presents straightforward special cases of previously published work, if it is of mathematical interest only, if it is too long in relation to the importance of the new material that it contains or if it is dominated by computations or simulations of a routine nature.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信