José R. Berrendero, Alejandro Cholaquidis, Antonio Cuevas
{"title":"On the functional regression model and its finite-dimensional approximations","authors":"José R. Berrendero, Alejandro Cholaquidis, Antonio Cuevas","doi":"10.1007/s00362-024-01567-9","DOIUrl":null,"url":null,"abstract":"<p>The problem of linearly predicting a scalar response <i>Y</i> from a functional (random) explanatory variable <span>\\(X=X(t),\\ t\\in I\\)</span> is considered. It is argued that the term “linearly” can be interpreted in several meaningful ways. Thus, one could interpret that (up to a random noise) <i>Y</i> could be expressed as a linear combination of a finite family of marginals <span>\\(X(t_i)\\)</span> of the process <i>X</i>, or a limit of a sequence of such linear combinations. This simple point of view (which has some precedents in the literature) leads to a formulation of the linear model in terms of the RKHS space generated by the covariance function of the process <i>X</i>(<i>t</i>). It turns out that such RKHS-based formulation includes the standard functional linear model, based on the inner product in the space <span>\\(L^2[0,1]\\)</span>, as a particular case. It includes as well all models in which <i>Y</i> is assumed to be (up to an additive noise) a linear combination of a finite number of linear projections of <i>X</i>. Some consistency results are proved which, in particular, lead to an asymptotic approximation of the predictions derived from the general (functional) linear model in terms of finite-dimensional models based on a finite family of marginals <span>\\(X(t_i)\\)</span>, for an increasing grid of points <span>\\(t_j\\)</span> in <i>I</i>. We also include a discussion on the crucial notion of coefficient of determination (aimed at assessing the fit of the model) in this setting. A few experimental results are given.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"18 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Papers","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s00362-024-01567-9","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
The problem of linearly predicting a scalar response Y from a functional (random) explanatory variable \(X=X(t),\ t\in I\) is considered. It is argued that the term “linearly” can be interpreted in several meaningful ways. Thus, one could interpret that (up to a random noise) Y could be expressed as a linear combination of a finite family of marginals \(X(t_i)\) of the process X, or a limit of a sequence of such linear combinations. This simple point of view (which has some precedents in the literature) leads to a formulation of the linear model in terms of the RKHS space generated by the covariance function of the process X(t). It turns out that such RKHS-based formulation includes the standard functional linear model, based on the inner product in the space \(L^2[0,1]\), as a particular case. It includes as well all models in which Y is assumed to be (up to an additive noise) a linear combination of a finite number of linear projections of X. Some consistency results are proved which, in particular, lead to an asymptotic approximation of the predictions derived from the general (functional) linear model in terms of finite-dimensional models based on a finite family of marginals \(X(t_i)\), for an increasing grid of points \(t_j\) in I. We also include a discussion on the crucial notion of coefficient of determination (aimed at assessing the fit of the model) in this setting. A few experimental results are given.
研究考虑了从函数(随机)解释变量 \(X=X(t),\t\in I\) 线性预测标量响应 Y 的问题。有人认为,"线性 "一词可以有几种有意义的解释。因此,我们可以将 Y 解释为过程 X 的边际值 \(X(t_i)\)的有限族的线性组合,或者是这种线性组合序列的极限。这种简单的观点(在文献中已有先例)导致了线性模型的表述,即由过程 X(t) 的协方差函数生成的 RKHS 空间。事实证明,这种基于 RKHS 的表述包括标准函数线性模型,它基于空间 \(L^2[0,1]\)中的内积,是一种特殊情况。它还包括所有假定 Y 是 X 的有限数量线性投影的线性组合(直到加性噪声)的模型。我们还讨论了在这种情况下决定系数的关键概念(旨在评估模型的拟合度)。文中给出了一些实验结果。
期刊介绍:
The journal Statistical Papers addresses itself to all persons and organizations that have to deal with statistical methods in their own field of work. It attempts to provide a forum for the presentation and critical assessment of statistical methods, in particular for the discussion of their methodological foundations as well as their potential applications. Methods that have broad applications will be preferred. However, special attention is given to those statistical methods which are relevant to the economic and social sciences. In addition to original research papers, readers will find survey articles, short notes, reports on statistical software, problem section, and book reviews.