MetrikaPub Date : 2024-03-05DOI: 10.1007/s00184-024-00956-2
Andrea C. Garcia-Angulo, Gerda Claeskens
{"title":"Bootstrap for inference after model selection and model averaging for likelihood models","authors":"Andrea C. Garcia-Angulo, Gerda Claeskens","doi":"10.1007/s00184-024-00956-2","DOIUrl":"https://doi.org/10.1007/s00184-024-00956-2","url":null,"abstract":"<p>A one-step semiparametric bootstrap procedure is constructed to estimate the distribution of estimators after model selection and of model averaging estimators with data-dependent weights. The method is generally applicable to non-normal models. Misspecification is allowed for all candidate parametric models. The semiparametric bootstrap estimator is shown to be consistent within specific regions such that the good and the bad candidate models are separated. Simulation studies exemplify that the bootstrap procedure leads to short confidence intervals with a good coverage.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"156 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140043940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MetrikaPub Date : 2024-02-26DOI: 10.1007/s00184-024-00954-4
{"title":"A multivariate Jacobi polynomials regression estimator associated with an ANOVA decomposition model","authors":"","doi":"10.1007/s00184-024-00954-4","DOIUrl":"https://doi.org/10.1007/s00184-024-00954-4","url":null,"abstract":"<h3>Abstract</h3> <p>In this work, we construct a stable and fairly fast estimator for solving multidimensional non-parametric regression problems. The proposed estimator is based on the use of a novel and special system of multivariate Jacobi polynomials that generate a basis for a reduced size of <span> <span>(d-)</span> </span>variate finite dimensional polynomials space. An ANOVA decomposition trick has been used for building this space. Also, by using some results from the theory of positive definite random matrices, we show that the proposed estimator is stable under the condition that the i.i.d. <span> <span>(d-)</span> </span>dimensional random sampling training points follow a <span> <span>(d-)</span> </span>dimensional Beta distribution. In addition, we provide the reader with an estimate for the <span> <span>(L^2-)</span> </span>risk error of the estimator. This risk error depends on the <span> <span>(L^2-)</span> </span>error of the orthogonal projection error of the regression function over the considered polynomials space. An involved study of this orthogonal projection error is done under the condition that the regression function belongs to a given weighted Sobolev space. Thanks to this novel estimate of the orthogonal projection error, we give the optimal convergence rate of our estimator. Furthermore, we give a regularized extension version of our estimator, that is capable of handling random sampling training vectors drawn according to an unknown multivariate pdf. Moreover, we derive an upper bound for the empirical risk error of this regularized estimator. Finally, we give some numerical simulations that illustrate the various theoretical results of this work. In particular, we provide simulations on a real data that compares the performance of our estimator with some existing and popular NP regression estimators.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"27 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139969273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MetrikaPub Date : 2024-02-23DOI: 10.1007/s00184-024-00952-6
Tsz Chai Fung
{"title":"Robust estimation and diagnostic of generalized linear model for insurance losses: a weighted likelihood approach","authors":"Tsz Chai Fung","doi":"10.1007/s00184-024-00952-6","DOIUrl":"https://doi.org/10.1007/s00184-024-00952-6","url":null,"abstract":"<p>This paper presents a score-based weighted likelihood estimator (SWLE) for robust estimations of the generalized linear model (GLM) for insurance loss data. The SWLE exhibits a limited sensitivity to the outliers, theoretically justifying its robustness against model contaminations. Also, with the specially designed weight function to effectively diminish the contributions of extreme losses to the GLM parameter estimations, most statistical quantities can still be derived analytically, minimizing the computational burden for parameter calibrations. Apart from robust estimations, the SWLE can also act as a quantitative diagnostic tool to detect outliers and systematic model misspecifications. Motivated by the coverage modifications which make insurance losses often random censored and truncated, the SWLE is extended to accommodate censored and truncated data. We exemplify the SWLE on three simulation studies and two real insurance datasets. Empirical results suggest that the SWLE produces more reliable parameter estimates than the MLE if outliers contaminate the dataset. The SWLE diagnostic tool also successfully detects any systematic model misspecifications with high power, accompanying some potential model improvements.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"107 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139950315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MetrikaPub Date : 2024-02-21DOI: 10.1007/s00184-024-00950-8
{"title":"Bayesian composite $$L^p$$ -quantile regression","authors":"","doi":"10.1007/s00184-024-00950-8","DOIUrl":"https://doi.org/10.1007/s00184-024-00950-8","url":null,"abstract":"<h3>Abstract</h3> <p><span> <span>(L^p)</span> </span>-quantiles are a class of generalized quantiles defined as minimizers of an asymmetric power function. They include both quantiles, <span> <span>(p=1)</span> </span>, and expectiles, <span> <span>(p=2)</span> </span>, as special cases. This paper studies composite <span> <span>(L^p)</span> </span>-quantile regression, simultaneously extending single <span> <span>(L^p)</span> </span>-quantile regression and composite quantile regression. A Bayesian approach is considered, where a novel parameterization of the skewed exponential power distribution is utilized. Further, a Laplace prior on the regression coefficients allows for variable selection. Through a Monte Carlo study and applications to empirical data, the proposed method is shown to outperform Bayesian composite quantile regression in most aspects.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"77 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139921337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MetrikaPub Date : 2024-02-18DOI: 10.1007/s00184-024-00949-1
Yuri S. Maluf, Silvia L. P. Ferrari, Francisco F. Queiroz
{"title":"Robust beta regression through the logit transformation","authors":"Yuri S. Maluf, Silvia L. P. Ferrari, Francisco F. Queiroz","doi":"10.1007/s00184-024-00949-1","DOIUrl":"https://doi.org/10.1007/s00184-024-00949-1","url":null,"abstract":"<p>Beta regression models are employed to model continuous response variables in the unit interval, like rates, percentages, or proportions. Their applications rise in several areas, such as medicine, environment research, finance, and natural sciences. The maximum likelihood estimation is widely used to make inferences for the parameters. Nonetheless, it is well-known that the maximum likelihood-based inference suffers from the lack of robustness in the presence of outliers. Such a case can bring severe bias and misleading conclusions. Recently, robust estimators for beta regression models were presented in the literature. However, these estimators require non-trivial restrictions in the parameter space, which limit their application. This paper develops new robust estimators that overcome this drawback. Their asymptotic and robustness properties are studied, and robust Wald-type tests are introduced. Simulation results evidence the merits of the new robust estimators. Inference and diagnostics using the new estimators are illustrated in an application to health insurance coverage data. The new R package robustbetareg is introduced.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"7 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139921323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MetrikaPub Date : 2024-02-17DOI: 10.1007/s00184-024-00946-4
Othmane Kortbi
{"title":"On Bayesian predictive density estimation for skew-normal distributions","authors":"Othmane Kortbi","doi":"10.1007/s00184-024-00946-4","DOIUrl":"https://doi.org/10.1007/s00184-024-00946-4","url":null,"abstract":"<p>This paper is concerned with prediction for skew-normal models, and more specifically the Bayes estimation of a predictive density for <span>(Y left. right| mu sim {mathcal {S}} {mathcal {N}}_p (mu , v_y I_p, lambda ))</span> under Kullback–Leibler loss, based on <span>(X left. right| mu sim {mathcal {S}} {mathcal {N}}_p (mu , v_x I_p, lambda ))</span> with known dependence and skewness parameters. We obtain representations for Bayes predictive densities, including the minimum risk equivariant predictive density <span>(hat{p}_{pi _{o}})</span> which is a Bayes predictive density with respect to the noninformative prior <span>(pi _0equiv 1)</span>. George et al. (Ann Stat 34:78–91, 2006) used the parallel between the problem of point estimation and the problem of estimation of predictive densities to establish a connection between the difference of risks of the two problems. The development of similar connection, allows us to determine sufficient conditions of dominance over <span>(hat{p}_{pi _{o}})</span> and of minimaxity. First, we show that <span>(hat{p}_{pi _{o}})</span> is a minimax predictive density under KL risk for the skew-normal model. After this, for dimensions <span>(pge 3)</span>, we obtain classes of Bayesian minimax densities that improve <span>(hat{p}_{pi _{o}})</span> under KL loss, for the subclass of skew-normal distributions with small value of skewness parameter. Moreover, for dimensions <span>(pge 4)</span>, we obtain classes of Bayesian minimax densities that improve <span>(hat{p}_{pi _{o}})</span> under KL loss, for the whole class of skew-normal distributions. Examples of proper priors, including generalized student priors, generating Bayesian minimax densities that improve <span>(hat{p}_{pi _{o}})</span> under KL loss, were constructed when <span>(pge 5)</span>. This findings represent an extension of Liang and Barron (IEEE Trans Inf Theory 50(11):2708–2726, 2004), George et al. (Ann Stat 34:78–91, 2006) and Komaki (Biometrika 88(3):859–864, 2001) results to a subclass of asymmetrical distributions.\u0000</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"6 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139772832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MetrikaPub Date : 2024-02-09DOI: 10.1007/s00184-024-00947-3
{"title":"Stochastic comparisons, differential entropy and varentropy for distributions induced by probability density functions","authors":"","doi":"10.1007/s00184-024-00947-3","DOIUrl":"https://doi.org/10.1007/s00184-024-00947-3","url":null,"abstract":"<h3>Abstract</h3> <p>Stimulated by the need of describing useful notions related to information measures, we introduce the ‘pdf-related distributions’. These are defined in terms of transformation of absolutely continuous random variables through their own probability density functions. We investigate their main characteristics, with reference to the general form of the distribution, the quantiles, and some related notions of reliability theory. This allows us to obtain a characterization of the pdf-related distribution being uniform for distributions of exponential and Laplace type as well. We also face the problem of stochastic comparing the pdf-related distributions by resorting to suitable stochastic orders. Finally, the given results are used to analyse properties and to compare some useful information measures, such as the differential entropy and the varentropy.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"18 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139758426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MetrikaPub Date : 2024-02-05DOI: 10.1007/s00184-023-00945-x
Shuji Ando
{"title":"Measure of deviancy from marginal mean equality based on cumulative marginal probabilities in square contingency tables","authors":"Shuji Ando","doi":"10.1007/s00184-023-00945-x","DOIUrl":"https://doi.org/10.1007/s00184-023-00945-x","url":null,"abstract":"<p>This study proposes a measure that can concurrently evaluate the degree and direction of deviancy from the marginal mean equality (ME) model in square contingency tables with ordered categories. The proposed measure is constructed as the function of the row and column cumulative marginal probabilities. When the ME model does not fit data, we are interested in measuring the degree of deviancy from the ME model, because the model having weaker restrictions than the ME model is only the saturated model. This existing measure, which represents the degree of deviancy from the ME model, does not depend on the probabilities that observations will fall in the main diagonal cells of the table. For the data in which observations are concentrated in the main diagonal cells, the existing measure may overestimate the degree of deviancy from the ME model. The proposed measure can address this issue. This study derives an estimator and an approximate confidence interval for the proposed measure using the delta method. The proposed measure would be utility for comparing degrees of deviancy from the ME model in two datasets. The proposed measure is evaluated the usefulness with the application to real data of clinical trials.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"6 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139758331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MetrikaPub Date : 2024-01-08DOI: 10.1007/s00184-023-00941-1
Cao Xuan Phuong, Le Thi Hong Thuy
{"title":"Nonparametric estimation of $${mathbb {P}}(X","authors":"Cao Xuan Phuong, Le Thi Hong Thuy","doi":"10.1007/s00184-023-00941-1","DOIUrl":"https://doi.org/10.1007/s00184-023-00941-1","url":null,"abstract":"<p>Let <i>X</i>, <i>Y</i> be continuous random variables with unknown distributions. The aim of this paper is to study the problem of estimating the probability <span>(theta := {mathbb {P}}(X<Y))</span> based on independent random samples from the distributions of <span>(X')</span>, <span>(Y')</span>, <span>(zeta )</span> and <span>(eta )</span>, where <span>(X' = X + zeta )</span>, <span>(Y' = Y + eta )</span> and <i>X</i>, <i>Y</i>, <span>(zeta )</span>, <span>(eta )</span> are mutually independent random variables. In this context, <span>(zeta )</span>, <span>(eta )</span> are referred to as measurement errors. We apply the ridge-parameter regularization method to derive a nonparametric estimator for <span>(theta )</span> depending on two parameters. Our estimator is shown to be consistent with respect to mean squared error if the characteristic functions of <span>(zeta )</span>, <span>(eta )</span> only vanish on Lebesgue measure zero sets. Under some further assumptions on the densities of <i>X</i>, <i>Y</i>, <span>(zeta )</span> and <span>(eta )</span>, we obtain some upper and lower bounds on the convergence rate of the estimator. A numerical example is also given to illustrate the efficiency of our method.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"216 3 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139408140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MetrikaPub Date : 2024-01-06DOI: 10.1007/s00184-023-00942-0
Ibrahim M. Almanjahie, Salim Bouzebda, Zoulikha Kaid, Ali Laksaci
{"title":"The local linear functional kNN estimator of the conditional expectile: uniform consistency in number of neighbors","authors":"Ibrahim M. Almanjahie, Salim Bouzebda, Zoulikha Kaid, Ali Laksaci","doi":"10.1007/s00184-023-00942-0","DOIUrl":"https://doi.org/10.1007/s00184-023-00942-0","url":null,"abstract":"<p>The main purpose of the present paper is to investigate the problem of the nonparametric estimation of the expectile regression in which the response variable is scalar while the covariate is a random function. More precisely, an estimator is constructed by using the local linear <i>k</i> Nearest Neighbor procedures (<i>k</i>NN). The main contribution of this study is the establishment of the Uniform consistency in Number of Neighbors of the constructed estimators. These results are established under fairly general structural conditions on the classes of functions and the underlying models. The usefulness of our result for the smoothing parameter automatic selection is discussed. Some simulation studies are carried out to show the finite sample performances of the <i>k</i>NN estimator. The theoretical uniform consistency results, established in this paper, are (or will be) key tools for many further developments in functional data analysis.</p>","PeriodicalId":49821,"journal":{"name":"Metrika","volume":"44 1","pages":""},"PeriodicalIF":0.7,"publicationDate":"2024-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139375279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}