Jan Pablo Burgard, Joscha Krause, Domingo Morales, Anna-Lena Wölwer
{"title":"Empirical best predictors under multivariate Fay-Herriot models and their numerical approximation","authors":"Jan Pablo Burgard, Joscha Krause, Domingo Morales, Anna-Lena Wölwer","doi":"10.1016/j.ecosta.2024.09.001","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.09.001","url":null,"abstract":"Small area estimation of multivariable non-linear domain indicators using aggregated data is addressed. By assuming that the target vector follows a multivariate Fay-Herriot model, empirical best predictors of domain parameters that are arbitrary Lebesgue-measurable functions of multiple target variables are derived. In this context, Monte Carlo and Gauss-Hermite quadrature methods for integral approximation are discussed. A parametric bootstrap algorithm for mean squared error estimation is presented. Simulation experiments are conducted to study the behaviour of the introduced methodology. Moreover, an illustrative application to real data from the Spanish labour force survey is provided. In this example, province-level unemployment rates, crossed by age classes and sex, are estimated.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"14 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142250941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nikoleta Anesti, Eleni Kalamara, George Kapetanios
{"title":"Forecasting with Machine Learning methods and multiple large datasets[formula omitted]","authors":"Nikoleta Anesti, Eleni Kalamara, George Kapetanios","doi":"10.1016/j.ecosta.2024.08.003","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.08.003","url":null,"abstract":"The usefulness of machine learning techniques for forecasting macroeconomic variables using multiple large datasets is considered. The predictive content of surveys is compared with text-based indicators from newspaper articles and a standard macroeconomic dataset, extending the evidence on the contribution of each dataset in predicting economic activity. Among the linear models, the Ridge regression and the Partial Least Squares models report the largest gains consistently for most of the forecasting horizons, and among the non linear machine learning models, Support Vector Regression performs better at shorter horizons compared to the Neural Networks and Random Forest that yield more accurate forecasts up to two years ahead. Text based indicators have similar informational content to surveys, albeit combining the two datasets provides with more accurate forecasts for most of the forecast horizons. The largest forecasting gains are overwhelmingly concentrated at the shorter horizons for the majority of models and datasets and they decrease significantly after one year. Non-linear machine learning models appear to be mostly useful during the Great Financial Crisis and perform similarly to their linear counterparts in more normal periods.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"27 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142218394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christos K. Papadimitriou, Simos G. Meintanis, Bernardo B. Andrade, Mike G. Tsionas
{"title":"Specification tests for normal/gamma and stable/gamma stochastic frontier models based on empirical transforms","authors":"Christos K. Papadimitriou, Simos G. Meintanis, Bernardo B. Andrade, Mike G. Tsionas","doi":"10.1016/j.ecosta.2024.08.002","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.08.002","url":null,"abstract":"Goodness–of–fit tests for the distribution of the composed error term in a Stochastic Frontier Model (SFM) are suggested. The focus is on the case of a normal/gamma SFM and the heavy–tailed stable/gamma SFM. In the first case the moment generating function is used as tool while in the latter case the characteristic function of the error term is employed. In both cases our test statistics are formulated as weighted integrals of properly standardized data. The new normal/gamma test is consistent, and is shown to have an intrinsic relation to moment–based tests. The finite–sample behavior of resampling versions of both tests is investigated by Monte Carlo simulation, while several real–data applications are also included.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"5 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142250987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Bayesian flexible model for testing Granger causality","authors":"Iván Gutiérrez, Danilo Alvares, Luis Gutiérrez","doi":"10.1016/j.ecosta.2024.08.001","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.08.001","url":null,"abstract":"A new Bayesian hypothesis testing procedure for evaluating the Granger causality between two or more time series is proposed. The test is based on a flexible model for the joint evolution of multiple series, where a latent binary matrix indicates whether there is a Granger-causal relationship between such time series. The model is specified through a dependent Geometric stick-breaking process that generalizes the standard parametric Gaussian vector autoregression model, whereas the prior distribution of the latent matrix ensures a multiple testing correction. A Monte Carlo simulation study is provided for comparing the robustness of the proposed hypothesis test with state-of-the-art alternatives. The results show that this proposal performs better than competing approaches. Finally, the new test is applied to real economic data.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"3 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142218395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Seung Woo Kwak , Jeongyoun Ahn , Jaewoo Lee , Cheolwoo Park
{"title":"Differentially Private Goodness-of-Fit Tests for Continuous Variables","authors":"Seung Woo Kwak , Jeongyoun Ahn , Jaewoo Lee , Cheolwoo Park","doi":"10.1016/j.ecosta.2021.09.007","DOIUrl":"10.1016/j.ecosta.2021.09.007","url":null,"abstract":"<div><p>Data privacy is a growing concern in modern data analyses as more and more types of information about individuals are collected and shared. Statistical analysis in consideration of privacy is thus becoming an exciting area of research. Differential privacy can provide a means by which one can measure the stochastic risk of violating the privacy of individuals that can result from conducting an analysis, such as a simple query from a database and a hypothesis test. The main interest of the work is a goodness-of-fit test that compares the sampled data to a known distribution. Many differentially private goodness-of-fit tests have been proposed for discrete random variables, but little work has been done for continuous variables. The objective is to review some existing tests that guarantee differential privacy for discrete random variables, and to propose an extension to continuous cases via a discretization process. The proposed test procedures are demonstrated through simulated examples and applied to the Household Financial Welfare Survey of South Korea in 2018.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"31 ","pages":"Pages 81-99"},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74095071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Semiparametric Averaging of Nonlinear Marginal Logistic Regressions and Forecasting for Time Series Classification","authors":"Rong Peng , Zudi Lu","doi":"10.1016/j.ecosta.2021.11.001","DOIUrl":"10.1016/j.ecosta.2021.11.001","url":null,"abstract":"<div><p>Binary classification is an important issue in many applications but mostly studied for independent data in the literature. A binary time series classification is investigated by proposing a semiparametric procedure named “Model Averaging nonlinear MArginal LOgistic Regressions” (MAMaLoR) for binary time series data based on the time series information of predictor variables. The procedure involves approximating the logistic multivariate conditional regression function by combining low-dimensional non-parametric nonlinear marginal logistic regressions, in the sense of Kullback-Leibler distance. A time series conditional likelihood method is suggested for estimating the optimal averaging weights together with local maximum likelihood estimations of the nonparametric marginal time series logistic (auto)regressions. The asymptotic properties of the procedure are established under mild conditions on the time series observations that are of <span><math><mi>β</mi></math></span>-mixing property. The procedure is less computationally demanding and can avoid the “curse of dimensionality” for, and be easily applied to, high dimensional lagged information based nonlinear time series classification forecasting. The performances of the procedure are further confirmed both by Monte-Carlo simulation and an empirical study for market moving direction forecasting of the financial FTSE 100 index data.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"31 ","pages":"Pages 19-37"},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86385087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Conditional Quantile Functions for Zero-Inflated Longitudinal Count Data","authors":"Carlos Lamarche , Xuan Shi , Derek S. Young","doi":"10.1016/j.ecosta.2021.09.003","DOIUrl":"10.1016/j.ecosta.2021.09.003","url":null,"abstract":"<div><p>The identification and estimation of conditional quantile functions for count responses using longitudinal data are considered. The approach is based on a continuous approximation to distribution functions for count responses within a class of parametric models that are commonly employed. It is first shown that conditional quantile functions for count responses are identified in zero-inflated models with subject heterogeneity. Then, a simple three-step approach is developed to estimate the effects of covariates on the quantiles of the response variable. A simulation study is presented to show the small sample performance of the estimator. Finally, the advantages of the proposed estimator in relation to some existing methods is illustrated by estimating a model of annual visits to physicians using data from a health insurance experiment.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"31 ","pages":"Pages 49-65"},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86565670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Forecasting Near-equivalence of Linear Dimension Reduction Methods in Large Panels of Macro-variables","authors":"Alessandro Barbarino , Efstathia Bura","doi":"10.1016/j.ecosta.2021.10.007","DOIUrl":"10.1016/j.ecosta.2021.10.007","url":null,"abstract":"<div><p>In an extensive pseudo out-of-sample horserace, classical estimators (dynamic factor models, RIDGE and partial least squares regression) and the novel to forecasting, Regularized Sliced Inverse Regression, exhibit almost near-equivalent forecasting accuracy in a large panel of macroeconomic variables across targets, horizons and subsamples. This finding motivates the theoretical contributions in this paper. Most widely used linear dimension reduction methods are shown to solve closely related maximization problems with solutions that can be decomposed in <em>signal</em> and <em>scaling</em> components. They are organized under a common scheme that sheds light on their commonalities and differences as well as on their functionality. Regularized Sliced Inverse Regression delivers the most parsimonious forecast model and obtains the greatest reduction of the complexity of the forecasting problem. Nevertheless, the study’s findings are that (a) the intrinsic relationship between forecast target and the other macroseries in the panel is linear and (b) targeting contributes in reducing the complexity of modeling yet does not induce significant gains in macroeconomic forecasting accuracy.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"31 ","pages":"Pages 1-18"},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80948953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bias correction for Vandermonde low-rank approximation","authors":"Antonio Fazzi , Alexander Kukush , Ivan Markovsky","doi":"10.1016/j.ecosta.2021.09.001","DOIUrl":"10.1016/j.ecosta.2021.09.001","url":null,"abstract":"<div><p>The low-rank approximation problem, that is the problem of approximating a given matrix with a matrix of lower rank, appears in many scientific fields. In some applications the given matrix is structured and the approximation is required to have the same structure. Examples of linear structures are Hankel, Toeplitz, and Sylvester. Currently, there are only a few results for nonlinearly structured low-rank approximation problems. The problem of Vandermonde structured low-rank approximation is considered. The high condition number of the Vandermonde matrix, in combination with the noise in the data, makes the problem challenging. A numerical method based on a bias correction procedure is proposed and its properties are demonstrated by simulation. The performance of the method is illustrated on numerical results.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"31 ","pages":"Pages 38-48"},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74509464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multivariate Count Time Series Modelling","authors":"Konstantinos Fokianos","doi":"10.1016/j.ecosta.2021.11.006","DOIUrl":"10.1016/j.ecosta.2021.11.006","url":null,"abstract":"<div><p>Autoregressive models are reviewed for the analysis of multivariate count time series. A particular topic of interest which is discussed in detail is that of the choice of a suitable distribution for a vectors of count random variables. The focus is on three main approaches taken for multivariate count time series analysis: (a) integer autoregressive processes, (b) parameter-driven models and (c) observation-driven models. The aim is to highlight some recent methodological developments and propose some potentially useful research topics.</p></div>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"31 ","pages":"Pages 100-116"},"PeriodicalIF":2.0,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89113611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}