{"title":"Cluster Regularization via a Hierarchical Feature Regression","authors":"Johann Pfitzinger","doi":"10.1016/j.ecosta.2024.01.003","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.01.003","url":null,"abstract":"<p>The hierarchical feature regression (HFR) is a novel graph-based regularized regression estimator, which mobilizes insights from the domains of machine learning and graph theory to estimate robust parameters for a linear regression. The estimator constructs a supervised feature graph that decomposes parameters along its edges, adjusting first for common variation and successively incorporating idiosyncratic patterns into the fitting process. The graph structure has the effect of shrinking parameters towards group targets, where the extent of shrinkage is governed by a hyperparameter, and group compositions as well as shrinkage targets are determined endogenously. The method offers rich resources for the visual exploration of the latent effect structure in the data, and demonstrates good predictive accuracy and versatility when compared to a panel of commonly used regularization techniques across a range of empirical and simulated regression tasks.</p>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"2020 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139517665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Inference on Multiple Change Points in High Dimensional Linear Regression Models","authors":"Hongjin Zhang, Abhishek Kaul","doi":"10.1016/j.ecosta.2024.01.002","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.01.002","url":null,"abstract":"<p>Confidence intervals are constructed for multiple change points in high-dimensional linear regression models. Locally refitted estimators are developed, and their rate of convergence is evaluated. The componentwise rate of estimation obtained is optimal, and the simultaneous rate is the sharpest available in the literature. Limiting distributions of the considered estimates are provided in both vanishing and non-vanishing jump size regimes, along with the joint limiting distributions. The relationship between the distributions in the two regimes is further examined, and an adaptation property is illustrated to allow for inference without knowledge of the underlying regime. Theoretical results are supported by Monte Carlo simulations and further demonstrated by a real data example.</p>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"213 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139517994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Highly irregular serial correlation tests","authors":"Dante Amengual, Xinyue Bei, Enrique Sentana","doi":"10.1016/j.ecosta.2024.01.001","DOIUrl":"https://doi.org/10.1016/j.ecosta.2024.01.001","url":null,"abstract":"<p>Tests are developed for neglected serial correlation when the information matrix is repeatedly singular under the null hypothesis. Specifically, consideration is given to white noise against a multiplicative seasonal <span>Ar</span> model, and a local-level model against a nesting <span>Ucarima</span>one. The proposed tests, which involve higher-order derivatives, are asymptotically equivalent to the likelihood ratio test but only require estimation under the null. It is shown that the tests effectively check that certain autocorrelations of the observations are zero, so their asymptotic distribution is standard. Monte Carlo exercises examine finite sample size and power properties, with comparisons made to alternative approaches.</p>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"69 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139411045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christina Erlwein-Sayer, Stefanie Grimm, Alexander Pieper, Rümeysa Alsaç
{"title":"Forecasting corporate credit spreads: regime-switching in LSTM","authors":"Christina Erlwein-Sayer, Stefanie Grimm, Alexander Pieper, Rümeysa Alsaç","doi":"10.1016/j.ecosta.2023.12.002","DOIUrl":"https://doi.org/10.1016/j.ecosta.2023.12.002","url":null,"abstract":"<p>A long short-term memory model (LSTM) which utilises regime-switching state information as a feature to predict the change of credit spreads is developed. Latent changes in the market are filtered out from observable credit spread time series. These hidden information of regime changes are incorporated into an LSTM, where the state probability is utilised as a feature for one-step ahead predictions of the credit spreads. Firstly, time series from corporate credit spreads are modelled through a Hidden Markov model (HMM) which is based on a discretised Ornstein-Uhlenbeck process. State-related information of the Markov chain, like the jump frequency and state occupation time hidden in the observed spreads are filtered out and adaptive HMM filters are built to estimate probabilities of hidden market states. The performance of the LSTM with regime-switching information is analysed and compared to the accuracy of a pure LSTM without state features. Furthermore, purely utilising the HMM forecast, the prediction of the credit spread is compared to the prediction within the LSTM. Beyond a simulations study, the HMM-LSTM model is calibrated on corporate credit spreads from three European countries between 2004 and 2019. The findings show that the LSTM forecast error is improved when regime information is added, mostly in cases with stronger market fluctuations.</p>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"23 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139062651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Muhammad Qasim, Kristofer Månsson, Pär Sjölander, B. M. Golam Kibria
{"title":"Stein-type control function maximum likelihood estimator for the probit model in the presence of endogeneity","authors":"Muhammad Qasim, Kristofer Månsson, Pär Sjölander, B. M. Golam Kibria","doi":"10.1016/j.ecosta.2023.12.001","DOIUrl":"https://doi.org/10.1016/j.ecosta.2023.12.001","url":null,"abstract":"<p>A Stein-type control function maximum likelihood (CFML) estimator is suggested for the probit model in the presence of endogeneity. This novel estimator combines the probit maximum likelihood and CFML estimators. The asymptotic distribution and risk function for the new estimator is derived. It is demonstrated that, subject to certain conditions of the shrinkage parameter, the asymptotic risk of the new estimator is strictly smaller than the risk of the CFML. Monte Carlo simulations illustrate the method's superiority in finite samples. The method is also applied to analyze the impact of managerial incentives on the use of foreign-exchange derivatives.</p>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"87 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138565753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Large Sample Properties of Entropy Balancing Estimators of Average Causal Effects","authors":"David Källberg, Ingeborg Waernbaum","doi":"10.1016/j.ecosta.2023.11.004","DOIUrl":"https://doi.org/10.1016/j.ecosta.2023.11.004","url":null,"abstract":"<p>Weighting methods are used in observational studies to adjust for covariate imbalances between treatment and control groups. Entropy balancing (EB) is an alternative to inverse probability weighting with an estimated propensity score. The EB weights are constructed to satisfy balance constraints and optimized towards stability. Large sample properties of EB estimators of the average causal treatment effect, based on the Kullback-Leibler and quadratic Rényi relative entropies, are described. Additionally, estimators of their asymptotic variances are proposed. Even though the objective of EB is to reduce model dependence, the estimators are generally not consistent unless implicit parametric assumptions for the propensity score or conditional outcomes are met. The finite sample properties of the estimators are investigated through a simulation study. The average causal effect of smoking on blood lead levels is estimated using data from the National Health and Nutrition Examination Survey.</p>","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"63 6","pages":""},"PeriodicalIF":1.9,"publicationDate":"2023-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138526119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Dynamic Spatiotemporal Stochastic Volatility Model with an Application to Environmental Risks","authors":"Philipp Otto, Osman Doğan, Süleyman Taşpınar","doi":"10.1016/j.ecosta.2023.11.002","DOIUrl":"https://doi.org/10.1016/j.ecosta.2023.11.002","url":null,"abstract":"A dynamic spatiotemporal stochastic volatility (SV) model is introduced, incorporating explicit terms accounting for spatial, temporal, and spatiotemporal spillover effects. Alongside these features, the model encompasses time-invariant site-specific factors, allowing for differentiation in volatility levels across locations. The statistical properties of an outcome variable within this model framework are examined, revealing the induction of spatial dependence in the outcome variable. Additionally, a Bayesian estimation procedure employing the Markov Chain Monte Carlo (MCMC) approach, complemented by a suitable data transformation, is presented. Simulation experiments are conducted to assess the performance of the proposed Bayesian estimator. Subsequently, the model is applied in the domain of environmental risk modeling, addressing the scarcity of empirical studies in this field. The significance of climate variation studies is emphasized, illustrated by an analysis of local air quality in Northern Italy during 2021, which underscores pronounced spatial and temporal clusters and increased uncertainties/risks during the winter season compared to the summer season.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"5 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135564688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sergi Jiménez-Martín, José M. Labeaga, Majid al Sadoon
{"title":"Consistent estimation of panel data sample selection models","authors":"Sergi Jiménez-Martín, José M. Labeaga, Majid al Sadoon","doi":"10.1016/j.ecosta.2023.11.003","DOIUrl":"https://doi.org/10.1016/j.ecosta.2023.11.003","url":null,"abstract":"The properties of classical panel data estimators including fixed effect, first-differences, random effects, and generalized method of moments-instrumental variables estimators in both static as well as dynamic panel data models are investigated under sample selection. The correlation of the unobserved errors is shown not to be sufficient for the inconsistency of these estimators. A necessary condition for this to arise is the presence of common (and/or non-independent) non-deterministic covariates in the selection and outcome equations. When both equations do not have covariates in common and independent of each other, the fixed effects, and random effects estimators in static models with exogenous covariates are consistent. Furthermore, the first-differenced generalized method of moments estimator uncorrected for sample selection as well as the instrumental variables estimator uncorrected for sample selection are both consistent for autoregressive models even with endogenous covariates. The same results hold when both equations have no covariates in common but are correlated once we account for such correlation. Under the same circumstances, the system generalized method of moments estimator adding more moments from the levels equation has moderate bias. Alternatively, when both equations have common covariates the appropriate correction method is suggested. Serial correlation of the errors being a key determinant for that choice. The finite sample properties of the proposed estimators are evaluated using a Monte Carlo study. Two empirical illustrations are provided.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"46 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136103382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust Clustering with Normal Mixture Models: A Pseudo β-Likelihood Approach","authors":"Soumya Chakraborty, Ayanendranath Basu, Abhik Ghosh","doi":"10.1016/j.ecosta.2023.10.004","DOIUrl":"https://doi.org/10.1016/j.ecosta.2023.10.004","url":null,"abstract":"As in other estimation scenarios, likelihood based estimation in the normal mixture set-up is highly non-robust against model misspecification and presence of outliers (apart from being an ill-posed optimization problem). A robust alternative to the ordinary likelihood approach for this estimation problem is proposed which performs simultaneous estimation and data clustering and leads to subsequent anomaly detection. To invoke robustness, the methodology based on the minimization of the density power divergence (or alternatively, the maximization of the β-likelihood) is utilized under suitable constraints. An iteratively reweighted least squares approach has been followed in order to compute the proposed estimators for the component means (or equivalently cluster centers) and component dispersion matrices which leads to simultaneous data clustering. Some exploratory techniques are also suggested for anomaly detection, a problem of great importance in the domain of statistics and machine learning. The proposed method is validated with simulation studies under different set-ups; it performs competitively or better compared to the popular existing methods like K-medoids, TCLUST, trimmed K-means and MCLUST, especially when the mixture components (i.e., the clusters) share regions with significant overlap or outlying clusters exist with small but non-negligible weights (particularly in higher dimensions). Two real datasets are also used to illustrate the performance of the newly proposed method in comparison with others along with an application in image processing. The proposed method detects the clusters with lower misclassification rates and successfully points out the outlying (anomalous) observations from these datasets.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"49 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135410228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Pooled Bewley Estimator of Long Run Relationships in Dynamic Heterogenous Panels","authors":"Alexander Chudik, M. Hashem Pesaran, Ron P. Smith","doi":"10.1016/j.ecosta.2023.11.001","DOIUrl":"https://doi.org/10.1016/j.ecosta.2023.11.001","url":null,"abstract":"Using a transformation of the autoregressive distributed lag model due to Bewley, a novel pooled Bewley (PB) estimator of long-run coefficients for dynamic panels with heterogeneous short-run dynamics is proposed. The PB estimator is directly comparable to the widely used Pooled Mean Group (PMG) estimator, and is shown to be consistent and asymptotically normal. Monte Carlo simulations show good small sample performance of PB compared to the existing estimators in the literature, namely PMG, panel dynamic OLS (PDOLS), and panel fully-modified OLS (FMOLS). Application of two bias-correction methods and a bootstrapping of critical values to conduct inference robust to cross-sectional dependence of errors are also considered. The utility of the PB estimator is illustrated in an empirical application to the aggregate consumption function.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"49 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135411092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}