{"title":"Asymptotic theory of the best-choice rerandomization using the Mahalanobis distance","authors":"Yuhao Wang , Xinran Li","doi":"10.1016/j.jeconom.2025.106049","DOIUrl":"10.1016/j.jeconom.2025.106049","url":null,"abstract":"<div><div>Rerandomization, a design that utilizes pretreatment covariates and improves their balance between different treatment groups, has received attention recently in both theory and practice. From a survey by Bruhn and McKenzie (2009), there are at least two types of rerandomization that are used in practice: the first rerandomizes the treatment assignment until covariate imbalance is below a prespecified threshold; the second randomizes the treatment assignment multiple times and chooses the one with the best covariate balance. In this paper we will consider the second type of rerandomization, namely the best-choice rerandomization, whose theory and inference are still lacking in the literature. In particular, we will focus on the best-choice rerandomization that uses the Mahalanobis distance to measure covariate imbalance, which is one of the most commonly used imbalance measure for multivariate covariates and is invariant to affine transformations of covariates. We will study the large-sample repeatedly sampling properties of the best-choice rerandomization, allowing both the number of covariates and the number of tried complete randomizations to increase with the sample size. We show that the asymptotic distribution of the difference-in-means estimator is more concentrated around the true average treatment effect under rerandomization than under the complete randomization, and propose large-sample accurate confidence intervals for rerandomization that are shorter than that for the completely randomized experiment. We further demonstrate that, with moderate number of covariates and with the number of tried randomizations increasing polynomially with the sample size, the best-choice rerandomization can achieve the ideally optimal precision that one can expect even with perfectly balanced covariates. The developed theory and methods for rerandomization are also illustrated using real field experiments.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"251 ","pages":"Article 106049"},"PeriodicalIF":9.9,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144479985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A robust residual-based test for structural changes in factor models","authors":"Bin Peng , Liangjun Su , Yayi Yan","doi":"10.1016/j.jeconom.2025.106042","DOIUrl":"10.1016/j.jeconom.2025.106042","url":null,"abstract":"<div><div>In this paper, we propose an easy-to-implement residual-based specification testing procedure for detecting structural changes in factor models, which is powerful against both smooth and abrupt structural changes with unknown break dates. The proposed test is robust to the over-specified number of factors, and serially and cross-sectionally correlated error processes. A new central limit theorem is given for the quadratic forms of panel data with dependence over both dimensions, thereby filling a gap in the literature. We establish the asymptotic properties of the proposed test statistic, and accordingly develop a simulation-based scheme to select critical value in order to improve finite sample performance. Through extensive simulations and a real-world application, we confirm our theoretical results and demonstrate that the proposed test exhibits desirable size and power in practice.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"251 ","pages":"Article 106042"},"PeriodicalIF":9.9,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144271055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multivariate stochastic volatility models based on generalized Fisher transformation","authors":"Han Chen , Yijie Fei , Jun Yu","doi":"10.1016/j.jeconom.2025.106041","DOIUrl":"10.1016/j.jeconom.2025.106041","url":null,"abstract":"<div><div>Modeling multivariate stochastic volatility (MSV) can pose significant challenges, particularly when both variances and covariances are time-varying. In this study, we tackle these complexities by introducing novel MSV models based on the generalized Fisher transformation (GFT) proposed by Archakov and Hansen (2021). Our model exhibits remarkable flexibility, ensuring the positive-definiteness of the variance–covariance matrix, and disentangling the driving forces of volatilities and correlations. To conduct Bayesian analysis of the models, we employ a Particle Gibbs Ancestor Sampling (PGAS) method, facilitating efficient Bayesian model comparisons. Furthermore, we extend our MSV model to cover leverage effects and incorporate realized measures. Our simulation studies demonstrate that the proposed method performs well for our GFT-based MSV model. Furthermore, empirical studies based on equity returns show that the MSV models outperform alternative specifications in both in-sample and out-of-sample performances.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"251 ","pages":"Article 106041"},"PeriodicalIF":9.9,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144262373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Time-varying vector error-correction models: Estimation and inference","authors":"Jiti Gao , Bin Peng , Yayi Yan","doi":"10.1016/j.jeconom.2025.106035","DOIUrl":"10.1016/j.jeconom.2025.106035","url":null,"abstract":"<div><div>This paper considers a time-varying vector error-correction model that allows for different time series behaviors (e.g., unit-root and locally stationary processes) to interact with each other and co-exist. From a practical perspective, this framework can be used to estimate shifts in the predictability of non-stationary variables, and test whether economic theories hold periodically, etc. We first develop a time-varying Granger Representation Theorem, which facilitates the establishment of an asymptotic theory for the model, and then propose estimation and inferential methods for both short-run and long-run coefficients. We also propose an information criterion to estimate the lag length, a singular-value ratio test to determine the cointegration rank, and a hypothesis test to examine the parameter stability. Finally, we extend the framework to allow for unknown structural breaks in either cointegration relationship or time-varying coefficient functions. To validate the theoretical findings, we conduct extensive simulations, and demonstrate the empirical relevance by testing the present value model for stock returns.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"251 ","pages":"Article 106035"},"PeriodicalIF":9.9,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144212972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Distribution regression with censored selection","authors":"Songnian Chen , Nianqing Liu , Hanghui Zhang","doi":"10.1016/j.jeconom.2025.106030","DOIUrl":"10.1016/j.jeconom.2025.106030","url":null,"abstract":"<div><div>Chernozhukov, Fernández-Val, and Luo (2023, CFL (2023) hereafter) considered a distribution regression model subject to sample selection with a binary selection mechanism. In this paper, we show how to identify and estimate a semi-parametric distribution regression model subject to a censored selection rule. With censored selection, we do not need to impose the usual outcome exclusion restriction or exclusion of the level of the latent selection variable from the selection sorting function for model identification, unlike CFL (2023). We propose new semiparametric estimators and corresponding inference procedures for model parameters and related functional parameters. We apply our method to investigate wage inequality in the UK for the period 1978–2000 using the Family Expenditure Survey (FES) data. Our findings reveal that (i) the selection sorting exclusion and outcome exclusion restrictions imposed by CFL (2023) are rejected; (ii) there is negative selection into work at most quantile levels for females, but not for males; (iii) in contrast to CFL (2023), our selection sorting effect pattern does not offer clear evidence on assortative matching or glass ceiling in the UK labor market; (iv) the latent gender wage gaps after correcting for selection bias are about 25%–50% of CFL (2023)’s estimates, and are also significantly smaller than the observed wage gaps; (v) similar to CFL (2023), there exists some strong evidence on gender discrimination in the UK labor market.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"251 ","pages":"Article 106030"},"PeriodicalIF":9.9,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144195267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multilevel matrix factor model","authors":"Yuteng Zhang , Yongchang Hui , Junrong Song , Shurong Zheng","doi":"10.1016/j.jeconom.2025.106033","DOIUrl":"10.1016/j.jeconom.2025.106033","url":null,"abstract":"<div><div>Large scale matrix data has been widely discovered and continuously studied in various fields recently. We propose a multilevel matrix factor model considering the existence of multi level factor structure in matrix time series. This model incorporates both global factors influencing all matrix time series and local factors confined to impact specific matrix time series. Asymptotic properties are established to ensure the consistency of our procedure for estimating factor loading matrices. To demonstrate the finite-sample performance of our estimation, we present comprehensive simulation results. Finally, we apply our model to an empirical analysis of eight indexes, including return, trading volume, and trading value, across 200 stocks from ten distinct industries.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"251 ","pages":"Article 106033"},"PeriodicalIF":9.9,"publicationDate":"2025-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144184328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Realized candlestick wicks","authors":"Yifan Li , Ingmar Nolte , Sandra Nolte , Shifan Yu","doi":"10.1016/j.jeconom.2025.106014","DOIUrl":"10.1016/j.jeconom.2025.106014","url":null,"abstract":"<div><div>We develop a novel nonparametric estimator of integrated variance by summing up the squared wick lengths of intraday candlesticks over a fixed time interval. The proposed wick-based estimator is robust to short-lived extreme price movements, such as gradual jumps and flash crashes. We investigate the asymptotic properties of the proposed estimator, and show that its asymptotic variance is about four times smaller than the state-of-the-art differenced-return volatility (DV) estimator. We also develop a Hausman-type test for the presence of both jumps and episodic extreme price movements. Monte Carlo simulations and empirical applications further validate the practical reliability of our proposed estimator.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"250 ","pages":"Article 106014"},"PeriodicalIF":9.9,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144139050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Inference for large dimensional factor models under general missing data patterns","authors":"Liangjun Su , Fa Wang","doi":"10.1016/j.jeconom.2025.106022","DOIUrl":"10.1016/j.jeconom.2025.106022","url":null,"abstract":"<div><div>This paper establishes the inferential theory for the least squares estimation of large factor models with missing data. We propose a unified framework for asymptotic analysis of factor models that covers a wide range of missing patterns, including heterogenous random missing, selection on covariates/factors/loadings, block/staggered missing, mixed frequency and ragged edge. We establish the average convergence rates of the estimated factor space and loading space, the limit distributions of the estimated factors and loadings, as well as the limit distributions of the estimated average treatment effects and the parameter estimates in the factor-augmented regressions. These results allow us to impute the unbalanced panel appropriately or make inference for the heterogenous treatment effects. For computation, we can use the nuclear norm regularized estimator as the initial value for the EM algorithm and iterate until convergence. Empirically, we apply our method to test the average treatment effects of partisan alignment on grant allocation in UK.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"250 ","pages":"Article 106022"},"PeriodicalIF":9.9,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144134114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
B. Cooper Boniece , Lajos Horváth , Lorenzo Trapani
{"title":"On changepoint detection in functional data using empirical energy distance","authors":"B. Cooper Boniece , Lajos Horváth , Lorenzo Trapani","doi":"10.1016/j.jeconom.2025.106023","DOIUrl":"10.1016/j.jeconom.2025.106023","url":null,"abstract":"<div><div>We propose a novel family of test statistics to detect the presence of changepoints in a sequence of dependent, possibly multivariate, functional-valued observations. Our approach allows to test for a very general class of changepoints, including the “classical” case of changes in the mean, and even changes in the whole distribution. Our statistics are based on a generalisation of the empirical energy distance; we propose weighted functionals of the energy distance process, which are designed in order to enhance the ability to detect breaks occurring at sample endpoints. The limiting distribution of the maximally selected version of our statistics requires only the computation of the eigenvalues of the covariance function, thus being readily implementable in the most commonly employed packages, e.g. <span>R</span>. We show that, under the alternative, our statistics are able to detect changepoints occurring even very close to the beginning/end of the sample. In the presence of multiple changepoints, we propose a binary segmentation algorithm to estimate the number of breaks and the locations thereof. Simulations show that our procedures work very well in finite samples. We complement our theory with applications to financial and temperature data.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"250 ","pages":"Article 106023"},"PeriodicalIF":9.9,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144124770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sergei Bazylik , Magne Mogstad , Joseph P. Romano , Azeem M. Shaikh , Daniel Wilhelm
{"title":"Finite- and large-sample inference for ranks using multinomial data with an application to ranking political parties","authors":"Sergei Bazylik , Magne Mogstad , Joseph P. Romano , Azeem M. Shaikh , Daniel Wilhelm","doi":"10.1016/j.jeconom.2025.106010","DOIUrl":"10.1016/j.jeconom.2025.106010","url":null,"abstract":"<div><div>It is common to rank different categories by means of preferences that are revealed through data on choices. A prominent example is the ranking of political candidates or parties using the estimated share of support each one receives in surveys or polls about political attitudes. Since these rankings are computed using estimates of the share of support rather than the true share of support, there may be considerable uncertainty concerning the true ranking of the political candidates or parties. In this paper, we consider the problem of accounting for such uncertainty by constructing confidence sets for the rank of each category. We consider both the problem of constructing marginal confidence sets for the rank of a particular category as well as simultaneous confidence sets for the ranks of all categories. A distinguishing feature of our analysis is that we exploit the multinomial structure of the data to develop confidence sets that are valid in finite samples. We additionally develop confidence sets using the bootstrap that are valid only approximately in large samples. We use our methodology to rank political parties in Australia using data from the 2019 Australian Election Survey. We find that our finite-sample confidence sets are informative across the entire ranking of political parties, even in Australian territories with few survey respondents and/or with parties that are chosen by only a small share of the survey respondents. In contrast, the bootstrap-based confidence sets may sometimes be considerably less informative. These findings motivate us to compare these methods in an empirically-driven simulation study, in which we conclude that our finite-sample confidence sets often perform better than their large-sample, bootstrap-based counterparts, especially in settings that resemble our empirical application.</div></div>","PeriodicalId":15629,"journal":{"name":"Journal of Econometrics","volume":"250 ","pages":"Article 106010"},"PeriodicalIF":9.9,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143936819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}