{"title":"Accelerated failure time model under dependent truncated data","authors":"Jin-Jian Hsieh, Siang-Ying Chen","doi":"10.1016/j.jspi.2025.106297","DOIUrl":"10.1016/j.jspi.2025.106297","url":null,"abstract":"<div><div>This paper delves into the accelerated failure time model within the framework of dependent truncation data and leverages the copula model to establish correlations within the dataset. Building upon the work of Chaieb et al. (2006), who utilized the copula-graphic method to estimate survival functions and proposed an approach for estimating correlation parameters, we further extend the methodology by introducing two distinct estimation techniques for regression parameters. The first method involves parameter evaluation through the calculation of the area between survival curves, while the second method employs the weight of survival jump in conjunction with the least squares approach to estimate regression parameters. We evaluate the efficacy of these proposed estimation procedures through simulation studies and conduct a comparative analysis between the two approaches. Furthermore, we apply these methodologies to two real-world datasets, providing insights into their practical applicability. Through this analysis, we gain a deeper understanding of how these approaches can be effectively utilized in real-world scenarios.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"240 ","pages":"Article 106297"},"PeriodicalIF":0.8,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144166460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Semiparametric modal regression with varying coefficients and measurement error","authors":"Aman Ullah , Tao Wang","doi":"10.1016/j.jspi.2025.106307","DOIUrl":"10.1016/j.jspi.2025.106307","url":null,"abstract":"<div><div>We in this paper propose a stepwise estimation procedure for semiparametric modal regression with varying coefficients and measurement error, where the linear covariate is unobserved but an ancillary variable is available. This modal regression framework, which is built on the mode value rather than the mean, captures the “most likely” effect instead of the traditional average effect. The proposed stepwise procedure introduces a restricted regression mode by imposing a structural constraint on the model, allowing us to concentrate out the varying coefficients using the “correction for attenuation” method commonly employed in mean regression. This transformation reduces the original model to a parametric modal regression. We establish the consistency and asymptotic normality of the resulting modal estimators by analyzing the tail behavior of the characteristic function of the error distribution, distinguishing between ordinary smooth and super smooth cases. Additionally, we investigate bandwidth selection strategies and propose a simulation-extrapolation algorithm as a practical approach for optimal bandwidth choice. We conduct Monte Carlo simulations to assess the finite sample performance of the resulting estimators and present a real data analysis to further illustrate the effectiveness of the suggested estimation procedure.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"240 ","pages":"Article 106307"},"PeriodicalIF":0.8,"publicationDate":"2025-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144134836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Divide and conquer for generalized approximately expectile regression","authors":"Zhen Zeng , Weixin Yao","doi":"10.1016/j.jspi.2025.106300","DOIUrl":"10.1016/j.jspi.2025.106300","url":null,"abstract":"<div><div>When the size of the dataset becomes extremely large, it is computationally challenge for traditional statistical estimation methods and might be infeasible to store all the data on a single computer. Under the massive data framework, we extend the divide and conquer method to the generalized approximately expectile regression and investigate both of their finite and asymptotic properties. Bahadur representation of the estimators are established. Moreover, we prove that with the appropriate number of subsamples, the proposed method can ensure the accuracy of statistical inference. Simulations studies validate our theoretical findings. Supplementary materials for this article are available online.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"240 ","pages":"Article 106300"},"PeriodicalIF":0.8,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144139243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Francesco Gili, Geurt Jongbloed, Aad van der Vaart
{"title":"Asymptotically efficient estimation under local constraint in Wicksell’s problem","authors":"Francesco Gili, Geurt Jongbloed, Aad van der Vaart","doi":"10.1016/j.jspi.2025.106299","DOIUrl":"10.1016/j.jspi.2025.106299","url":null,"abstract":"<div><div>We consider nonparametric estimation of the distribution function <span><math><mi>F</mi></math></span> of squared sphere radii in the classical Wicksell problem. Under smoothness conditions on <span><math><mi>F</mi></math></span> in a neighborhood of <span><math><mi>x</mi></math></span>, in Gili et al. (2024) it is shown that the Isotonic Inverse Estimator (IIE) is asymptotically efficient and attains rate of convergence <span><math><msqrt><mrow><mi>n</mi><mo>/</mo><mo>log</mo><mi>n</mi></mrow></msqrt></math></span>. If <span><math><mi>F</mi></math></span> is constant on an interval containing <span><math><mi>x</mi></math></span>, the optimal rate of convergence increases to <span><math><msqrt><mrow><mi>n</mi></mrow></msqrt></math></span> and the IIE attains this rate adaptively, i.e. without explicitly using the knowledge of local constancy. However, in this case, the asymptotic distribution is not normal. In this paper, we introduce three <em>informed</em> projection-type estimators of <span><math><mi>F</mi></math></span>, which use knowledge on the interval of constancy and show these are all asymptotically equivalent and normal. Furthermore, we establish a local asymptotic minimax lower bound in this setting, proving that the three <em>informed</em> estimators are asymptotically efficient and a convolution result showing that the IIE is not efficient. We also derive the asymptotic distribution of the difference of the IIE with the efficient estimators, demonstrating that the IIE is <em>not</em> asymptotically equivalent to the <em>informed</em> estimators. Through a simulation study, we provide evidence that the performance of the IIE closely resembles that of its competitors, supporting the use of the IIE as the standard choice when no information about <span><math><mi>F</mi></math></span> is available.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"240 ","pages":"Article 106299"},"PeriodicalIF":0.8,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144088647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A nonparametric test for the heterogeneity of the spatial autoregressive parameter","authors":"Yangbing Tang , Jiang Du , Zhongzhan Zhang","doi":"10.1016/j.jspi.2025.106298","DOIUrl":"10.1016/j.jspi.2025.106298","url":null,"abstract":"<div><div>We propose a new test for the heterogeneity of the spatial autoregressive parameter in semiparametric varying-coefficient spatial autoregressive models. Our specification test is built on the difference of parametric and nonparametric estimates of the spatial autoregressive coefficient, where the two estimates are obtained by the sieve GMM estimation method. Under mild conditions, we derive the limiting null distribution, the local power property and consistency of the test statistic. Numerical simulations show promising performance of the proposed test for finite samples in the considered cases, and the crime data of Tokyo is analyzed to illustrate the usefulness of the test.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"240 ","pages":"Article 106298"},"PeriodicalIF":0.8,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143936935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Copula-based semiparametric nonnormal transformed linear model for survival data with dependent censoring","authors":"Huazhen Yu , Lixin Zhang","doi":"10.1016/j.jspi.2025.106296","DOIUrl":"10.1016/j.jspi.2025.106296","url":null,"abstract":"<div><div>Although the independent censoring assumption is commonly used in survival analysis, it can be violated when the censoring time is related to the survival time, which often happens in many practical applications. To address this issue, we propose a flexible semiparametric method for dependent censored data. Our approach involves fitting the survival time and the censoring time with a joint transformed linear model, where the transformed function is unspecified. This allows for a very general class of models that can account for possible covariate effects, while also accommodating administrative censoring. We assume that the transformed variables have a bivariate nonnormal distribution based on parametric copulas and parametric marginals, which further enhances the flexibility of our method. We demonstrate the identifiability of the proposed model and establish the consistency and asymptotic normality of the model parameters under appropriate regularity conditions and assumptions. Furthermore, we evaluate the performance of our method through extensive simulation studies, and provide a real data example for illustration.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"240 ","pages":"Article 106296"},"PeriodicalIF":0.8,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143927463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Maximum Projection Gini Correlation (MaGiC) for mixed categorical and numerical data","authors":"Hong Xiao , Radhakrishna Adhikari , Yixin Chen , Xin Dang","doi":"10.1016/j.jspi.2025.106294","DOIUrl":"10.1016/j.jspi.2025.106294","url":null,"abstract":"<div><div>We propose a projection correlation for measure of dependence between numerical multivariate variables and categorical variables. The projection correlation, defined as the maximum of the Gini correlations (i.e., MaGiC) between the categorical variable and the univariate projections of the multivariate vector, is non-parametric, and intuitively produces a high coefficient when the two variables are dependent, and zero when they are independent. We show that MaGiC possesses the property of nestedness, in that it is non-decreasing with the increasing number of features in the numerical vector, while remaining unchanged if additional numerical features are independent of the categorical variable and original features. We establish <span><math><msqrt><mrow><mi>n</mi></mrow></msqrt></math></span>-consistency of the sample projection correlation. A powerful <span><math><mi>K</mi></math></span>-sample test can be carried out via the MaGiC-based independence test. When compared with related correlation definitions for multivariate variables, MaGiC also enjoys a faster implementation, with the computational complexity <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mi>m</mi><mi>n</mi><mrow><mo>(</mo><mi>d</mi><mo>+</mo><mo>log</mo><mi>n</mi><mo>)</mo></mrow><mo>)</mo></mrow></mrow></math></span> where <span><math><mi>d</mi></math></span> is the dimension of the numerical variable, <span><math><mi>n</mi></math></span> is the sample size, and <span><math><mi>m</mi></math></span> is the number of projections performed, as opposed to <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mi>d</mi><mspace></mspace><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn></mrow></msup><mo>)</mo></mrow></mrow></math></span> for Gini correlation. We demonstrate these properties through simulation and application to real datasets.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"239 ","pages":"Article 106294"},"PeriodicalIF":0.8,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143874462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hao Jin , Jiating Hu , Ling Zhu , Shiyu Tian , Si Zhang
{"title":"M-procedures robust to structural changes detection under strong mixing heavy-tailed time series models","authors":"Hao Jin , Jiating Hu , Ling Zhu , Shiyu Tian , Si Zhang","doi":"10.1016/j.jspi.2025.106295","DOIUrl":"10.1016/j.jspi.2025.106295","url":null,"abstract":"<div><div>Many tests of change points resort to least squares estimation method, but it can lead to bias if these observations are heavy-tailed processes. The aim of this paper is to construct a ratio-typed test based on M-estimation, which avoids the long-range variance estimation and is robust to structural change detection under strong mixing series with heavy-tailed. The proposed test consisting of M-procedures has more utility in that it allows processes in the domain of attraction of a stable law with index <span><math><mrow><mi>κ</mi><mo>∈</mo><mrow><mo>(</mo><mn>0</mn><mo>,</mo><mn>2</mn><mo>)</mo></mrow></mrow></math></span>, not limited to <span><math><mrow><mo>(</mo><mn>1</mn><mo>,</mo><mn>2</mn><mo>)</mo></mrow></math></span>. Under some regular conditions, asymptotic distribution under the null hypothesis of no change is functional of a Brownian motion, and the divergent rate under the alternative hypothesis is also provided. Furthermore, the convergence rate of a ratio-typed change point estimator is established. Simulation study illustrates there is no distortion in empirical sizes, and empirical powers have satisfactory performance. Finally, two practical applications to real examples are presented as well.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"239 ","pages":"Article 106295"},"PeriodicalIF":0.8,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143891417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Pursuing sparsity and homogeneity for multi-source high-dimensional current status data","authors":"Xin Ye , Yanyan Liu","doi":"10.1016/j.jspi.2025.106293","DOIUrl":"10.1016/j.jspi.2025.106293","url":null,"abstract":"<div><div>Nowadays, current status data with high-dimensional predictors are prevalent in observational studies. However, for a single study, the high dimensionality and the presence of censoring pose substantial challenges to statistical analysis with limited sample size. Although integrative analysis has been widely regarded as an effective strategy to improve the estimation, the source-level heterogeneity has to be carefully addressed. In this paper, we propose an integrative analysis method for multi-source high-dimensional current status data, which can simultaneously identify the homogeneity/heterogeneity structure and select important variables. We prove that the proposed approach attains consistency in estimation, sparsity recovery, and the pursuit of homogeneity. Extensive simulation studies have been carried out to assess the finite sample performance of the proposed method. A real data analysis of multi-source ovarian cancer recurrence studies further demonstrates its practical applicability.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"239 ","pages":"Article 106293"},"PeriodicalIF":0.8,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143891416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhihao Hu , Shyam Ranganathan , Yang Shao , Xinwei Deng
{"title":"Neighborhood VAR: Efficient estimation of multivariate timeseries with neighborhood information","authors":"Zhihao Hu , Shyam Ranganathan , Yang Shao , Xinwei Deng","doi":"10.1016/j.jspi.2025.106277","DOIUrl":"10.1016/j.jspi.2025.106277","url":null,"abstract":"<div><div>Vector autoregression (VAR) models are popular in modeling multivariate time series in data sciences and other areas. When the number of time series is large, the number of parameters in the VAR model increases dramatically, posing great challenges for proper model estimation and inference. In this work, we propose a so-called neighborhood vector autoregression (NVAR) model to efficiently analyze large-dimensional multivariate time series. We assume that the time series have underlying neighborhood relationships, e.g., spatial or network, among them based on the inherent setting of the problem. When this neighborhood information is available or can be summarized using a distance matrix, we demonstrate that our proposed NVAR method provides a computationally efficient and theoretically sound estimation of model parameters. The performance of the proposed method is compared with other existing approaches in both simulation studies and a real-data application in environmental science.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"239 ","pages":"Article 106277"},"PeriodicalIF":0.8,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143768174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}