Deniz A. Sanchez S., Rubén D. Guevara G., Sergio A. Calderón V.
{"title":"Comparison of two statistical methodologies for a binary classification problem of two-dimensional images","authors":"Deniz A. Sanchez S., Rubén D. Guevara G., Sergio A. Calderón V.","doi":"10.1080/02664763.2023.2279012","DOIUrl":"https://doi.org/10.1080/02664763.2023.2279012","url":null,"abstract":"The present work intends to compare two statistical classification methods using images as covariates and under the comparison criterion of the ROC curve. The first implemented procedure is based o...","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138514116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"GWR-assisted integrated estimator of finite population total under two-phase sampling: a model-assisted approach","authors":"Nobin Chandra Paul, Anil Rai, Tauqueer Ahmad, Ankur Biswas, Prachi Misra Sahoo","doi":"10.1080/02664763.2023.2280879","DOIUrl":"https://doi.org/10.1080/02664763.2023.2280879","url":null,"abstract":"AbstractIn survey sampling, auxiliary information is used to precisely estimate the finite population parameters. There are several approaches available in the literature that provide a practical method for incorporating auxiliary information during the estimation stage. In order to effectively utilize the auxiliary information, a geographically weighted regression (GWR) model-assisted integrated estimator of finite population total under a two-phase sampling design has been proposed in this article. Spatial simulation studies have been conducted to empirically assess the statistical properties of the proposed estimator. In the presence of spatial non-stationarity, empirical findings reveal that the proposed estimator outperforms all existing estimators such as two-phase HT, ratio, and regression estimators, demonstrating the importance of spatial information in survey sampling.KEYWORDS: Data integrationgeographically weighted regressionmodel-assisted approachspatial non-stationaritytwo-phase regression AcknowledgementThe authors are thankful to the blind reviewers for providing valuable suggestions that have greatly enhanced the quality of the article. The first author would like to express his heartfelt gratitude to the ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India, for providing the real CCE survey data, lab facilities, and overall support to conduct the research work during Ph.D. programme.Disclosure statementNo potential conflict of interest was reported by the authors.Data availability statementData sharing is not applicable.","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134900822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Forecasting of the true satellite carbon monoxide data with ensemble empirical mode decomposition, singular value decomposition and moving average","authors":"Sameer Poongadan, M. C. Lineesh","doi":"10.1080/02664763.2023.2277115","DOIUrl":"https://doi.org/10.1080/02664763.2023.2277115","url":null,"abstract":"AbstractThe forecasting of carbon monoxide in the atmosphere is essential as it causes the pollution of the atmosphere and hence severe health problems for humans. This study proposes a time-series prognosis EEMD-SVD-MA technique which incorporates Ensemble Empirical Mode Decomposition, Singular Value Decomposition and Moving Average, to predict the prospects of carbon monoxide data taken from the Indian region. The collected data are non-linear. The technique can be applied for non-stationary and non-linear data. In this approach, there are three levels: EEMD level, SVD level and MA level. The first level deploys EEMD to fragment data series into a limited number of Intrinsic Mode Function (IMF) components along with a residue. To denoise each IMF component, SVD is deployed in the second level. In the third level, each denoised IMF component is predicted by MA. The future values of the original data are obtained by adding all the predicted series of the components. In this study, we proposed two variants of the model: EEMD-SVD-MA(3) and EEMD-SVD-MA(4) and compared the results with other forecasting techniques, namely LSTM (Long Short Term Memory network), EMD-LSTM, EMD-MA, EEMD-MA and CEEMDAN-MA. The results show that the proposed EEMD-SVD-MA model is more efficient than other models.Keywords: Intrinsic mode functionempirical mode decompositionensemble empirical mode decompositionsingular value decompositionmoving averagelong short term memory networkMathematics Subject Classifications: 37M1068T0715A18 AcknowledgmentsThe author's deep appreciation goes out to NASA's teams for AIRS/AMSU, MODIS and MOPPIT data for tropospheric CO.Disclosure statementNo potential conflict of interest was reported by the author(s).","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134900804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Phase II control charts for monitoring the depth-ratio of ball-bearings involving three normal variables","authors":"Li Jin, Amitava Mukherjee, Zhi Song, Jiujun Zhang","doi":"10.1080/02664763.2023.2279015","DOIUrl":"https://doi.org/10.1080/02664763.2023.2279015","url":null,"abstract":"AbstractThis paper investigates the problem of monitoring the ratio involving three variables, jointly distributed as trivariate normal. The Shewhart-type and two exponentially weighted moving average (EWMA) type schemes for monitoring depth ratio are proposed. The ratio of a normal variable to the average of two other normal variables has wide applications in natural science, production, and engineering. It is defined with slightly different terminology in various contexts, such as depth or aspect ratios. In modern bearing manufacturing, the aspect ratio of width to the average of inner and outer diameters can be an essential indicator of product quality and process stability. While there are many helpful existing charts for monitoring the three components separately or jointly when these characteristics follow a normal distribution, the ratio aspect is often ignored. The Shewhart-type schemes' exact and approximated control limits are considered and analyzed. Numerical results based on Monte-Carlo are conducted using the average run length as a metric with different values of in-control ratio and correlation between the three variables. An application based on the parts manufacturing data illustrates the implementation design of the two control charts. The real-life data analysis shows the efficacy of the proposed monitoring schemes in practice.Keywords: Charting schemesparts manufacturingratio involving three variablesphase-II process monitoringtrivariate normal AcknowledgmentsThe authors are grateful to the Editor-in-chief, Associate Editor and three reviewers for various constructive comments and suggestions.Disclosure statementNo potential conflict of interest was reported by the author(s).Notes1 https://products.emersonbearing.com/viewitems/deep-groove-radial-ball-bearings/6300-series-deep-groove-radial-ball-bearingsAdditional informationFundingThis work was supported by the National Natural Science Foundation of China [Grant Nos. 12171328,12201429]; the Beijing Natural Science Foundation [Grant No. Z210003]; the Liaoning BaiQianWan Talents Program; the Natural Science Foundation of Liaoning Province [Grant Nos. 2020-MS-139, 2023-MS-142]; the Scientific Research Fund of Liaoning Provincial Education Department of China [Grant No. LJC202006]; the Project of Science and Research of Hebei Educational Department of China [Grant No. ZD2022020]; the Doctoral Research Start-up Fund of Liaoning Province [Grant No. 2021-BS-142]; the Research on Humanities and Social Sciences of the Ministry of Education [Grant No. 22YJC910009]; and the Research of economic and social development in Liaoning Province [Grant No. 20231s1ybkt-103].","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135341864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Rosa M. García-Fernández, Federico Palacios-González
{"title":"Smoothing level selection for density estimators based on the moments","authors":"Rosa M. García-Fernández, Federico Palacios-González","doi":"10.1080/02664763.2023.2277125","DOIUrl":"https://doi.org/10.1080/02664763.2023.2277125","url":null,"abstract":"AbstractThis paper introduces an approach to select the bandwidth or smoothing parameter in multiresolution (MR) density estimation and nonparametric density estimation. It is based on the evolution of the second, third and fourth central moments and the shape of the estimated densities for different bandwidths and resolution levels. The proposed method has been applied to density estimation by means of multiresolution densities as well as kernel density estimation (MRDE and KDE respectively). The results of the simulations and the empirical application demonstrate that the level of resolution resulting from the moments method performs better with multimodal densities than the Bayesian Information Criterion (BIC) for multiresolution densities estimation and the plug-in for kernel densities estimation.KEYWORDS: Multiresolution density estimationkernel density estimationbandwidthmoments and level of resolution Disclosure statementNo potential conflict of interest was reported by the author(s).Notes1 The multiresolution densities are a particular case of semiparametric models (see, [Citation12,Citation14]).2 This is a well-known fact underlying all the bandwidth selection methods.3 Remind that these intervals form a partition of the real line and their amplitude converges to zero as j increases.4 Unless this is done parametrically using the EM algorithm on a mixture model of three double exponential distributions. But for a sample of size 10,000 the process time is too long.5 Note that the values for the Gini coefficient can differ from other publications since our illustration is based on gross income instead of net income.6 The expected value of the density is zero and the central and non-central moments are equal.","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135474671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A computationally efficient sequential regression imputation algorithm for multilevel data","authors":"Tugba Akkaya Hocagil, Recai M. Yucel","doi":"10.1080/02664763.2023.2277669","DOIUrl":"https://doi.org/10.1080/02664763.2023.2277669","url":null,"abstract":"ABSTRACTDue to the computational burden, especially in high-dimensional settings, sequential imputation may not be practical. In this paper, we adopt computationally advantageous methods by sampling the missing data from their perspective predictive distributions, which leads to significantly improved computation time in the class of variable-by-variable imputation algorithms. We assess the computational performance in a comprehensive simulation study. We then compare and contrast the performance of our algorithm with commonly used alternatives. The results show that our method has a significant advantage over the commonly used alternatives with respect to computational efficiency and inferential quality. Finally, we demonstrate our methods in a substantive problem aimed at investigating the effects of area-level behavioral, socioeconomic, and demographic characteristics on poor birth outcomes in New York State among singleton births.KEYWORDS: Sequential regression imputationmultilevel datacomputational efficiencyfast variable by variable imputationmultiple imputation by chained equations AcknowledgmentsWe thank Dr. Tabassum Insaf for providing assistance in accessing the New York State Vital Records Registry data.Disclosure statementNo potential conflict of interest was reported by the author(s).","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135635744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An exact projection pursuit-based algorithm for multivariate two-sample nonparametric testing applicable to retrospective and group sequential studies","authors":"Li Zou, Gregory Gurevich, Ablert Vexler","doi":"10.1080/02664763.2023.2277118","DOIUrl":"https://doi.org/10.1080/02664763.2023.2277118","url":null,"abstract":"AbstractNonparametric tests for equality of multivariate distributions are frequently desired in research. It is commonly required that test-procedures based on relatively small samples of vectors accurately control the corresponding Type I Error (TIE) rates. Often, in the multivariate testing, extensions of null-distribution-free univariate methods, e.g., Kolmogorov-Smirnov and Cramér-von Mises type schemes, are not exact, since their null distributions depend on underlying data distributions. The present paper extends the density-based empirical likelihood technique in order to nonparametrically approximate the most powerful test for the multivariate two-sample (MTS) problem, yielding an exact finite-sample test statistic. We rigorously apply one-to-one-mapping between the equality of vectors' distributions and the equality of distributions of relevant univariate linear projections. We establish a general algorithm that simplifies the use of projection pursuit, employing only a few of the infinitely many linear combinations of observed vectors' components. The displayed distribution-free strategy is employed in retrospective and group sequential manners. A novel MTS nonparametric procedure in the group sequential manner is proposed. The asymptotic consistency of the proposed technique is shown. Monte Carlo studies demonstrate that the proposed procedures exhibit extremely high and stable power characteristics across a variety of settings. Supplementary materials for this article are available online.KEYWORDS: Density-based empirical likelihoodexact testmultivariate two-sample testnonparametric testprojection pursuit AcknowledgementWe are grateful to the Editor, the AE and two reviewers for helpful comments.Disclosure statementNo potential conflict of interest was reported by the author(s).","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135636034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust estimation and bias-corrected empirical likelihood in generalized linear models with right censored data","authors":"Liugen Xue, Junshan Xie, Xiaohui Yang","doi":"10.1080/02664763.2023.2277117","DOIUrl":"https://doi.org/10.1080/02664763.2023.2277117","url":null,"abstract":"AbstractIn this paper, we study the robust estimation and empirical likelihood for the regression parameter in generalized linear models with right censored data. A robust estimating equation is proposed to estimate the regression parameter, and the resulting estimator has consistent and asymptotic normality. A bias-corrected empirical log-likelihood ratio statistic of the regression parameter is constructed, and it is shown that the statistic converges weakly to a standard χ2 distribution. The result can be directly used to construct the confidence region of regression parameter. We use the bias correction method to directly calibrate the empirical log-likelihood ratio, which does not need to be multiplied by an adjustment factor. We also propose a method for selecting the tuning parameters in the loss function. Simulation studies show that the estimator of the regression parameter is robust and the bias-corrected empirical likelihood is better than the normal approximation method. An example of a real dataset from Alzheimer's disease studies shows that the proposed method can be applied in practical problems.Keywords: Generalized linear modelright censored datarobust estimationempirical likelihoodregression parameter AcknowledgmentsThe authors thank the Editor, Associate Editor and two referees for their helpful comments. The dataset used was provided by Dr. Chunling Liu of the Hong Kong Polytechnic University. The source of this dataset is available on https://adni.loni.usc.edu/about/.Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThe research were supported by the National Natural Science Foundation of China (11971001), the Natural Science Foundation of Henan (222300420417), and the Science and Technology Project (2103004).","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135819996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Estimation of time-varying kernel densities and chronology of the impact of COVID-19 on financial markets","authors":"Matthieu Garcin, Jules Klein, Sana Laaribi","doi":"10.1080/02664763.2023.2272226","DOIUrl":"https://doi.org/10.1080/02664763.2023.2272226","url":null,"abstract":"The time-varying kernel density estimation relies on two free parameters: the bandwidth and the discount factor. We propose to select these parameters so as to minimize a criterion consistent with the traditional requirements of the validation of a probability density forecast. These requirements are both the uniformity and the independence of the so-called probability integral transforms, which are the forecast time-varying cumulated distributions applied to the observations. We thus build a new numerical criterion incorporating both the uniformity and independence properties by the mean of an adapted Kolmogorov-Smirnov statistic. We apply this method to financial markets during the COVID-19 crisis. We determine the time-varying density of daily price returns of several stock indices and, using various divergence statistics, we are able to describe the chronology of the crisis as well as regional disparities. For instance, we observe a more limited impact of COVID-19 on financial markets in China, a strong impact in the US, and a slow recovery in Europe.","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136018115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A longitudinal study of the influence of air pollutants on children: a robust multivariate approach","authors":"Ian Meneghel Danilevicz, Pascal Bondon, Valdério Anselmo Reisen, Faradiba Sarquis Serpa","doi":"10.1080/02664763.2023.2272228","DOIUrl":"https://doi.org/10.1080/02664763.2023.2272228","url":null,"abstract":"","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136104508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}