{"title":"Bayesian modelling of nonlinear negative binomial integer-valued GARCHX models","authors":"Cathy W. S. Chen, K. Khamthong","doi":"10.1177/1471082X19845541","DOIUrl":"https://doi.org/10.1177/1471082X19845541","url":null,"abstract":"This study focuses on modelling dengue cases in northeastern Thailand through two meteorological covariates: cumulative rainfall and average maximum temperature. We propose two nonlinear integer-valued GARCHX models (Markov switching and threshold specification) with a negative binomial distribution, as they take into account the stylized features of weekly dengue haemorrhagic fever cases, which contain nonlinear dynamics, lagged dependence, overdispersion, consecutive zeros and asymmetric effects of meteorological covariates. We conduct parameter estimation and one-step-ahead forecasting for two proposed models based on Bayesian Markov chain Monte Carlo (MCMC) methods. A simulation study illustrates that the adaptive MCMC sampling scheme performs well. The empirical results offer strong support for the Markov switching integer-valued GARCHX model over its competitors via Bayes factor and deviance information criterion. We also provide one-step-ahead forecasting based on the prediction interval that offers a useful early warning signal of outbreak detection.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":"20 1","pages":"537 - 561"},"PeriodicalIF":1.0,"publicationDate":"2019-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1471082X19845541","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43760628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Flexible parametric multistate modelling of employment history","authors":"A. Hout, Wen. Tan","doi":"10.1177/1471082X19836299","DOIUrl":"https://doi.org/10.1177/1471082X19836299","url":null,"abstract":"Abstract A multistate model is used to describe employment history. Transition-specific rates are defined using generalized gamma distributions and Gompertz distributions. This flexible parametric modelling of the rate of change is combined with latent classes for unobserved propensity to change jobs. The propensity is described by two latent classes which can be interpreted as consisting of movers and stayers. The modelling is illustrated by analysing longitudinal data from the German Life History Study.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":"19 1","pages":"323 - 338"},"PeriodicalIF":1.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1471082X19836299","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42077348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Wesley Bertoli, K. S. Conceição, M. Andrade, F. Louzada
{"title":"A Bayesian approach for some zero-modified Poisson mixture models","authors":"Wesley Bertoli, K. S. Conceição, M. Andrade, F. Louzada","doi":"10.1177/1471082X19841984","DOIUrl":"https://doi.org/10.1177/1471082X19841984","url":null,"abstract":"In this article, we propose a class of zero-modified Poisson mixture models as an alternative to model overdispersed count data exhibiting inflation or deflation of zeros. A relevant feature of this class is that the zero modification can be incorporated using a zero truncation process and consequently, the proposed models can be expressed in the hurdle version. This procedure leads to the fact that the proposed models can be fitted without any previous information about the zero modification present in agiven dataset. A fully Bayesian approach has been considered for estimation and inference concerns. Three different simulation studies have been conducted to illustrate the performance of the developed methodology. The usefulness of the proposed class of models has been assessed by using three real datasets provided by the literature. A general model comparison with some well-known discrete distributions has been presented.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":"20 1","pages":"467 - 501"},"PeriodicalIF":1.0,"publicationDate":"2019-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1471082X19841984","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46956187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Bayesian approach for the segmentation of series with a functional effect","authors":"M. Baragatti, K. Bertin, É. Lebarbier, C. Meza","doi":"10.1177/1471082X18755539","DOIUrl":"https://doi.org/10.1177/1471082X18755539","url":null,"abstract":"Abstract: In some application fields, series are affected by two different types of effects: abrupt changes (or change-points) and functional effects. We propose here a Bayesian approach that allows us to estimate these two parts. Here, the underlying piecewise-constant part (associated to the abrupt changes) is expressed as the product of a lower triangular matrix by a sparse vector and the functional part as a linear combination of functions from a large dictionary where we want to select the relevant ones. This problem can thus lead to a global sparse estimation and a stochastic search variable selection approach is used to this end. The performance of our proposed method is assessed using simulation experiments. Applications to three real datasets from geodesy, agronomy and economy fields are also presented.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":"19 1","pages":"194 - 220"},"PeriodicalIF":1.0,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1471082X18755539","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42525333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Bayesian two-stage regression approach of analysing longitudinal outcomes with endogeneity and incompleteness","authors":"P. Bhuyan, J. Biswas, Pulak Ghosh, Kiranmoy Das","doi":"10.1177/1471082X17747806","DOIUrl":"https://doi.org/10.1177/1471082X17747806","url":null,"abstract":"Abstract: Two-stage regression methods are typically used for handling endogeneity in the simultaneous equations models in economics and other social sciences. However, the problem is challenging in the presence of incomplete response and/or incomplete endogenous covariate(s). We propose a Bayesian approach for the joint modelling of incomplete longitudinal continuous response and an incomplete count endogenous covariate, where the incompleteness is caused by the censorship through a selection mechanism. We define latent continuous variables which are left-censored at zero and develop a Gibbs sampling algorithm for the simultaneous estimation of the model parameters. We consider partially varying coefficients regression models containing covariates with fixed and time-varying effects on the response. Our work is motivated by a sample dataset from the Health and Retirement Study (HRS) for modelling the out-of-pocket medical cost, where the number of hospital admissions is considered as an endogenous covariate. Our analysis addresses some of the previously unanswered questions on the physical and financial health of the older population based on HRS data. Simulation studies are performed for assessing the usefulness of the proposed method compared to its competitors.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":"19 1","pages":"157 - 173"},"PeriodicalIF":1.0,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1471082X17747806","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41822036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Estimating transition coefficients for reconstructing coherent series of mortality by cause of Death","authors":"C. Camarda","doi":"10.1177/1471082X19832398","DOIUrl":"https://doi.org/10.1177/1471082X19832398","url":null,"abstract":"Abstract Regular revisions of the classification of diseases and the consequent disruptions of mortality series are well-known issues in long-term cause-of-death analysis. Given basic assumptions and medical knowledge about possible exchanges across causes of death in the revision years, redistribution of counts of causes of death into a new classification can be viewed as a constrained optimization problem. Penalized likelihood within a quadratic programming framework allows estimation of exchanges that vary smoothly over age groups. The approach is illustrated using both German data on malignant neoplasms and French data on heart diseases.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":"19 1","pages":"299 - 322"},"PeriodicalIF":1.0,"publicationDate":"2019-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1471082X19832398","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46908544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Random scaling factors in Bayesian distributional regression models with an application to real estate data","authors":"Alexander Razen, S. Lang","doi":"10.1177/1471082X18823099","DOIUrl":"https://doi.org/10.1177/1471082X18823099","url":null,"abstract":"Distributional structured additive regression provides a flexible framework for modelling each parameter of a potentially complex response distribution in dependence of covariates. Structured additive predictors allow for an additive decomposition of covariate effects with non-linear effects and time trends, unit- or cluster-specific heterogeneity, spatial heterogeneity and complex interactions between covariates of different type. Within this framework, we present a simultaneous estimation approach for multiplicative random effects that allow for cluster-specific heterogeneity with respect to the scaling of a covariate′s effect. More specifically, a possibly non-linear function f(z) of a covariate z may be scaled by a multiplicative and possibly spatially correlated cluster-specific random effect (1+αc). Inference is fully Bayesian and is based on highly efficient Markov Chain Monte Carlo (MCMC) algorithms. We investigate the statistical properties of our approach within extensive simulation experiments for different response distributions. Furthermore, we apply the methodology to German real estate data where we identify significant district-specific scaling factors. According to the deviance information criterion, the models incorporating these factors perform significantly better than standard models without (spatially correlated) random scaling factors.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":"20 1","pages":"347 - 368"},"PeriodicalIF":1.0,"publicationDate":"2019-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1471082X18823099","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45220294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Exploring and modelling team performances of the Kaggle European Soccer database","authors":"M. Carpita, E. Ciavolino, Paola Pasca","doi":"10.1177/1471082X18810971","DOIUrl":"https://doi.org/10.1177/1471082X18810971","url":null,"abstract":"This study explores a big and open database of soccer leagues in 10 European countries. Data related to players, teams and matches covering seven seasons (from 2009/2010 to 2015/2016) were retrieved from Kaggle, an online platform in which big data are available for predictive modelling and analytics competition among data scientists. Based on both preliminary data analysis, experts’ evaluation and players’ position on the football pitch, role-based indicators of teams’ performance have been built and used to estimate the win probability of the home team with the binomial logistic regression (BLR) model that has been extended including the ELO rating predictor and two random effects due to the hierarchical structure of the dataset. The predictive power of the BLR model and its extensions has been compared with the one of other statistical modelling approaches (Random Forest, Neural Network, k-NN, Naïve Bayes). Results showed that role-based indicators substantially improved the performance of all the models used in both this work and in previous works available on Kaggle. The base BLR model increased prediction accuracy by 10 percentage points, and showed the importance of defence performances, especially in the last seasons. Inclusion of both ELO rating predictor and the random effects did not substantially improve prediction, as the simpler BLR model performed equally good. With respect to the other models, only Naïve Bayes showed more balanced results in predicting both win and no-win of the home team.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":"19 1","pages":"101 - 74"},"PeriodicalIF":1.0,"publicationDate":"2019-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1471082X18810971","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47485726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}