Statistical Modelling最新文献

筛选
英文 中文
A statistical modelling approach to feedforward neural network model selection 前馈神经网络模型选择的统计建模方法
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-09-17 DOI: 10.1177/1471082x241258261
Andrew McInerney, Kevin Burke
{"title":"A statistical modelling approach to feedforward neural network model selection","authors":"Andrew McInerney, Kevin Burke","doi":"10.1177/1471082x241258261","DOIUrl":"https://doi.org/10.1177/1471082x241258261","url":null,"abstract":"Feedforward neural networks (FNNs) can be viewed as non-linear regression models, where covariates enter the model through a combination of weighted summations and non-linear functions. Although these models have some similarities to the approaches used within statistical modelling, the majority of neural network research has been conducted outside of the field of statistics. This has resulted in a lack of statistically based methodology, and, in particular, there has been little emphasis on model parsimony. Determining the input layer structure is analogous to variable selection, while the structure for the hidden layer relates to model complexity. In practice, neural network model selection is often carried out by comparing models using out-of-sample performance. However, in contrast, the construction of an associated likelihood function opens the door to information-criteria-based variable and architecture selection. A novel model selection method, which performs both input- and hidden-node selection, is proposed using the Bayesian information criterion (BIC) for FNNs. The choice of BIC over out-of-sample performance as the model selection objective function leads to an increased probability of recovering the true model, while parsimoniously achieving favourable out-of-sample performance. Simulation studies are used to evaluate and justify the proposed method, and applications on real data are investigated.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Skellam distribution revisited: Estimating the unobserved incoming and outgoing ICU COVID-19 patients on a regional level in Germany 重新审视斯凯拉姆分布:估算德国地区一级未观察到的 ICU COVID-19 病人进出情况
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-05-27 DOI: 10.1177/1471082x241235024
Martje Rave, Göran Kauermann
{"title":"The Skellam distribution revisited: Estimating the unobserved incoming and outgoing ICU COVID-19 patients on a regional level in Germany","authors":"Martje Rave, Göran Kauermann","doi":"10.1177/1471082x241235024","DOIUrl":"https://doi.org/10.1177/1471082x241235024","url":null,"abstract":"With the beginning of the COVID-19 pandemic, we became aware of the need for comprehensive data collection and its provision to scientists and experts for proper data analyses. In Germany, the Robert Koch Institute (RKI) has tried to keep up with this demand for data on COVID-19, but there were (and still are) relevant data missing that are needed to understand the whole picture of the pandemic. In this article, we take a closer look at the severity of the course of COVID-19 in Germany, for which ideal information would be the number of incoming patients to ICU units. This information was (and still is) not available. Instead, the current occupancy of ICU units on the district level was reported daily. We demonstrate how this information can be used to predict the number of incoming as well as released COVID-19 patients using a stochastic version of the Expectation Maximization algorithm (SEM). This, in turn, allows for estimating the influence of district-specific and age-specific infection rates as well as further covariates, including spatial effects, on the number of incoming patients. The article demon-strates that even if relevant data are not recorded or provided officially, statistical modelling allows for reconstructing them. This also includes the quantification of uncertainty which naturally results from the application of the SEM algorithm.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141170217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian effect selection in structured additive quantile regression 结构化加性量子回归中的贝叶斯效应选择
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-05-23 DOI: 10.1177/1471082x241242617
Anja Rappl, Manuel Carlan, Thomas Kneib, Sebastiaan Klokman, Elisabeth Bergherr
{"title":"Bayesian effect selection in structured additive quantile regression","authors":"Anja Rappl, Manuel Carlan, Thomas Kneib, Sebastiaan Klokman, Elisabeth Bergherr","doi":"10.1177/1471082x241242617","DOIUrl":"https://doi.org/10.1177/1471082x241242617","url":null,"abstract":"Bayesian structured additive quantile regression is an established tool for regressing outcomes with unknown distributions on a set of explanatory variables and/or when interest lies with effects on the more extreme values of the outcome. Even though variable selection for quantile regression exists, its scope is limited. We propose the use of the Normal Beta Prime Spike and Slab (NBPSS) prior in Bayesian quantile regression to aid the researcher in not only variable but also effect selection. We compare the Bayesian NBPSS approach to statistical boosting for quantile regression, a current standard in automated variable selection in quantile regression, in a simulation study with varying degrees of model complexity and illustrate both methods on an example of childhood malnutrition in Nigeria. The NBPSS prior shows good performance in variable and effect selection as well as prediction compared to boosting and can thus be recommended as an additional tool for quantile regression model building.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141105754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling dependence in football match outcomes: Traditional assumptions and an alternative proposal 足球比赛结果的依赖性建模:传统假设和替代方案
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-05-15 DOI: 10.1177/1471082x241238802
M. Petretta, Lorenzo Schiavon, Jacopo Diquigiovanni
{"title":"Modelling dependence in football match outcomes: Traditional assumptions and an alternative proposal","authors":"M. Petretta, Lorenzo Schiavon, Jacopo Diquigiovanni","doi":"10.1177/1471082x241238802","DOIUrl":"https://doi.org/10.1177/1471082x241238802","url":null,"abstract":"The approaches routinely used to model the outcomes of football matches are characterized by strong assumptions about the dependence between the number of goals scored by the two competing teams and their marginal distribution. In this work, we argue that the assumptions traditionally made are not always based on solid arguments. Although most of these assumptions have been relaxed in the recent literature, the model introduced by Dixon and Coles in 1997 still represents a point of reference in the betting industry. While maintaining its conceptual simplicity, alternatives based on modelling the conditional distributions allow for the specification of more comprehensive dependence structures. In view of this, we propose a straightforward modification of the usual Poisson marginal models by means of thoroughly chosen marginal and conditional distributions. Careful model validation is provided, and a real data application involving five European leagues is conducted. The novel dependence structure allows to extract key insights on league dynamics and presents practical gains in several betting scenarios.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140975487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian hierarchical model for predicting rates of oxygen consumption in mechanically ventilated intensive care patients 用于预测机械通气重症监护患者耗氧量的贝叶斯分层模型
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-05-14 DOI: 10.1177/1471082x241238810
Luke Hardcastle, S. S. Livingstone, Claire Black, Federico Ricciardi, Gianluca Baio
{"title":"A Bayesian hierarchical model for predicting rates of oxygen consumption in mechanically ventilated intensive care patients","authors":"Luke Hardcastle, S. S. Livingstone, Claire Black, Federico Ricciardi, Gianluca Baio","doi":"10.1177/1471082x241238810","DOIUrl":"https://doi.org/10.1177/1471082x241238810","url":null,"abstract":"Patients who are mechanically ventilated in the Intensive Care Unit participate in exercise as a component of their rehabilitation to ameliorate the long-term impact of critical illness on their physical function. The effective implementation of these programmes is limited, however, as clinicians do not have access to a patient's [Formula: see text] values, a physiological measure that quantifies an individual patient's exercise intensity level in real-time. In this work we have developed a Bayesian hierarchical model with temporally correlated latent Gaussian processes to predict [Formula: see text] using readily available physiological data, providing clinicians with information to personalise rehabilitation sessions in real-time. The model was fitted using the Integrated Nested Laplace Approximation and validated using posterior predictive checks, and the impact of alternate specifications of the latent process was examined. Assessed using leave-one-patientout cross-validation, we show that the ability to provide probabilistic statements describing classification uncertainty gives the model favourable predictive power compared to a state-of-the-art comparator based on the oxygen uptake efficiency slope, with a more than seven-fold increase in accuracy in identifying when a patient is at risk of over-exertion.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140978977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel mixture model for characterizing human aiming performance data 用于描述人类瞄准表演数据特征的新型混合模型
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-04-25 DOI: 10.1177/1471082x241234139
Yanxi Li, Derek S. Young, Julien Gori, Olivier Rioul
{"title":"A novel mixture model for characterizing human aiming performance data","authors":"Yanxi Li, Derek S. Young, Julien Gori, Olivier Rioul","doi":"10.1177/1471082x241234139","DOIUrl":"https://doi.org/10.1177/1471082x241234139","url":null,"abstract":"Fitts’ law is often employed as a predictive model for human movement, especially in the field of human-computer interaction. Models with an assumed Gaussian error structure are usually adequate when applied to data collected from controlled studies. However, observational data (often referred to as data gathered ‘in the wild’) typically display noticeable positive skewness relative to a mean trend as users do not routinely try to minimize their task completion time. As such, the exponentially modified Gaussian (EMG) regression model has been applied to aimed movements data. However, it is also of interest to reasonably characterize those regions where a user likely was not trying to minimize their task completion time. In this article, we propose a novel model with a two-component mixture structure—one Gaussian and one exponential—on the errors to identify such a region. An expectation-conditional-maximization (ECM) algorithm is developed for estimation of such a model and some properties of the algorithm are established. The efficacy of the proposed model, as well as its ability to inform model-based clustering, are addressed in this work through extensive simulations and an insightful analysis of a human aiming performance study.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140801940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast, effective, and coherent time series modelling using the sparsity-ranked lasso 利用稀疏性排序套索进行快速、有效和连贯的时间序列建模
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-03-08 DOI: 10.1177/1471082x231225307
Ryan Peterson, Joseph Cavanaugh
{"title":"Fast, effective, and coherent time series modelling using the sparsity-ranked lasso","authors":"Ryan Peterson, Joseph Cavanaugh","doi":"10.1177/1471082x231225307","DOIUrl":"https://doi.org/10.1177/1471082x231225307","url":null,"abstract":"The sparsity-ranked lasso (SRL) has been developed for model selection and estimation in the presence of interactions and polynomials. The main tenet of the SRL is that an algorithm should be more sceptical of higher-order polynomials and interactions a priori compared to main effects, and hence the inclusion of these more complex terms should require a higher level of evidence. In time series, the same idea of ranked prior scepticism can be applied to characterize the potentially complex seasonal autoregressive (AR) structure of a series during the model fitting process, becoming especially useful in settings with uncertain or multiple modes of seasonality. The SRL can naturally incorporate exogenous variables, with streamlined options for inference and/or feature selection. The fitting process is quick even for large series with a high-dimensional feature set. In this work, we discuss both the formulation of this procedure and the software we have developed for its implementation via the fastTS R package. We explore the performance of our SRL-based approach in a novel application involving the autoregressive modelling of hourly emergency room arrivals at the University of Iowa Hospitals and Clinics. We find that the SRL is considerably faster than its competitors, while generally producing more accurate predictions.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140071345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Taking advantage of sampling designs in spatial small-area survey studies 在小区域空间调查研究中利用抽样设计的优势
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-03-05 DOI: 10.1177/1471082x231226287
Carlos Vergara-Hernández, Marc Marí-Dell’Olmo, Laura Oliveras, Miguel Angel Martinez-Beneito
{"title":"Taking advantage of sampling designs in spatial small-area survey studies","authors":"Carlos Vergara-Hernández, Marc Marí-Dell’Olmo, Laura Oliveras, Miguel Angel Martinez-Beneito","doi":"10.1177/1471082x231226287","DOIUrl":"https://doi.org/10.1177/1471082x231226287","url":null,"abstract":"Spatial small area estimation models have become very popular in some contexts, such as disease mapping. Data in disease mapping studies are exhaustive, that is, the available data are supposed to be a complete register of all the observable events. In contrast, some other small area studies do not use exhaustive data, such as survey based studies, where a particular sampling design is typically followed and inferences are later extrapolated to the entire population. In this article we propose a spatial model for small area survey studies, taking advantage of spatial dependence between units, which is the key assumption used for yielding reliable estimates in exhaustive data based studies. In addition, and in contrast to most survey-based spatial studies, we also take into account information on the sampling design and additional supplementary variables to obtain estimates in small areas. This makes it possible to merge spatial and sampling models into a common proposal.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140044367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Copula-based pairwise estimator for quantile regression with hierarchical missing data 基于 Copula 的分层缺失数据量化回归成对估计器
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-02-28 DOI: 10.1177/1471082x231225806
Anneleen Verhasselt, Alvaro J. Flórez, Geert Molenberghs, Ingrid Van Keilegom
{"title":"Copula-based pairwise estimator for quantile regression with hierarchical missing data","authors":"Anneleen Verhasselt, Alvaro J. Flórez, Geert Molenberghs, Ingrid Van Keilegom","doi":"10.1177/1471082x231225806","DOIUrl":"https://doi.org/10.1177/1471082x231225806","url":null,"abstract":"Quantile regression can be a helpful technique for analysing clustered (such as longitudinal) data. It can characterize the change in response over time without making distributional assumptions and is robust to outliers in the response. A quantile regression model using a copula-based multivariate asymmetric Laplace distribution for addressing correlation due to clustering is introduced. Furthermore, we propose a pairwise estimator for the parameters of the model. Since it is based on pseudo-likelihood, it needs to be modified to avoid bias in presence of missingness. Therefore, we enhance the model with inverse probability weighting. In this way, our proposal is unbiased under the missing at random assumption. Based on simulations, the estimator is efficient and computationally fast. Finally, the methodology is illustrated using a study in ophthalmology.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140003632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation for vector autoregressive model under multivariate skew-t-normal innovations 多元倾斜-t-正态创新下的向量自回归模型估计
IF 1 4区 数学
Statistical Modelling Pub Date : 2024-02-15 DOI: 10.1177/1471082x231224910
U. Nduka, E. O. Ossai, M. Madukaife, T. E. Ugah
{"title":"Estimation for vector autoregressive model under multivariate skew-t-normal innovations","authors":"U. Nduka, E. O. Ossai, M. Madukaife, T. E. Ugah","doi":"10.1177/1471082x231224910","DOIUrl":"https://doi.org/10.1177/1471082x231224910","url":null,"abstract":"Current procedures for estimating the parameters of [Formula: see text]th order vector autoregressive (VAR [Formula: see text]) model are usually based on assuming that the ensuing error distribution is multivariate normal. But there exists large body of evidence that several data encountered in real life are skewed; thereby making estimators derived based on normality assumption not suitable in such scenarios. This prompts for the search of appropriate methods for skewed distributions. Therefore, this article proposes estimators for the mean and covariance matrices of the [Formula: see text] model under multivariate skew- [Formula: see text]-normal (MSTN) distribution. Also, estimators for the shape and skewness parameters are provided. The expectation conditional maximization (ECM) and its extension the expectation conditional maximization either (ECME) algorithms are the tools used to derive the estimators. The performance of the estimators were examined through extensive simulations, and results show that they compete favourably with other numerical methods especially when the underlying distribution is skewed. The usefulness of our estimators was illustrated using a real data set on some US economic indicators. The VAR [Formula: see text] model under MSTN distribution provides a good fit, better than [Formula: see text] model under the assumption of normality.","PeriodicalId":49476,"journal":{"name":"Statistical Modelling","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139774326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信