{"title":"Statistical and machine learning approaches for the minimization of trigger errors in parametric earthquake catastrophe bonds","authors":"Laura Calvet, Madeleine Lopeman, J. Adrián, G. Franco, A. Juan","doi":"10.2436/20.8080.02.64","DOIUrl":"https://doi.org/10.2436/20.8080.02.64","url":null,"abstract":"Catastrophe bonds are financial instruments designed to transfer risk of monetary losses arising from earthquakes, hurricanes, or floods to the capital markets. The insurance and reinsurance industry, governments, and private entities employ them frequently to obtain coverage. Parametric catastrophe bonds base their payments on physical features. For instance, given parameters such as magnitude of the earthquake and the location of its epicentre, the bond may pay a fixed amount or not pay at all. This paper reviews statistical and machine learning techniques for designing trigger mechanisms and includes a computational experiment. Several lines of future research are discussed.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72915584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Estimating regional social accounting matrices to analyse rural development","authors":"Alfredo José Mainar Causapé, José Manuel Rueda Cantuche, M. Flores, Patricia D. Fuentes Saguar, M. C. D. López, F. Santini, S. Mary, S. G. Paloma","doi":"10.2436/20.8080.02.62","DOIUrl":"https://doi.org/10.2436/20.8080.02.62","url":null,"abstract":"This paper has two complementary objectives: on the one hand, it introduces the EURO method for the estimation of (regional) Social Accounting Matrices. This method is widely used by Eurostat for the estimation of missing national Supply, Use and Input-output tables but it has not been used before within the context of social accounting matrices or of regional statistics and/or regional impact analyses. On the other hand, this work discusses the possibility of producing non-survey based regional Social Accounting Matrices that may eventually allow the user to carry out impact analyses such as those of rural development policies, among others. The analysis is carried out for 12 selected European regions based on clusters.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89471206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparison of two discrimination indexes in the categorisation of continuous predictors in time-to-event studies","authors":"Irantzu Barrio, M. Rodríguez-Álvarez, Luís Meira-Machado, C. Esteban, I. Arostegui","doi":"10.2436/20.8080.02.51","DOIUrl":"https://doi.org/10.2436/20.8080.02.51","url":null,"abstract":"The Cox proportional hazards model is the most widely used survival prediction model for analysing time-to-event data. To measure the discrimination ability of a survival model the concordance probability index is widely used. In this work we studied and compared the performance of two different estimators of the concordance probability when a continuous predictor variable is categorised in a Cox proportional hazards regression model. In particular, we compared the c-index and the concordance probability estimator. We evaluated the empirical performance of both estimators through simulations. To categorise the predictor variable we propose a methodology which considers the maximal discrimination attained for the categorical variable. We applied this methodology to a cohort of patients with chronic obstructive pulmonary disease, in particular, we categorised the predictor variable forced expiratory volume in one second in percentage.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79964021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Goodness-of-fit test for randomly censored data based on maximum correlation","authors":"E. Strzalkowska-Kominiak, A. Grané","doi":"10.2436/20.8080.02.54","DOIUrl":"https://doi.org/10.2436/20.8080.02.54","url":null,"abstract":"In this paper we study a goodness-of-fit test based on the maximum correlation coefficient, in the context of randomly censored data. We construct a new test statistic under general right-censoring and prove its asymptotic properties. Additionally, we study a special case, when the censoring mechanism follows the well-known Koziol-Green model. We present an extensive simulation study on the empirical power of these two versions of the test statistic, showing their advantages over the widely used Pearson-type test. Finally, we apply our test to the head-and-neck cancer data.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78809954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bayesian correlated models for assessing the prevalence of viruses in organic and non-organic agroecosystems","authors":"Elena Lázaro, C. Armero, L. Rubio","doi":"10.2436/20.8080.02.52","DOIUrl":"https://doi.org/10.2436/20.8080.02.52","url":null,"abstract":"Cultivation of horticultural species under organic management has increased in importance in recent years. However, the sustainability of this new production method needs to be supported by scientific research, especially in the field of virology. We studied the prevalence of three important virus diseases in agroecosystems with regard to their management system: organic versus non-organic, with and without greenhouse. Prevalence was assessed by means of a Bayesian correlated binary model which connects the risk of infection of each virus within the same plot and was defined in terms of a logit generalized linear mixed model (GLMM). Model robustness was checked through a sensitivity analysis based on different hyperprior scenarios. Inferential results were examined in terms of changes in the marginal posterior distributions, both for fixed and for random effects, through the Hellinger distance and a derived measure of sensitivity. Statistical results suggested that organic systems show prevalence lower than or similar to that of non-organic ones in both single and multiple infections, and highlighted the relevance of the prior specification of the random effects in the inferential process.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88278530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Bayesian stochastic SIRS model with a vaccination strategy for the analysis of respiratory syncytial virus","authors":"Marc Jornet-Sanz, A. Corberán-Vallet, F. Santonja, R. Villanueva","doi":"10.2436/20.8080.02.56","DOIUrl":"https://doi.org/10.2436/20.8080.02.56","url":null,"abstract":"Our objective in this paper is to model the dynamics of respiratory syncytial virus in the region of Valencia (Spain) and analyse the effect of vaccination strategies from a health-economic point of view. Compartmental mathematical models based on differential equations are commonly used in epidemiology to both understand the underlying mechanisms that influence disease transmission and analyse the impact of vaccination programs. However, a recently proposed Bayesian stochastic susceptible-infected-recovered-susceptible model in discrete time provided an improved and more natural description of disease dynamics. In this work, we propose an extension of that stochastic model that allows us to simulate and assess the effect of a vaccination strategy that consists of vaccinating a proportion of newborns.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88647467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Thirty years of progeny from Chao’s inequality: Estimating and comparing richness with incidence data and incomplete sampling","authors":"A. Chao, Robert K. Colwell","doi":"10.2436/20.8080.02.49","DOIUrl":"https://doi.org/10.2436/20.8080.02.49","url":null,"abstract":"In the context of capture-recapture studies, Chao (1987) derived an inequality among capture frequency counts to obtain a lower bound for the size of a population based on individuals’ capture/non-capture records for multiple capture occasions. The inequality has been applied to obtain a non-parametric lower bound of species richness of an assemblage based on species incidence (detection/non-detection) data in multiple sampling units. The inequality implies that the number of undetected species can be inferred from the species incidence frequency counts of the uniques (species detected in only one sampling unit) and duplicates (species detected in exactly two sampling units). In their pioneering paper, Colwell and Coddington (1994) gave the name “Chao2” to the estimator for the resulting species richness. (The “Chao1” estimator refers to a similar type of estimator based on species abundance data.) Since then, the Chao2 estimator has been applied to many research fields and led to fruitful generalizations. Here, we first review Chao’s inequality under various models and discuss some related statistical inference questions: (1) Under what conditions is the Chao2 estimator an unbiased point estimator? (2) How many additional sampling units are needed to detect any arbitrary proportion (including 100%) of the Chao2 estimate of asymptotic species richness? (3) Can other incidence frequency counts be used to obtain similar lower bounds? We then show how the Chao2 estimator can also be used to guide a non-asymptotic analysis in which species richness estimators can be compared for equally-large or equally-complete samples via sample-size-based and coverage-based rarefaction and extrapolation. We also review the generalization of Chao’s inequality to estimate species richness under other sampling-without-replacement schemes (e.g. a set of quadrats, each surveyed only once), to obtain a lower bound of undetected species shared between two or multiple assemblages, and to allow inferences about undetected phylogenetic richness (the total length of undetected branches of a phylogenetic tree connecting all species), with associated rarefaction and extrapolation. A small empirical dataset for Australian birds is used for illustration, using online software SpadeR, iNEXT, and PhD.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83649859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Statistical modeling of warm-spell duration series using hurdle models","authors":"J. Rydén","doi":"10.2436/20.8080.02.57","DOIUrl":"https://doi.org/10.2436/20.8080.02.57","url":null,"abstract":"Regression models for counts could be applied to the earth sciences, for instance when studying trends of extremes of climatological quantities. Hurdle models are modified count models which can be regarded as mixtures of distributions. In this paper, hurdle models are applied to model the sums of lengths of periods of high temperatures. A modification of the common versions found in the literature is presented, since left truncation as well as a particular treatment of zeros is needed for the problem. The outcome of the model is compared to those of simpler count models.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85093422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Corrigendum to \"Transmuted geometric distribution with applications in modelling and regression analysis of count data\"","authors":"S. Chakraborty, D. Bhati","doi":"10.2436/20.8080.02.53","DOIUrl":"https://doi.org/10.2436/20.8080.02.53","url":null,"abstract":"In our paper “Transmuted geometric distribution with applications in modelling and regression analysis of count data” (Chakraborty and Bhati, 2016) there is a mistake in the expression for the variance V(Y), which induced a wrong expression for the Index of Dispersion (ID) in Table 1, a wrong figure in Figure 2(c) and a wrong sentence in the paragraph just preceding Remark 5. These were noticed only after the article was published. These corrections, as well as a few corrections in Table 6 and a correction of one reference, are as follows:","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75784521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A quadtree approach based on European geographic grids: reconciling data privacy and accuracy","authors":"R. Lagonigro, Ramon Oller, J. Martori","doi":"10.2436/20.8080.02.55","DOIUrl":"https://doi.org/10.2436/20.8080.02.55","url":null,"abstract":"Methods to preserve confidentiality when publishing geographic information conflict with the need to publish accurate data. The goal of this paper is to create a European geographic grid framework to disseminate statistical data over maps. We propose a methodology based on quadtree hierarchical geographic data structures. We create a varying size grid adapted to local area densities. Highly populated zones are disaggregated into small squares to allow dissemination of accurate data. Alternatively, information on sparsely populated zones is published in big squares to avoid identification of individual data. The methodology has been applied to the 2014 population register data in Catalonia.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2017-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77827265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}