{"title":"Spatial MCUSUM Control Chart","authors":"Juan David Giraldo Rojas, R. D. González","doi":"10.15446/RCE.V43N1.78748","DOIUrl":"https://doi.org/10.15446/RCE.V43N1.78748","url":null,"abstract":"Este documento propone una carta de control CUSUM multivariada espacial para monitorear la media de una sola caracteristica de un producto o proceso, cuando las mediciones se toman en diferentes ubicaciones en cada elemento muestreado. Para estimar la matriz de varianza y covarianza, se utilizan algunas herramientas de la geoestadistica, teniendo en cuenta la correlacion espacial entre las mediciones. El desempeno de esta carta de control se explora por simulacion y su uso se ilustra con un ejemplo.","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":"75 1","pages":"49-70"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90648958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Two Useful Discrete Distributions to Model Overdispersed Count Data","authors":"J. Mazucheli, Wesley Bertoli, R. P. Oliveira","doi":"10.15446/RCE.V43N1.77052","DOIUrl":"https://doi.org/10.15446/RCE.V43N1.77052","url":null,"abstract":"The methods to obtain discrete analogues of continuous distributions have been widely considered in recent years. In general, the discretization process provides probability mass functions that can be competitive with the traditional model used in the analysis of count data, the Poisson distribution. The discretization procedure also avoids the use of continuous distribution in the analysis of strictly discrete data. In this paper, we seek to introduce two discrete analogues for the Shanker distribution using the method of the infinite series and the method based on the survival function as alternatives to model overdispersed datasets. Despite the difference between discretization methods, the resulting distributions are interchangeable. However, the distribution generated by the method of infinite series method has simpler mathematical expressions for the shape, the generating functions and the central moments. The maximum likelihood theory is considered for estimation and asymptotic inference concerns. A simulation study is carried out in order to evaluate some frequentist properties of the developed methodology. The usefulness of the proposed models is evaluated using real datasets provided by the literature.","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":"41 1","pages":"21-48"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91356316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Relationship Between Kendall's tau Correlation and Mutual Information","authors":"M. B. Ghalibaf","doi":"10.15446/rce.v43n1.78054","DOIUrl":"https://doi.org/10.15446/rce.v43n1.78054","url":null,"abstract":"La informacion mutua (MI) puede ser vista como una medida de asociacion multivariante en un vector aleatorio. Sin embargo, la estimacion de MI es dificil ya que la estimacion de la funcion de densidad de probabilidad conjunta (PDF) de datos distribuidos no gaussianos es un problema dificil. La funcion copula es una herramienta apropiada para estimar el MI ya que la funcion de densidad de probabilidad de las variables aleatorias se puede expresar como el producto de la funcion de densidad de copula asociada y de los PDF marginales. Con una pequena busqueda, encontramos que la informacion mutua propuesta basada en copulas es mucho mas precisa que los metodos convencionales, como el histograma de la articulacion y el MI basado en ventana de Parzen. En este articulo, al utilizar el metodo basado en copulas, calculamos el MI para algunas familias de funciones de distribucion bivariadas y estudiamos la relacion entre la correlacion tau de Kendall y el MI de las distribuciones bivariadas. Finalmente, usando un conjunto de datos real, ilustramos la eficiencia de este enfoque.","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":"63 1","pages":"3-20"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87101049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Generalized Poisson Hidden Markov Model for Overdispersed or Underdispersed Count Data","authors":"Sebastian George, Ambily Jose","doi":"10.15446/RCE.V43N1.77542","DOIUrl":"https://doi.org/10.15446/RCE.V43N1.77542","url":null,"abstract":"El metodo estadistico mas adecuado para explicar la dependencia serial en los datos de recuento de series de tiempo se basan en los modelos ocultos de Markov (HMM). Estos modelos suponen que las observaciones se generan a partir de un finito mezcla de distribuciones regidas por el principio de la cadena de Markov (MC). El modelo de Markov oculto de Poisson (P-HMM) puede ser el metodo mas utilizado para modelar las situaciones mencionadas anteriormente. Sin embargo, en el escenario de la vida real, este modelo no puede considerarse como la mejor opcion. Teniendo en cuenta este hecho, nosotros, en este articulo, apostamos por la distribucion generalizada de Poisson (GPD) para modelar datos de conteo. Este metodo puede rectificar la sobredispersion y subdispersion en el modelo de Poisson. Aqui desarrollamos Poisson generalizado Modelo de Markov oculto (GP-HMM) combinando GPD con HMM para modelando tales datos. Los resultados del estudio sobre datos simulados y una aplicacion de datos reales, casos mensuales de leptospirosis en el estado de Kerala en South India, muestra buenas propiedades de convergencia, lo que demuestra que el GP-HMM Es un metodo mejor en comparacion con P-HMM.","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":"14 1","pages":"71-82"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77717997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Editorial","authors":"Ramón Giraldo","doi":"10.15446/rce.v43n1.84593","DOIUrl":"https://doi.org/10.15446/rce.v43n1.84593","url":null,"abstract":"Dear readers, Welcome to the first issue of the 43th volume of Revista Colombiana de Es-tadística (Colombian Journal of Statistics).Professor Liliana López has stepped down as co-editor after four years of ser-vice. On behalf of the Statistics Department of the Universidad Nacional deColombia, I want to thank Liliana for her outstanding work during these years.We worked together since 2016 with the aim of improving the standard and visi-bility of the Journal. In addition to her teaching and scientific responsibilities, shewill continue to serve the Statistics Department in other academic tasks.I would like to thank all authors for submitting their articles and all refereeswho did a very great job of reviewing a great number of manuscripts. I also wantto thank all members of the scientific and editorial committee for their valuablehelp in coediting some works. I offer a special recognition to our assistant PatriciaChávez for her dedicated daily work. The response time on a submitted manuscriptis approximately two months. We have tried to make the review processes as fastas possible maintaining scientific standards.","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42700433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Diana Carolina Franco Soto, Antonio Carlos Pedroso de Lima, J. Singer
{"title":"A Birnbaum-Saunders Model for Joint Survival and Longitudinal Analysis of Congestive Heart Failure Data","authors":"Diana Carolina Franco Soto, Antonio Carlos Pedroso de Lima, J. Singer","doi":"10.15446/RCE.V43N1.77851","DOIUrl":"https://doi.org/10.15446/RCE.V43N1.77851","url":null,"abstract":"Consideramos una modelacion conjunta parametrica de mediciones longitudinales y tiempos de supervivencia, motivados por un estudio realizado en el Instituto do Coracao (Incor), Sao Paulo, Brasil, con el objetivo de evaluar el impacto del Peptido Natriuretico tipo B (BNP) recolectado en diferentes instantes, sobre la supervivencia de pacientes con Insuficiencia Cardiaca Congestiva (ICC). Empleamos un modelo lineal de efectos mixtos para la respuesta longitudinal y un modelo Birnbaum-Saunders para los tiempos de supervivencia, permitiendo la inclusion de sujetos sin observaciones longitudinales. Obtenemos los estimadores de maxima verosimilitud de los parametros del modelo conjunto y realizamos un estudio de simulacion para comparar las probabilidades de supervivencia verdaderas con las predicciones dinamicas obtenidas al ajustar el modelo conjunto propuesto y para evaluar el desempeno del metodo para estimar los parametros del modelo. El modelo conjunto propuesto se aplica a la cohorte de 1609 pacientes con ICC, de los cuales 1080 no tienen mediciones de BNP. Las estimaciones de los parametros y sus errores estandar obtenidos por medio de: i) el enfoque tradicional, donde unicamente se incluyen individuos con al menos una medicion de la respuesta longitudinal y ii) el enfoque propuesto, que incluye la informacion de supervivencia de todos los individuos; se comparan con los obtenidos por medio de los modelos marginales (longitudinal y de supervivencia).","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":"108 1","pages":"83-101"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74652141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Nested and Repeated Cross Validation for Classification Model With High-Dimensional Data","authors":"Yi Zhong, Jianghua He, P. Chalise","doi":"10.15446/RCE.V43N1.80000","DOIUrl":"https://doi.org/10.15446/RCE.V43N1.80000","url":null,"abstract":"Con la llegada de las tecnologias de alto rendimiento, los conjuntos de datos de alta dimension estan cada vez mas disponibles. Esto no solo ha abierto una nueva vision acerca de los sistemas biologicos, sino que tambien plantea desafios analiticos. Un problema importante es la seleccion de subconjuntos de variables y la prediccion de resultados futuros. Es crucial que los modelos no sean sobreajustados y que den resultados precisos con nuevos datos. Ademas, la identificaci on confiable de variables informativas con alto poder predictivo (seleccion de caracteristicas) es de interes en entornos clinicos. Proponemos un procedimiento de dos etapas para la seleccion de variables y la construccion de modelos de clasificacion, el cual utiliza un metodo de validacion cruzada anidada y repetida. Evaluamos nu-estro enfoque utilizando tanto datos simulados como dos conjuntos de datos de expresion genica disponibles publicamente. El metodo propuesto mostro una precision predictiva comparativamente mejor para casos nuevos en comparacion con el metodo estandar de validacion cruzada.","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":"605 1","pages":"103-125"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88588544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Randomized Two Stage Adaptively Censored Design With Application to Testing","authors":"U. Bandyopadhyay, Rahul Bhattacharya","doi":"10.15446/RCE.V42N2.65344","DOIUrl":"https://doi.org/10.15446/RCE.V42N2.65344","url":null,"abstract":"A two stage randomized design is developed for two treatment clinical trials in which response variables are exponential and the observations are censored by using failure censoring and time censoring in the first and second stages respectively. The censoring time for the second stage is determined from the outcomes of the first stage. An application to testing for the equality of treatment effects is given along with a comparative study with relevant properties.","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.15446/RCE.V42N2.65344","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43547958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Simultaneously Testing for Location and Scale Parameters of Two Multivariate Distributions","authors":"A. Chavan, D. T. Shirke","doi":"10.15446/RCE.V42N2.70815","DOIUrl":"https://doi.org/10.15446/RCE.V42N2.70815","url":null,"abstract":"In this article, we propose nonparametric tests for simultaneously testing equality of location and scale parameters of two multivariate distributions by using nonparametric combination theory. Our approach is to combine the data depth based location and scale tests using combining function to construct a new data depth based test for testing both location and scale parameters. Based on this approach, we have proposed several tests. Fisher's permutation principle is used to obtain p-values of the proposed tests. Performance of proposed tests has been evaluated in terms of empirical power for symmetric and skewed multivariate distributions and compared to the existing test based on data depth. The proposed tests are also applied to a real-life data set for illustrative purpose.","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.15446/RCE.V42N2.70815","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43538491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Extreme Value Theory Applied to r Largest Order Statistics Under the Bayesian Approach","authors":"Renato S Silva, F. Nascimento","doi":"10.15446/RCE.V42N2.70271","DOIUrl":"https://doi.org/10.15446/RCE.V42N2.70271","url":null,"abstract":"Extreme Value Theory (EVT) is an important tool to predict efficient gains and losses. Its main areas of analyses are economic and environmental. Initially, for that form of event, it was developed the use of patterns of parametric distribution such as Normal and Gamma. However, economic and environmental data presents, in most cases, a heavy-tailed distribution, in contrast to those distributions. Thus, it was faced a great difficult to frame extreme events. Furthermore, it was almost impossible to use conventional models, making predictions about non-observed events, which exceed the maximum of observations. In some situations EVT is used to analyse only the maximum of some dataset, which provide few observations, and in those cases it is more effective to use the r largest-order statistics. This paper aims to propose Bayesian estimators' for parameters of the r largest-order statistics. During the research, it was used Monte Carlo simulation to analyze the data, and it was observed some properties of those estimators, such as mean, variance, bias and Root Mean Square Error (RMSE). The estimation of the parameters provided inference for its parameters and return levels. This paper also shows a procedure to the choice of the r-optimal to the r largest-order statistics, based on the Bayesian approach applying Markov chains Monte Carlo (MCMC). Simulation results reveal that the Bayesian approach has a similar performance to the Maximum Likelihood Estimation, and the applications were developed using the Bayesian approach and showed a gain in accurary compared with otherestimators.","PeriodicalId":54477,"journal":{"name":"Revista Colombiana De Estadistica","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.15446/RCE.V42N2.70271","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43370312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}