{"title":"Multinomial Naïve Bayes Classifier: Bayesian versus Nonparametric Classifier Approach","authors":"R. O. Olanrewaju, S. A. Olanrewaju, L. Nafiu","doi":"10.28924/ada/stat.2.8","DOIUrl":"https://doi.org/10.28924/ada/stat.2.8","url":null,"abstract":"This paper proposes a Naïve Bayes Classifier for Bayesian and nonparametric methods of analyzing multinomial regression. The Naïve Bayes classifier adopted Bayes’ rule for solving the posterior of the multinomial regression via its link function known as Logit link. The nonparametric adopted Gaussian, bi-weight kernels, Silverman’s rule of thumb bandwidth selector, and adjusted bandwidth as kernel density estimation. Three categorical responses of information on 78 people using one of three diets (Diet A, B, and C) that consist of scaled variables: age (in years), height (in cm), weight (in kg) before the diet (that is, pre-weight), weight (in kg) gained after 6 weeks of diet were subjected to the classifier multinomial regression of Naïve Bayes and nonparametric. The Gaussian and bi-weight kernel density estimation produced the minimum bandwidths across the three categorical responses for the four influencers. The Naïve Bayes classifier and nonparametric kernel density estimation for the multinomial regression produced the same prior probabilities of 0.3077, 0.3462, and 0.3462; and A prior probabilities of 0.3077, 0.3462, and 0.3462 for Diet A, Diet B, and Diet C at different smoothing bandwidths.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126976485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Effect of Sample Size on Random Component in Multilevel Modeling","authors":"Asadullah, M. Husain","doi":"10.28924/ada/stat.2.7","DOIUrl":"https://doi.org/10.28924/ada/stat.2.7","url":null,"abstract":"In cluster-correlated data arise when there exists any condition for that individual are grouped among themselves. Data of this kind arise frequently in social science, behavioral, and medical sciences since individuals can be grouped in so many different ways. Multilevel modeling (MLM) is an approach that can be used to handle cluster or grouped data. Analyzing of correlated data is different from the usual way for independent data since we have to consider the correlation structure among individuals within cluster. In random effects models’ correlation structure can be estimated by considering the models parameters are allowed to vary across the cluster. Random effect models have two components, within cluster components, cluster-specific response is described by a regression model with a population-level intercept and slope, other is between-cluster component: variation in cluster-intercepts and slopes is captured. In a multilevel model, cluster level variance component is more affected by no. of cluster as well as cluster size. So, this is important to aware the researcher about no. of cluster and cluster size in estimating the random components of random effect models for correlated continuous and discrete outcome respectively in MLM since it produces bias estimate for few no. of cluster and cluster size. The parameters of random effects models can be estimated by Maximum Likelihood and Restricted Maximum Likelihood (REML) estimation for correlated continuous outcome, on the contrast besides REML, Penalized Quassi Likelihood (PQL) and Adaptive Gaussian Quadrature (AGQ) estimation techniques are applied for correlated discrete outcome. In this thesis, using the simulation procedure we would be compared among these estimation techniques by exploring the influence of no. of cluster and cluster size on estimated random parameters from random effect models of two-level and three levels for continuous and discrete outcome respectively. Relative bias, mean square error and coverage probability would be used for comparison purpose among the estimation techniques.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130412954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Novel Discrete Distribution: Properties and Application Using Nipah Virus Infection Data Set","authors":"Fatma Zohra Seghier, Halim zeghdoudi, Vinoth Raman","doi":"10.28924/ada/stat.3.3","DOIUrl":"https://doi.org/10.28924/ada/stat.3.3","url":null,"abstract":"In this study, the Poisson (PD) distribution was compounded with a continuous distribution to produce the Poisson XLindley distribution (PXLD). Its raw moments and central moments are acquired as a result of a general expression for its rth factorial moment concerning origin being derived. Additionally, the expressions for its coefficient of variation, skewness, kurtosis and index of dispersion have been provided. For the estimate of its parameters, in particular, the methods of maximum likelihood and moments have been addressed. The applicability of the proposed distribution in modeling real data sets on Nipah virus infection, number of Hemocytometer yeast cell count data, and epileptic seizure counts data is examined by analyzing two real-world data sets.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120994374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K. Fatema, Muhammad Habibulla Alamin, M. Z. Hasan, M. Hossain
{"title":"Analyzing the Best Fitted Probabilistic Model for the Seasonal Rainfall Data in Khulna Region of Bangladesh","authors":"K. Fatema, Muhammad Habibulla Alamin, M. Z. Hasan, M. Hossain","doi":"10.28924/ada/stat.2.5","DOIUrl":"https://doi.org/10.28924/ada/stat.2.5","url":null,"abstract":"There are several pieces of research on the statistical modeling of rainfall data in Bangladesh. Since all the seasons of a year do not receive a similar amount of rainfall, hence one single statistical model might not be able to explain the pattern of rainfall at any season of a year. According to the climatologists, Bangladesh has four seasons which are Monsoon, Post-monsoon, Summer, and Winter based on the geographical characteristics of this country. This paper aims to determine the best-fitted probability distribution model for the monthly rainfall data of each particular season in the Khulna district of Bangladesh using the rainfall data of the Khulna region from 1951 to 2018. Very commonly used seven continuous distributions- Normal, Weibull, Gamma, Log-normal, Exponential, Cauchy, and Logistic distributions were used to model the data and to evaluate the performances of the distributions, three non-parametric goodness-of-fit tests were conducted, and AIC, BIC values were calculated. Parameters of the distributions were estimated by the maximum likelihood method. The best-fit result of each season was taken as the distribution with the lowest AIC and BIC values. Among the seven distributions, the Gamma distribution showed the best-fit results of the monthly rainfall data for the Monsoon, Post-Monsoon, and Winter Season, and the Weibull distribution showed the best-fit result for Summer Season.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"199 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133036206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Some Resultats on Optimal Allocation of Policy Limits and Deductibles: Mixture Model","authors":"Bouhadjar Meriem, Halim zeghdoudi, Abdelali Ezzebsa","doi":"10.28924/ada/stat.2.4","DOIUrl":"https://doi.org/10.28924/ada/stat.2.4","url":null,"abstract":"The main purpose of this paper is to introduce and investigate stochastic orders of scalar products of random vectors. We study the problem of finding maximal expected utility for some functional on insurance portfolios involving some additional (independent) randomization. Furthermore, applications in policy limits and deductible are obtained, we consider the scalar product of two random vectors which separates the severity effect and the frequency effect in the study of the optimal allocation of policy limits and deductibles. In that respect, we obtain the ordering of the optimal allocation of policy limits and deductibles when the dependence structure of the losses is unknown. Our application is a further study of [1 − 6].","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124127326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On the T-X Class of Topp Leone-G Family of Distributions: Statistical Properties and Applications","authors":"Femi Samuel Adeyinka","doi":"10.28924/ada/stat.2.2","DOIUrl":"https://doi.org/10.28924/ada/stat.2.2","url":null,"abstract":"This article investigates the T-X class of Topp Leone- G family of distributions. Some members of the new family are discussed. The exponential-Topp Leone-exponential distribution (ETLED) which is one of the members of the family is derived and some of its properties which include central and non-central moments, quantiles, incomplete moments, conditional moments, mean deviation, Bonferroni and Lorenz curves, survival and hazard functions, moment generating function, characteristic function and R`enyi entropy are established. The probability density function (pdf) of order statistics of the model is obtained and the parameter estimation is addressed with the maximum likelihood method (MLE). Three real data sets are used to demonstrate its application and the results are compared with two other models in the literature.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124404125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mixingale Estimation Function for SPDEs with Random Sampling","authors":"J. Bishwal","doi":"10.28924/ada/stat.2.3","DOIUrl":"https://doi.org/10.28924/ada/stat.2.3","url":null,"abstract":"We study the mixingale estimation function estimator of the drift parameter in the stochastic partial differential equation when the process is observed at the arrival times of a Poisson process. We use a two stage estimation procedure. We first estimate the intensity of the Poisson process. Then we substitute this estimate in the estimation function to estimate the drift parameter. We obtain the strong consistency and the asymptotic normality of the mixingale estimation function estimator.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117314889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Novel Chen Extension: Theory, Characterizations and Different Estimation Methods","authors":"H. Yousof, M. C. Korkmaz, G. Hamedani, M. Ibrahim","doi":"10.28924/ada/stat.2.1","DOIUrl":"https://doi.org/10.28924/ada/stat.2.1","url":null,"abstract":"In this work, we derive a novel extension of Chen distribution. Some statistical properties of the new model are derived. Numerical analysis for mean, variance, skewness and kurtosis is presented. Some characterizations of the proposed distribution are presented. Different classical estimation methods under uncensored schemes such as the maximum likelihood, Anderson-Darling, weighted least squares and right-tail Anderson–Darling methods are considered. Simulation studies are performed in order to compare and assess the above-mentioned estimation methods. For comparing the applicability of the four classical methods, two application to real data set are analyzed.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128738777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fatma Zohra Seghier, Halim zeghdoudi, A. Benchaabane
{"title":"A Size-Biased Poisson-Gamma Lindley Distribution with Application","authors":"Fatma Zohra Seghier, Halim zeghdoudi, A. Benchaabane","doi":"10.28924/ada/stat.1.132","DOIUrl":"https://doi.org/10.28924/ada/stat.1.132","url":null,"abstract":"In this paper, a size-biased Poisson-Gamma Lindley distribution (SBPGLD) has been obtained by size-biasing the Poisson-Gamma Lindley distribution (PGLD) introduced recently by Nedjar and Zeghdoudi (2020). Some of its statistical properties have been discussed. The method of maximum likelihood and the method of moments for the estimation of its parameters have been discussed. Also, an application on the real data of survival times of (56) Indian state of Kerala individus infected with Nipah virus is given.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131162372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hesham Reyad, Farrukh Jamal, G. Hamedani, Soha Othman
{"title":"The Alpha Power Transformed Dagum Distribution: Properties and Applications","authors":"Hesham Reyad, Farrukh Jamal, G. Hamedani, Soha Othman","doi":"10.28924/ada/stat.1.108","DOIUrl":"https://doi.org/10.28924/ada/stat.1.108","url":null,"abstract":"In this study, we propose a new extension of the Dagum distribution called the alpha power transformed Dagum distribution. Basic statistical properties of the new distribution such as; quantile function, raw and incomplete moments, moment generating function, order statistics, Rényi entropy, stochastic ordering and stress strength model are investigated. The characterizations of the new model is investigated. The method of maximum likelihood is used to estimate the model parameters of the new distribution and the observed information matrix is also obtained. A Monte Carlo simulation is presented to examine the behavior of the parameter estimates. The applicability of the new model is demonstrated by means of three applications.","PeriodicalId":153849,"journal":{"name":"European Journal of Statistics","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123569815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}