{"title":"empirical likelihood ratio based comparative study on tests for normality of residuals in linear models","authors":"C. Marange, Yongsong Qin","doi":"10.51936/ramh7128","DOIUrl":"https://doi.org/10.51936/ramh7128","url":null,"abstract":"The application of goodness-of-fit (GoF) tests in linear regression modeling is a common practice in applied statistical sciences. For instance, in simple linear regression the assumption of normality of residuals is always necessary to test before making any further inferences. The growing popularity of the use of powerful and efficient empirical likelihood ratio (ELR) based GoF tests in checking for departures from normality in various continuous distributions can be of great use in checking for distributional assumptions of residuals in linear models. Motivated by the attractive properties of the ELR based GoF tests the researchers conducted an extensive Type I error rate assessment as well as a Monte Carlo power comparison of selected ELR GoF tests with well-known existing tests against symmetric and asymmetric alternative OLS and BLUS residuals. Under the simulated scenarios, all the studied tests have good control of Type I error rates. The Monte Carlo experiments revealed the superiority of the ELR GoF tests under certain alternatives of both the OLS and BLUS residuals. Our findings also demonstrated the superiority of OLS over BLUS residuals when one is testing for normality in simple linear regression models. A real data study further revealed the applicability of the ELR based GoF tests in testing normality of residuals in linear regression models.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126493422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Two stage adaptive cluster sampling based on ordered statistics","authors":"Girish Chandra, Neeraj Tiwari, R. Nautiyal","doi":"10.51936/wuqh7533","DOIUrl":"https://doi.org/10.51936/wuqh7533","url":null,"abstract":"The estimation problem on sparsely distributed populations using adaptive cluster sampling (ACS) is discussed. In the first phase of ACS, two stage sampling is used in which primary and secondary sampling units are selected using simple random sampling without replacement. The idea of Thompson (1996) is introduced in order to choose an appropriate fixed value of pre-specified condition, which might represent the number of rare species, before conducting the survey by the use of order statistics. Different estimators of the population mean under the two possible schemes (open and closed boundaries of primary sampling units) are studied and the Rao-Blackwell theorem for improving these estimators is also used. Numerical illustrations, one on real life data and the other based on simulation study, are discussed for these two schemes. This design may be quite useful in environmental, forestry and other areas of research dealing with rare, endangered or threatened species.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120956464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Effects of the same-gender vs. cross-gender mentoring on a protégé outcome in academia","authors":"Metka Kogovšek, I. Ograjensek","doi":"10.51936/qsdi6398","DOIUrl":"https://doi.org/10.51936/qsdi6398","url":null,"abstract":"Mentoring seems to be an important way to start and advance individual researcher's career in science. Therefore, it is essential to examine the factors related to successful mentoring in order to find ways of efficiently supporting young academics on their career development path. Building on the similarity-attraction and social identity theories, our research indicates that gender similarity in academic mentoring might be related to the protégés' postdoctoral publication scores that lead to career advancement. The scores in a typical five-year publication cycle are higher for the protégés situated within same-gender mentoring dyads. Furthermore, the mentors' research performance importantly adds to the protégés' postdoctoral research performance.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133325708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mechanisms generating asymmetric core-cohesive blockmodels","authors":"M. Cugmas, A. Žiberna, A. Ferligoj","doi":"10.51936/eupq5786","DOIUrl":"https://doi.org/10.51936/eupq5786","url":null,"abstract":"The paper addresses the relationship between different local network mechanisms and different global network structures, described by blockmodels. The research question is narrowed to the context of preschool children networks. Based on the studies regarding friendship, liking and interactional networks among preschool children, the popularity, transitivity, mutuality and assortativity mechanisms are assumed to be important for the evolution of such networks. It is assumed that the global network structure is defined by an asymmetric core-cohesive blockmodel consisting of one core group of units and two or more cohesive groups of units. Therefore, the main research question is whether the emergence of an asymmetric core-cohesive blockmodel can be a result of the influence of the listed mechanisms. Different initial global network structures are considered. Monte Carlo simulations were used. The relative fit measure is proposed and used to compare different blockmodel types on generated networks. The results show that the listed mechanisms indeed lead to the assumed global network structure.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114549574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Internal evaluation criteria for categorical data in hierarchical clustering","authors":"Z. Šulc, J. Cibulková, J. Procházka, H. Řezanková","doi":"10.51936/lxut1974","DOIUrl":"https://doi.org/10.51936/lxut1974","url":null,"abstract":"The paper compares 11 internal evaluation criteria for hierarchical clustering of categorical data regarding a correct number of clusters determination. The criteria are divided into three groups based on a way of treating the cluster quality. The variability-based criteria use the within-cluster variability, the likelihood-based criteria maximize the likelihood function, and the distance-based criteria use distances within and between clusters. The aim is to determine which evaluation criteria perform well and under what conditions. Different analysis settings, such as the used method of hierarchical clustering, and various dataset properties, such as the number of variables or the minimal between-cluster distances, are examined. The experiment is conducted on 810 generated datasets, where the evaluation criteria are assessed regarding the optimal number of clusters determination and mean absolute errors. The results indicate that the likelihood-based BIC1 and variability-based BK criteria perform relatively well in determining the optimal number of clusters and that some criteria, usually the distance-based ones, should be avoided.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"164 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114181518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Estimation of power function distribution based on selective order statistic","authors":"M. Alodat, M. Al-Rawwash, S. Al-Subh","doi":"10.51936/albt8757","DOIUrl":"https://doi.org/10.51936/albt8757","url":null,"abstract":"In this article, we present the selective order statistic sampling scheme as a promising approach to estimate the parameter of the univariate power function distribution. We derive the maximum likelihood estimator and the method of moments estimator of the power function distribution parameter as well as the reliability parameter and the ratio of two means. Moreover, we derive the asymptotic properties of the proposed estimators. Finally, we conduct simulation studies to investigate the performance of the selective order statistic scheme and concluded that it suits the power function distribution and we found that the maximum likelihood estimator is better than the method of moments estimator under the selective order statistic sampling scheme.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121436686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mode effects on socially desirable responding in web surveys compared to face-to-face and telephone surveys","authors":"N. Berzelak, Vasja Vehovar","doi":"10.51936/lrkv4884","DOIUrl":"https://doi.org/10.51936/lrkv4884","url":null,"abstract":"This paper elaborates upon differences in socially desirable responding as being the result of mode effects between web, telephone, and face-to-face survey modes. Social desirability is one of the main threats to comparability of data between different modes. The paper conceptualises socially desirable responding as a specific type of mode effect, which is not only a result of inherent characteristics of a survey mode, but is also mediated and moderated by complex interdependencies of specific survey implementations, contextual factors, and characteristics and behaviours of respondents. While web surveys are generally less prone to socially desirable responding, it is essential to be wary of circumstances that may reduce the perceived privacy of the survey situation and lead to biased reporting. The presented empirical study analyses the answers to a large number of items used in a pilot implementation of the Generations and Gender Survey across the three modes to gain insights into the incidence of socially desirable responding and its role in the observed differences in estimates. The comparison of means, distributions, and proportions of extreme responses to scale questions is performed across 89 survey items. The results are in line with the previous findings on lower susceptibility of web surveys to social desirability bias. More importantly, the findings suggest that the problem of socially desirable responding is likely to be a major contributor to the differences in mean estimates, response distributions, and the level of extreme responding between the studied modes.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133917922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Behind the curve and beyond","authors":"Bastian Becker","doi":"10.51936/lpfv6993","DOIUrl":"https://doi.org/10.51936/lpfv6993","url":null,"abstract":"Parameter coefficients from non-linear models are inherently difficult to interpret, and scholars frequently opt for computing and comparing predicted probabilities for variables of interest. In an influential article, Hanmer and Ozan Kalkan (2013) discuss the two most common approaches, the average case respectively observed values approach, and make a strong case for the latter. In this paper, I propose a further refinement of the observed values approach for the purpose of computing predicted probability changes. This refinement concerns the use of counterfactual values for the independent variable of interest. I demonstrate that accounting for non-linearities with regards to the variable of interest is important to avoid estimation biases. I also discuss the implications of this insight for estimating average treatment effects from observational data.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129172886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Web survey paradata on response time outliers","authors":"M. Matjašič, Vasja Vehovar, Katja Lozar Manfreda","doi":"10.51936/yoqn3590","DOIUrl":"https://doi.org/10.51936/yoqn3590","url":null,"abstract":"In the last two decades, survey researchers have intensively used computerised methods for the collection of different types of paradata, such as keystrokes, mouse clicks and response times, to evaluate and improve survey instruments as well as to understand the survey response process. With the growing popularity of web surveys, the importance of paradata has further increased. Within this context, response time measurement is the prevailing paradata approach. Papers typically analyse the time (measured in milliseconds or seconds) a respondent needs to answer a certain item, question, page or questionnaire. One of the key challenges when analysing the response time is to identify and separate units that are answering too quickly or too slowly. These units can have a poor response quality and are typically labelled as response time outliers. This paper focuses on approaches for identifying and processing response time outliers. It presents a systematic overview of scientific papers on response time outliers in web surveys. The key observed characteristics of the papers are the approaches used, the level of time measurement, the processing of response time outliers and the relationship between response time and response quality. The results show that knowledge on response time outliers is scattered, inconsistent and lacking systematic comparisons of approaches. Consequently, there is a need to improve and upgrade the knowledge on this issue and to develop new approaches that will overcome existing deficiencies and inconsistencies in identifying and dealing with response time outliers.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128446804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparing two partitions of non-equal sets of units","authors":"M. Cugmas, Anuvska Ferligoj","doi":"10.51936/aidm7546","DOIUrl":"https://doi.org/10.51936/aidm7546","url":null,"abstract":"Rand (1971) proposed what has since become a well-known index for comparing two partitions obtained on the same set of units. The index takes a value on the interval between 0 and 1, where a higher value indicates more similar partitions. Sometimes, e.g. when the units are observed in two time periods, the splitting and merging of clusters should be considered differently, according to the operationalization of the stability of clusters. The Rand Index is symmetric in the sense that both the splitting and merging of clusters lower the value of the index. In such a nonsymmetric case, one of the Wallace indexes (Wallace, 1983) can be used. Further, there are several cases when one wants to compare two partitions obtained on different sets of units, where the intersection of these sets of units is a non-empty set of units. In this instance, the new units and units which leave the clusters from the first partition can be considered as a factor lowering the value of the index. Therefore, a modified Rand index is presented. Because the splitting and merging of clusters have to be considered differently in some situations, an asymmetric modified Wallace Index is also proposed. For all presented indices, the correction for chance is described, which allows different values of a selected index to be compared.","PeriodicalId":242585,"journal":{"name":"Advances in Methodology and Statistics","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121091325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}