{"title":"分类变量间关联的假设检验:卡方检验的实证应用","authors":"Basil Msuha, T. Mdendemi","doi":"10.7176/mtm/9-2-02","DOIUrl":null,"url":null,"abstract":"Chi-square test and the logic of hypothesis testing were developed by Karl Pearson. In this article we demonstrate theoretically and empirically the hypothesis testing for the association between categorical variables using Chi‑square Test. In research, there are studies which often collect data on categorical variables that can be summarized as a series of counts. These counts are commonly arranged in a tabular format known as a contingency table. We show in this paper how the chi-square test statistic can be used to evaluate whether there is an association between the rows and columns in a contingency table. We describes in detail what is a chi-square test, on which type of data it is used and the assumptions associated with its application. We consider both theoretical and empirical cases. On empirical case we use the data from the study which was conducted between September 2017 and March, 2018 in two municipalities of Dodoma and Morogoro, Tanzania. We conclude in this article that the Chi-square test, only tells us the probability of independence of a distribution of data or in simple terms it does only test that whether two categorical variables are associated with each other or not. It does not tell us that how closely they are associated. Therefore, once we got to know that there is a relation between these two variables, we need to explore other methods to calculate the amount of association between them. Key words: Contingency table , categorical data analysis, Chi-square test, hypothesis testing DOI : 10.7176/MTM/9-2-02","PeriodicalId":394772,"journal":{"name":"Mathematical Theory and Modeling","volume":"134 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Hypothesis Testing for the Association Between Categorical Variables: Empirical Application of Chi square Test\",\"authors\":\"Basil Msuha, T. Mdendemi\",\"doi\":\"10.7176/mtm/9-2-02\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Chi-square test and the logic of hypothesis testing were developed by Karl Pearson. In this article we demonstrate theoretically and empirically the hypothesis testing for the association between categorical variables using Chi‑square Test. In research, there are studies which often collect data on categorical variables that can be summarized as a series of counts. These counts are commonly arranged in a tabular format known as a contingency table. We show in this paper how the chi-square test statistic can be used to evaluate whether there is an association between the rows and columns in a contingency table. We describes in detail what is a chi-square test, on which type of data it is used and the assumptions associated with its application. We consider both theoretical and empirical cases. On empirical case we use the data from the study which was conducted between September 2017 and March, 2018 in two municipalities of Dodoma and Morogoro, Tanzania. We conclude in this article that the Chi-square test, only tells us the probability of independence of a distribution of data or in simple terms it does only test that whether two categorical variables are associated with each other or not. It does not tell us that how closely they are associated. Therefore, once we got to know that there is a relation between these two variables, we need to explore other methods to calculate the amount of association between them. Key words: Contingency table , categorical data analysis, Chi-square test, hypothesis testing DOI : 10.7176/MTM/9-2-02\",\"PeriodicalId\":394772,\"journal\":{\"name\":\"Mathematical Theory and Modeling\",\"volume\":\"134 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Theory and Modeling\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.7176/mtm/9-2-02\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Theory and Modeling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7176/mtm/9-2-02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hypothesis Testing for the Association Between Categorical Variables: Empirical Application of Chi square Test
Chi-square test and the logic of hypothesis testing were developed by Karl Pearson. In this article we demonstrate theoretically and empirically the hypothesis testing for the association between categorical variables using Chi‑square Test. In research, there are studies which often collect data on categorical variables that can be summarized as a series of counts. These counts are commonly arranged in a tabular format known as a contingency table. We show in this paper how the chi-square test statistic can be used to evaluate whether there is an association between the rows and columns in a contingency table. We describes in detail what is a chi-square test, on which type of data it is used and the assumptions associated with its application. We consider both theoretical and empirical cases. On empirical case we use the data from the study which was conducted between September 2017 and March, 2018 in two municipalities of Dodoma and Morogoro, Tanzania. We conclude in this article that the Chi-square test, only tells us the probability of independence of a distribution of data or in simple terms it does only test that whether two categorical variables are associated with each other or not. It does not tell us that how closely they are associated. Therefore, once we got to know that there is a relation between these two variables, we need to explore other methods to calculate the amount of association between them. Key words: Contingency table , categorical data analysis, Chi-square test, hypothesis testing DOI : 10.7176/MTM/9-2-02