分类变量间关联的假设检验:卡方检验的实证应用

Basil Msuha, T. Mdendemi
{"title":"分类变量间关联的假设检验:卡方检验的实证应用","authors":"Basil Msuha, T. Mdendemi","doi":"10.7176/mtm/9-2-02","DOIUrl":null,"url":null,"abstract":"Chi-square test and the logic of hypothesis testing were developed by Karl Pearson. In this article we demonstrate theoretically and empirically the hypothesis testing for the association between categorical variables using Chi‑square Test. In research, there are studies which often collect data on categorical variables that can be summarized as a series of counts. These counts are commonly arranged in a tabular format known as a contingency table. We show in this paper how the chi-square test statistic can be used to evaluate whether there is an association between the rows and columns in a contingency table. We describes in detail what is a chi-square test, on which type of data it is used and the assumptions associated with its application. We consider both theoretical and empirical cases. On empirical case we use the data from the study which was conducted between September 2017 and March, 2018 in two municipalities of Dodoma and Morogoro, Tanzania. We conclude in this article that the Chi-square test,  only tells us the probability of independence of a distribution of data or in simple terms it does only test that whether two categorical variables are associated with each other or not. It does not tell us that how closely they are associated. Therefore, once we got to know that there is a relation between these two variables, we need to explore other methods to calculate the amount of association between them. Key words: Contingency table , categorical data analysis, Chi-square test, hypothesis testing DOI : 10.7176/MTM/9-2-02","PeriodicalId":394772,"journal":{"name":"Mathematical Theory and Modeling","volume":"134 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Hypothesis Testing for the Association Between Categorical Variables: Empirical Application of Chi square Test\",\"authors\":\"Basil Msuha, T. Mdendemi\",\"doi\":\"10.7176/mtm/9-2-02\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Chi-square test and the logic of hypothesis testing were developed by Karl Pearson. In this article we demonstrate theoretically and empirically the hypothesis testing for the association between categorical variables using Chi‑square Test. In research, there are studies which often collect data on categorical variables that can be summarized as a series of counts. These counts are commonly arranged in a tabular format known as a contingency table. We show in this paper how the chi-square test statistic can be used to evaluate whether there is an association between the rows and columns in a contingency table. We describes in detail what is a chi-square test, on which type of data it is used and the assumptions associated with its application. We consider both theoretical and empirical cases. On empirical case we use the data from the study which was conducted between September 2017 and March, 2018 in two municipalities of Dodoma and Morogoro, Tanzania. We conclude in this article that the Chi-square test,  only tells us the probability of independence of a distribution of data or in simple terms it does only test that whether two categorical variables are associated with each other or not. It does not tell us that how closely they are associated. Therefore, once we got to know that there is a relation between these two variables, we need to explore other methods to calculate the amount of association between them. Key words: Contingency table , categorical data analysis, Chi-square test, hypothesis testing DOI : 10.7176/MTM/9-2-02\",\"PeriodicalId\":394772,\"journal\":{\"name\":\"Mathematical Theory and Modeling\",\"volume\":\"134 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Theory and Modeling\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.7176/mtm/9-2-02\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Theory and Modeling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7176/mtm/9-2-02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

卡方检验和假设检验的逻辑是由卡尔·皮尔逊提出的。在本文中,我们从理论上和经验上证明了使用卡方检验对分类变量之间的关联进行假设检验。在研究中,有一些研究经常收集分类变量的数据,这些数据可以总结为一系列计数。这些计数通常以称为列联表的表格形式排列。在本文中,我们展示了如何使用卡方检验统计量来评估列联表中的行和列之间是否存在关联。我们详细描述了什么是卡方检验,使用哪种类型的数据以及与其应用相关的假设。我们考虑理论和经验案例。在实证案例中,我们使用了2017年9月至2018年3月在坦桑尼亚多多马和莫罗戈罗两个市进行的研究数据。我们在本文中得出结论,卡方检验只告诉我们数据分布独立的概率,或者简单地说,它只测试两个分类变量是否相互关联。它并没有告诉我们它们之间的联系有多紧密。因此,一旦我们知道这两个变量之间存在关系,我们就需要探索其他方法来计算它们之间的关联量。关键词:列联表,分类数据分析,卡方检验,假设检验
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Hypothesis Testing for the Association Between Categorical Variables: Empirical Application of Chi square Test
Chi-square test and the logic of hypothesis testing were developed by Karl Pearson. In this article we demonstrate theoretically and empirically the hypothesis testing for the association between categorical variables using Chi‑square Test. In research, there are studies which often collect data on categorical variables that can be summarized as a series of counts. These counts are commonly arranged in a tabular format known as a contingency table. We show in this paper how the chi-square test statistic can be used to evaluate whether there is an association between the rows and columns in a contingency table. We describes in detail what is a chi-square test, on which type of data it is used and the assumptions associated with its application. We consider both theoretical and empirical cases. On empirical case we use the data from the study which was conducted between September 2017 and March, 2018 in two municipalities of Dodoma and Morogoro, Tanzania. We conclude in this article that the Chi-square test,  only tells us the probability of independence of a distribution of data or in simple terms it does only test that whether two categorical variables are associated with each other or not. It does not tell us that how closely they are associated. Therefore, once we got to know that there is a relation between these two variables, we need to explore other methods to calculate the amount of association between them. Key words: Contingency table , categorical data analysis, Chi-square test, hypothesis testing DOI : 10.7176/MTM/9-2-02
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信