线性代数中Pearson残差的意义

2007 IEEE International Conference on Granular Computing (GRC 2007) Pub Date : 2007-11-02 DOI:10.1109/GrC.2007.126

S. Tsumoto, S. Hirano

{"title":"线性代数中Pearson残差的意义","authors":"S. Tsumoto, S. Hirano","doi":"10.1109/GrC.2007.126","DOIUrl":null,"url":null,"abstract":"Marginal distributions play an central role in statistical analysis of a contingency table. However, when the number of partition becomes large, the contribution from marginal distributions decreases. This paper focuses on a formal analysis of marginal distributions in a contingency table. The main approach is to take the difference between two matrices with the same sample size and the same marginal distributions, which we call difference matrix. The important nature of the difference matrix is that the determinant is equal to 0: when the rank of a matrix is r, the difference between a original matrix and the expected matrix will become r - 1 at most. Since the sum of rows or columns of the will become zero, which means that the information of one rank corresponds to information on the frequency of a contingency matrix. Interestingly, if we take an expected matrix whose elements are the expected values based on marginal distributions, the difference between an original matrix and expected matrix can be represented by linear combination of determinants of 2 times 2 submatrices.","PeriodicalId":259430,"journal":{"name":"2007 IEEE International Conference on Granular Computing (GRC 2007)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Meaning of Pearson Residuals Linear Algebra View\",\"authors\":\"S. Tsumoto, S. Hirano\",\"doi\":\"10.1109/GrC.2007.126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Marginal distributions play an central role in statistical analysis of a contingency table. However, when the number of partition becomes large, the contribution from marginal distributions decreases. This paper focuses on a formal analysis of marginal distributions in a contingency table. The main approach is to take the difference between two matrices with the same sample size and the same marginal distributions, which we call difference matrix. The important nature of the difference matrix is that the determinant is equal to 0: when the rank of a matrix is r, the difference between a original matrix and the expected matrix will become r - 1 at most. Since the sum of rows or columns of the will become zero, which means that the information of one rank corresponds to information on the frequency of a contingency matrix. Interestingly, if we take an expected matrix whose elements are the expected values based on marginal distributions, the difference between an original matrix and expected matrix can be represented by linear combination of determinants of 2 times 2 submatrices.\",\"PeriodicalId\":259430,\"journal\":{\"name\":\"2007 IEEE International Conference on Granular Computing (GRC 2007)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 IEEE International Conference on Granular Computing (GRC 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GrC.2007.126\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE International Conference on Granular Computing (GRC 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GrC.2007.126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

边际分布在列联表的统计分析中起着中心作用。然而，当分区数目增大时，边际分布的贡献减小。本文着重于列联表中边际分布的形式化分析。主要的方法是取具有相同样本量和相同边际分布的两个矩阵之间的差，我们称之为差矩阵。差矩阵的重要性质是行列式等于0:当矩阵的秩为r时，原始矩阵与期望矩阵的差不超过r - 1。的行或列的和将变为零，这意味着一个秩的信息对应于一个权变矩阵频率上的信息。有趣的是，如果我们取一个期望矩阵，它的元素是基于边际分布的期望值，原始矩阵和期望矩阵之间的差可以用2乘以2个子矩阵的行列式的线性组合来表示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Meaning of Pearson Residuals Linear Algebra View

Marginal distributions play an central role in statistical analysis of a contingency table. However, when the number of partition becomes large, the contribution from marginal distributions decreases. This paper focuses on a formal analysis of marginal distributions in a contingency table. The main approach is to take the difference between two matrices with the same sample size and the same marginal distributions, which we call difference matrix. The important nature of the difference matrix is that the determinant is equal to 0: when the rank of a matrix is r, the difference between a original matrix and the expected matrix will become r - 1 at most. Since the sum of rows or columns of the will become zero, which means that the information of one rank corresponds to information on the frequency of a contingency matrix. Interestingly, if we take an expected matrix whose elements are the expected values based on marginal distributions, the difference between an original matrix and expected matrix can be represented by linear combination of determinants of 2 times 2 submatrices.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2007 IEEE International Conference on Granular Computing (GRC 2007)

自引率

0.00%

发文量