Stats & Maths & Unicorns
Raymond A. Anderson
Credit Intelligence & Modelling, published 2021-11-29
DOI: 10.1093/oso/9780192844194.003.0011
This chapter covers basic statistical concepts. Most of the statistics relate to hypothesis testing; the others relate to variable selection and model fitting. The chapter's title alludes to the fact that an exact match between a theoretical and an empirical distribution is as rare as a unicorn.
(1) Dispersion: measures of random variation, including variance and its inflation factor, covariance and correlation {Pearson's product-moment, Spearman's rank-order}, and the Mahalanobis distance.
(2) Goodness-of-fit: do observations match expectations? This applies to both continuous dependent variables {R-squared and adjusted R-squared} and categorical ones {Pearson's chi-square, the Hosmer–Lemeshow statistic}.
(3) Likelihood: assesses estimates' goodness-of-fit to binary dependent variables {log-likelihood, deviance}, plus the Akaike and Bayesian information criteria, which penalize model complexity.
(4) The Holy Trinity of Statistics: i) the Neyman–Pearson likelihood ratio, the basis for model comparisons; ii) Wald's chi-square, for potential variable removal; iii) Rao's score chi-square, for potential variable inclusion. All three are used in logistic regression.
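The dispersion measures listed under (1) can be illustrated with a minimal NumPy sketch. The data and variable names below are hypothetical, invented purely for demonstration; the formulas (Pearson and Spearman correlation, the variance inflation factor as 1/(1 − R²), and the Mahalanobis distance) are the standard ones the chapter refers to.

```python
import numpy as np

rng = np.random.default_rng(42)
# Toy data: three predictors, with x2 deliberately collinear with x1.
n = 500
x1 = rng.normal(size=n)
x2 = 0.8 * x1 + rng.normal(scale=0.6, size=n)
x3 = rng.normal(size=n)
X = np.column_stack([x1, x2, x3])

# Pearson product-moment correlation matrix.
pearson = np.corrcoef(X, rowvar=False)

# Spearman rank-order correlation: Pearson correlation of the ranks
# (double argsort yields ranks; fine here since there are no ties).
ranks = X.argsort(axis=0).argsort(axis=0)
spearman = np.corrcoef(ranks, rowvar=False)

# Variance inflation factor for x1: regress x1 on the other predictors,
# then VIF = 1 / (1 - R^2). Values well above 1 signal multicollinearity.
A = np.column_stack([np.ones(n), X[:, 1:]])
beta, *_ = np.linalg.lstsq(A, X[:, 0], rcond=None)
resid = X[:, 0] - A @ beta
r2 = 1 - resid.var() / X[:, 0].var()
vif_x1 = 1 / (1 - r2)

# Mahalanobis distance of each observation from the sample mean,
# using the inverse sample covariance matrix.
mu = X.mean(axis=0)
cov_inv = np.linalg.inv(np.cov(X, rowvar=False))
d = np.sqrt(np.einsum('ij,jk,ik->i', X - mu, cov_inv, X - mu))

print(f"Pearson r(x1,x2)    = {pearson[0, 1]:.2f}")
print(f"Spearman rho(x1,x2) = {spearman[0, 1]:.2f}")
print(f"VIF(x1)             = {vif_x1:.2f}")
print(f"mean Mahalanobis d  = {d.mean():.2f}")
```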
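Items (3) and (4) can likewise be sketched for logistic regression: fit by Newton–Raphson, then compute the log-likelihood, deviance, AIC, and BIC, and the Neyman–Pearson likelihood-ratio chi-square against an intercept-only model. The simulated outcome and all names are illustrative assumptions, not data from the chapter.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 400
x = rng.normal(size=n)
# Simulated binary outcome (e.g. a hypothetical default flag) driven by x.
p_true = 1 / (1 + np.exp(-(0.5 + 1.5 * x)))
y = rng.binomial(1, p_true)

def fit_logistic(X, y, iters=25):
    """Newton-Raphson fit; returns coefficients and the log-likelihood."""
    beta = np.zeros(X.shape[1])
    for _ in range(iters):
        p = 1 / (1 + np.exp(-X @ beta))
        grad = X.T @ (y - p)                      # score vector
        hess = X.T @ (X * (p * (1 - p))[:, None]) # observed information
        beta += np.linalg.solve(hess, grad)
    p = 1 / (1 + np.exp(-X @ beta))
    ll = np.sum(y * np.log(p) + (1 - y) * np.log(1 - p))
    return beta, ll

X_full = np.column_stack([np.ones(n), x])  # intercept + predictor
X_null = np.ones((n, 1))                   # intercept-only model
beta_full, ll_full = fit_logistic(X_full, y)
_, ll_null = fit_logistic(X_null, y)

k = X_full.shape[1]
deviance = -2 * ll_full
aic = 2 * k - 2 * ll_full          # Akaike: penalty 2 per parameter
bic = k * np.log(n) - 2 * ll_full  # Bayesian: penalty ln(n) per parameter

# Likelihood-ratio chi-square: ~ chi-square with 1 df under the null
# that the predictor adds nothing beyond the intercept.
lr_chi2 = 2 * (ll_full - ll_null)
print(f"deviance = {deviance:.1f}, AIC = {aic:.1f}, BIC = {bic:.1f}")
print(f"LR chi-square = {lr_chi2:.1f}")
```

The same fitted model yields the other two members of the "trinity" in the usual way: the Wald chi-square from a coefficient squared over its estimated variance, and Rao's score chi-square from the gradient and information evaluated under the restricted model.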