Stats & Maths & Unicorns

Credit Intelligence & Modelling Pub Date : 2021-11-29 DOI:10.1093/oso/9780192844194.003.0011

Raymond A. Anderson

引用次数: 0

Abstract

This chapter covers basic statistical concepts. Most statistics relate to hypothesis testing, and others to variable selection and model fitting. The name is because an exact match between a theoretical and empirical distribution is as rare as a unicorn. (1) Dispersion—measures of random variations—variance and its inflation factor, covariance and correlations {Pearson’s product-moment, Spearman’s rank order}, and the Mahalanobis distance. (2) Goodness-of-fit—do observations match expectations? This applies to both continuous dependent variables {R-squared and adjusted R2} and categorical {Pearson’s chi-square, Hosmer–Lemeshow statistic}. (3) Likelihood—assesses estimates’ goodness-of-fit to binary dependent variables {log-likelihood, deviance}, plus the Akaike and Bayesian information criteria used to penalize complexity. (4) The Holy Trinity of Statistics—i) Neyman–Pearson’s ‘likelihood ratio’—the basis for model comparisons; ii) Wald’s chi-square—for potential variable removal; iii) Rao’s score chi-square—for potential variable inclusion. These are all used in Logistic Regression.

查看原文本刊更多论文

统计、数学和独角兽

本章涵盖基本的统计概念。大多数统计与假设检验有关，而其他统计与变量选择和模型拟合有关。之所以取这个名字，是因为理论分布和经验分布之间的精确匹配就像独角兽一样罕见。(1)离散度——随机方差的度量——方差及其膨胀因子、协方差和相关性{Pearson积矩、Spearman秩阶}和马氏距离。(2)拟合优度观察结果与预期相符?这适用于连续因变量{r平方和调整R2}和分类{皮尔逊卡方，Hosmer-Lemeshow统计}。(3)似然评估对二元因变量{对数似然，偏差}的拟合优度，加上用于惩罚复杂性的赤池和贝叶斯信息标准。(4)统计学的神圣三位一体——i)内曼-皮尔逊的“似然比”——模型比较的基础;ii)潜在变量去除的Wald卡方;iii)潜在变量包含的Rao分数卡方。这些都是在逻辑回归中用到的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Credit Intelligence & Modelling

自引率

0.00%

发文量