{"title":"Fitting an Equation to Data Impartially","authors":"Chris Tofallis","doi":"arxiv-2409.02573","DOIUrl":null,"url":null,"abstract":"We consider the problem of fitting a relationship (e.g. a potential\nscientific law) to data involving multiple variables. Ordinary (least squares)\nregression is not suitable for this because the estimated relationship will\ndiffer according to which variable is chosen as being dependent, and the\ndependent variable is unrealistically assumed to be the only variable which has\nany measurement error (noise). We present a very general method for estimating\na linear functional relationship between multiple noisy variables, which are\ntreated impartially, i.e. no distinction between dependent and independent\nvariables. The data are not assumed to follow any distribution, but all\nvariables are treated as being equally reliable. Our approach extends the\ngeometric mean functional relationship to multiple dimensions. This is\nespecially useful with variables measured in different units, as it is\nnaturally scale-invariant, whereas orthogonal regression is not. This is\nbecause our approach is not based on minimizing distances, but on the symmetric\nconcept of correlation. The estimated coefficients are easily obtained from the\ncovariances or correlations, and correspond to geometric means of associated\nleast squares coefficients. The ease of calculation will hopefully allow\nwidespread application of impartial fitting to estimate relationships in a\nneutral way.","PeriodicalId":501293,"journal":{"name":"arXiv - ECON - Econometrics","volume":"36 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - ECON - Econometrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.02573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We consider the problem of fitting a relationship (e.g. a potential
scientific law) to data involving multiple variables. Ordinary (least squares)
regression is not suitable for this because the estimated relationship will
differ according to which variable is chosen as being dependent, and the
dependent variable is unrealistically assumed to be the only variable which has
any measurement error (noise). We present a very general method for estimating
a linear functional relationship between multiple noisy variables, which are
treated impartially, i.e. no distinction between dependent and independent
variables. The data are not assumed to follow any distribution, but all
variables are treated as being equally reliable. Our approach extends the
geometric mean functional relationship to multiple dimensions. This is
especially useful with variables measured in different units, as it is
naturally scale-invariant, whereas orthogonal regression is not. This is
because our approach is not based on minimizing distances, but on the symmetric
concept of correlation. The estimated coefficients are easily obtained from the
covariances or correlations, and correspond to geometric means of associated
least squares coefficients. The ease of calculation will hopefully allow
widespread application of impartial fitting to estimate relationships in a
neutral way.