{"title":"针对不一致数据的简约通用加权平均法","authors":"Martino Trassinelli, Marleen Maxton","doi":"arxiv-2406.08293","DOIUrl":null,"url":null,"abstract":"The weighted average of inconsistent data is a common and tedious problem\nthat many scientists have encountered. The standard weighted average is not\nrecommended for these cases, and different alternative methods are proposed in\nthe literature. Here, we introduce a new method based on Bayesian statistics\nfor a broad application that keeps the number of assumptions to a minimum. The\nuncertainty associated with each input value is considered just a lower bound\nof the true unknown uncertainty. By assuming a non-informative (Jeffreys')\nprior for true uncertainty and marginalising over its value, a modified\nGaussian distribution is obtained with smoothly decreasing wings, which allows\nfor a better treatment of scattered data and outliers. The proposed method is\ntested on a series of data sets: simulations, CODATA recommended value of the\nNewtonian gravitational constant, and some particle properties from the\nParticle Data Group, including the proton charge radius and the mass of the W\nboson. For the latter in particular, contrary to other works, our prediction\nlies in good agreement with the Standard Model. A freely available Python\nlibrary is also provided for a simple implementation of our averaging method.","PeriodicalId":501065,"journal":{"name":"arXiv - PHYS - Data Analysis, Statistics and Probability","volume":"41 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A minimalistic and general weighted averaging method for inconsistent data\",\"authors\":\"Martino Trassinelli, Marleen Maxton\",\"doi\":\"arxiv-2406.08293\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The weighted average of inconsistent data is a common and tedious problem\\nthat many scientists have encountered. The standard weighted average is not\\nrecommended for these cases, and different alternative methods are proposed in\\nthe literature. Here, we introduce a new method based on Bayesian statistics\\nfor a broad application that keeps the number of assumptions to a minimum. The\\nuncertainty associated with each input value is considered just a lower bound\\nof the true unknown uncertainty. By assuming a non-informative (Jeffreys')\\nprior for true uncertainty and marginalising over its value, a modified\\nGaussian distribution is obtained with smoothly decreasing wings, which allows\\nfor a better treatment of scattered data and outliers. The proposed method is\\ntested on a series of data sets: simulations, CODATA recommended value of the\\nNewtonian gravitational constant, and some particle properties from the\\nParticle Data Group, including the proton charge radius and the mass of the W\\nboson. For the latter in particular, contrary to other works, our prediction\\nlies in good agreement with the Standard Model. A freely available Python\\nlibrary is also provided for a simple implementation of our averaging method.\",\"PeriodicalId\":501065,\"journal\":{\"name\":\"arXiv - PHYS - Data Analysis, Statistics and Probability\",\"volume\":\"41 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - PHYS - Data Analysis, Statistics and Probability\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2406.08293\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Data Analysis, Statistics and Probability","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.08293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
不一致数据的加权平均是许多科学家都遇到过的一个常见而又繁琐的问题。标准加权平均法不推荐用于这些情况,文献中提出了不同的替代方法。在此,我们介绍一种基于贝叶斯统计的新方法,它应用广泛,能将假设的数量保持在最低水平。与每个输入值相关的不确定性被视为真实未知不确定性的下限。通过假设真实不确定性的非信息(杰弗里斯)先验值并对其值进行边际化处理,可以得到一个具有平滑递减翼的修正高斯分布,从而可以更好地处理分散数据和异常值。所提出的方法在一系列数据集上进行了测试:模拟、牛顿引力常数的 CODATA 推荐值,以及粒子数据组的一些粒子特性,包括质子电荷半径和 W 玻色子质量。特别是后者,与其他工作相反,我们的预测与标准模型非常一致。我们还提供了一个免费的 Python 库,用于简单实现我们的平均方法。
A minimalistic and general weighted averaging method for inconsistent data
The weighted average of inconsistent data is a common and tedious problem
that many scientists have encountered. The standard weighted average is not
recommended for these cases, and different alternative methods are proposed in
the literature. Here, we introduce a new method based on Bayesian statistics
for a broad application that keeps the number of assumptions to a minimum. The
uncertainty associated with each input value is considered just a lower bound
of the true unknown uncertainty. By assuming a non-informative (Jeffreys')
prior for true uncertainty and marginalising over its value, a modified
Gaussian distribution is obtained with smoothly decreasing wings, which allows
for a better treatment of scattered data and outliers. The proposed method is
tested on a series of data sets: simulations, CODATA recommended value of the
Newtonian gravitational constant, and some particle properties from the
Particle Data Group, including the proton charge radius and the mass of the W
boson. For the latter in particular, contrary to other works, our prediction
lies in good agreement with the Standard Model. A freely available Python
library is also provided for a simple implementation of our averaging method.