{"title":"Bootstrap-determined p-values in Lattice QCD","authors":"Norman Christ, Rajiv Eranki, Christopher Kelly","doi":"arxiv-2409.11379","DOIUrl":null,"url":null,"abstract":"We present a general method to determine the probability that stochastic\nMonte Carlo data, in particular those generated in a lattice QCD calculation,\nwould have been obtained were that data drawn from the distribution predicted\nby a given theoretical hypothesis. Such a probability, or p-value, is often\nused as an important heuristic measure of the validity of that hypothesis. The\nproposed method offers the benefit that it remains usable in cases where the\nstandard Hotelling $T^2$ methods based on the conventional $\\chi^2$ statistic\ndo not apply, such as for uncorrelated fits. Specifically, we analyze a general\nalternative to the correlated $\\chi^2$ statistic referred to as $q^2$, and show\nhow to use the bootstrap as a data-driven method to determine the expected\ndistribution of $q^2$ for a given hypothesis with minimal assumptions. This\ndistribution can then be used to determine the p-value for a fit to the data.\nWe also describe a bootstrap approach for quantifying the impact upon this\np-value of estimating population parameters from a single ensemble of $N$\nsamples. The overall method is accurate up to a $1/N$ bias which we do not\nattempt to quantify.","PeriodicalId":501191,"journal":{"name":"arXiv - PHYS - High Energy Physics - Lattice","volume":"39 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - High Energy Physics - Lattice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11379","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We present a general method to determine the probability that stochastic
Monte Carlo data, in particular those generated in a lattice QCD calculation,
would have been obtained were that data drawn from the distribution predicted
by a given theoretical hypothesis. Such a probability, or p-value, is often
used as an important heuristic measure of the validity of that hypothesis. The
proposed method offers the benefit that it remains usable in cases where the
standard Hotelling $T^2$ methods based on the conventional $\chi^2$ statistic
do not apply, such as for uncorrelated fits. Specifically, we analyze a general
alternative to the correlated $\chi^2$ statistic referred to as $q^2$, and show
how to use the bootstrap as a data-driven method to determine the expected
distribution of $q^2$ for a given hypothesis with minimal assumptions. This
distribution can then be used to determine the p-value for a fit to the data.
We also describe a bootstrap approach for quantifying the impact upon this
p-value of estimating population parameters from a single ensemble of $N$
samples. The overall method is accurate up to a $1/N$ bias which we do not
attempt to quantify.