{"title":"Upper tails for arithmetic progressions revisited","authors":"Matan Harel, Frank Mousset, Wojciech Samotij","doi":"arxiv-2409.08383","DOIUrl":null,"url":null,"abstract":"Let $X$ be the number of $k$-term arithmetic progressions contained in the\n$p$-biased random subset of the first $N$ positive integers. We give\nasymptotically sharp estimates on the logarithmic upper-tail probability $\\log\n\\Pr(X \\ge E[X] + t)$ for all $\\Omega(N^{-2/k}) \\le p \\ll 1$ and all $t \\gg\n\\sqrt{Var(X)}$, excluding only a few boundary cases. In particular, we show\nthat the space of parameters $(p,t)$ is partitioned into three\nphenomenologically distinct regions, where the upper-tail probabilities either\nresemble those of Gaussian or Poisson random variables, or are naturally\ndescribed by the probability of appearance of a small set that contains nearly\nall of the excess $t$ progressions. We employ a variety of tools from\nprobability theory, including classical tilting arguments and martingale\nconcentration inequalities. However, the main technical innovation is a\ncombinatorial result that establishes a stronger version of `entropic\nstability' for sets with rich arithmetic structure.","PeriodicalId":501245,"journal":{"name":"arXiv - MATH - Probability","volume":"54 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - MATH - Probability","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08383","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Let $X$ be the number of $k$-term arithmetic progressions contained in the
$p$-biased random subset of the first $N$ positive integers. We give
asymptotically sharp estimates on the logarithmic upper-tail probability $\log
\Pr(X \ge E[X] + t)$ for all $\Omega(N^{-2/k}) \le p \ll 1$ and all $t \gg
\sqrt{Var(X)}$, excluding only a few boundary cases. In particular, we show
that the space of parameters $(p,t)$ is partitioned into three
phenomenologically distinct regions, where the upper-tail probabilities either
resemble those of Gaussian or Poisson random variables, or are naturally
described by the probability of appearance of a small set that contains nearly
all of the excess $t$ progressions. We employ a variety of tools from
probability theory, including classical tilting arguments and martingale
concentration inequalities. However, the main technical innovation is a
combinatorial result that establishes a stronger version of `entropic
stability' for sets with rich arithmetic structure.