{"title":"Strong posterior contraction rates via Wasserstein dynamics","authors":"Emanuele Dolera, Stefano Favaro, Edoardo Mainini","doi":"10.1007/s00440-024-01260-w","DOIUrl":"https://doi.org/10.1007/s00440-024-01260-w","url":null,"abstract":"<p>In Bayesian statistics, posterior contraction rates (PCRs) quantify the speed at which the posterior distribution concentrates on arbitrarily small neighborhoods of a true model, in a suitable way, as the sample size goes to infinity. In this paper, we develop a new approach to PCRs, with respect to strong norm distances on parameter spaces of functions. Critical to our approach is the combination of a local Lipschitz-continuity for the posterior distribution with a dynamic formulation of the Wasserstein distance, which allows to set forth an interesting connection between PCRs and some classical problems arising in mathematical analysis, probability and statistics, e.g., Laplace methods for approximating integrals, Sanov’s large deviation principles in the Wasserstein distance, rates of convergence of mean Glivenko–Cantelli theorems, and estimates of weighted Poincaré–Wirtinger constants. We first present a theorem on PCRs for a model in the regular infinite-dimensional exponential family, which exploits sufficient statistics of the model, and then extend such a theorem to a general dominated model. These results rely on the development of novel techniques to evaluate Laplace integrals and weighted Poincaré–Wirtinger constants in infinite-dimension, which are of independent interest. The proposed approach is applied to the regular parametric model, the multinomial model, the finite-dimensional and the infinite-dimensional logistic-Gaussian model and the infinite-dimensional linear regression. In general, our approach leads to optimal PCRs in finite-dimensional models, whereas for infinite-dimensional models it is shown explicitly how the prior distribution affect PCRs.\u0000</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139952388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Geometry of the minimal spanning tree in the heavy-tailed regime: new universality classes","authors":"Shankar Bhamidi, Sanchayan Sen","doi":"10.1007/s00440-024-01259-3","DOIUrl":"https://doi.org/10.1007/s00440-024-01259-3","url":null,"abstract":"<p>A well-known open problem on the behavior of optimal paths in random graphs in the strong disorder regime, formulated by statistical physicists, and supported by a large amount of numerical evidence over the last decade (Braunstein et al. in Phys Rev Lett 91(16):168701, 2003; Braunstein et al. in Int J Bifurc Chaos 17(07):2215–2255, 2007; Chen et al. in Phys Rev Lett 96(6):068702, 2006; Wu et al. in Phys Rev Lett 96(14):148702, 2006) is as follows: for a large class of random graph models with degree exponent <span>(tau in (3,4))</span>, distances in the minimal spanning tree (MST) on the giant component in the supercritical regime scale like <span>(n^{(tau -3)/(tau -1)})</span>. The aim of this paper is to make progress towards a proof of this conjecture. We consider a supercritical inhomogeneous random graph model with degree exponent <span>(tau in (3, 4))</span> that is closely related to Aldous’s multiplicative coalescent, and show that the MST constructed by assigning i.i.d. continuous weights to the edges in its giant component, endowed with the tree distance scaled by <span>(n^{-(tau -3)/(tau -1)})</span>, converges in distribution with respect to the Gromov–Hausdorff topology to a random compact real tree. Further, almost surely, every point in this limiting space either has degree one (leaf), or two, or infinity (hub), both the set of leaves and the set of hubs are dense in this space, and the Minkowski dimension of this space equals <span>((tau -1)/(tau -3))</span>. The multiplicative coalescent, in an asymptotic sense, describes the evolution of the component sizes of various near-critical random graph processes. We expect the limiting spaces in this paper to be the candidates for the scaling limit of the MST constructed for a wide array of other heavy-tailed random graph models.\u0000</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139903742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The critical variational setting for stochastic evolution equations","authors":"Antonio Agresti, Mark Veraar","doi":"10.1007/s00440-023-01249-x","DOIUrl":"https://doi.org/10.1007/s00440-023-01249-x","url":null,"abstract":"<p>In this paper we introduce the critical variational setting for parabolic stochastic evolution equations of quasi- or semi-linear type. Our results improve many of the abstract results in the classical variational setting. In particular, we are able to replace the usual weak or local monotonicity condition by a more flexible local Lipschitz condition. Moreover, the usual growth conditions on the multiplicative noise are weakened considerably. Our new setting provides general conditions under which local and global existence and uniqueness hold. In addition, we prove continuous dependence on the initial data. We show that many classical SPDEs, which could not be covered by the classical variational setting, do fit in the critical variational setting. In particular, this is the case for the Cahn–Hilliard equation, tamed Navier–Stokes equations, and Allen–Cahn equation.</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139664773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Geometric bounds on the fastest mixing Markov chain","authors":"Sam Olesker-Taylor, Luca Zanetti","doi":"10.1007/s00440-023-01257-x","DOIUrl":"https://doi.org/10.1007/s00440-023-01257-x","url":null,"abstract":"<p>In the Fastest Mixing Markov Chain problem, we are given a graph <span>(G = (V, E))</span> and desire the discrete-time Markov chain with smallest mixing time <span>(tau )</span> subject to having equilibrium distribution uniform on <i>V</i> and non-zero transition probabilities only across edges of the graph. It is well-known that the mixing time <span>(tau _textsf {RW})</span> of the lazy random walk on <i>G</i> is characterised by the edge conductance <span>(Phi )</span> of <i>G</i> via Cheeger’s inequality: <span>(Phi ^{-1} lesssim tau _textsf {RW} lesssim Phi ^{-2} log |V|)</span>. Analogously, we characterise the fastest mixing time <span>(tau ^star )</span> via a Cheeger-type inequality but for a different geometric quantity, namely the vertex conductance <span>(Psi )</span> of <i>G</i>: <span>(Psi ^{-1} lesssim tau ^star lesssim Psi ^{-2} (log |V|)^2)</span>. This characterisation forbids fast mixing for graphs with small vertex conductance. To bypass this fundamental barrier, we consider Markov chains on <i>G</i> with equilibrium distribution which need not be uniform, but rather only <span>(varepsilon )</span>-close to uniform in total variation. We show that it is always possible to construct such a chain with mixing time <span>(tau lesssim varepsilon ^{-1} ({text {diam}} G)^2 log |V|)</span>. Finally, we discuss analogous questions for continuous-time and time-inhomogeneous chains.</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139648394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tractability from overparametrization: the example of the negative perceptron","authors":"Andrea Montanari, Yiqiao Zhong, Kangjie Zhou","doi":"10.1007/s00440-023-01248-y","DOIUrl":"https://doi.org/10.1007/s00440-023-01248-y","url":null,"abstract":"<p>In the negative perceptron problem we are given <i>n</i> data points <span>((varvec{x}_i,y_i))</span>, where <span>(varvec{x}_i)</span> is a <i>d</i>-dimensional vector and <span>(y_iin {+1,-1})</span> is a binary label. The data are not linearly separable and hence we content ourselves to find a linear classifier with the largest possible <i>negative</i> margin. In other words, we want to find a unit norm vector <span>(varvec{theta })</span> that maximizes <span>(min _{ile n}y_ilangle varvec{theta },varvec{x}_irangle )</span>. This is a non-convex optimization problem (it is equivalent to finding a maximum norm vector in a polytope), and we study its typical properties under two random models for the data. We consider the proportional asymptotics in which <span>(n,drightarrow infty )</span> with <span>(n/drightarrow delta )</span>, and prove upper and lower bounds on the maximum margin <span>(kappa _{{textrm{s}}}(delta ))</span> or—equivalently—on its inverse function <span>(delta _{{textrm{s}}}(kappa ))</span>. In other words, <span>(delta _{{textrm{s}}}(kappa ))</span> is the overparametrization threshold: for <span>(n/dle delta _{{textrm{s}}}(kappa )-{varepsilon })</span> a classifier achieving vanishing training error exists with high probability, while for <span>(n/dge delta _{{textrm{s}}}(kappa )+{varepsilon })</span> it does not. Our bounds on <span>(delta _{{textrm{s}}}(kappa ))</span> match to the leading order as <span>(kappa rightarrow -infty )</span>. We then analyze a linear programming algorithm to find a solution, and characterize the corresponding threshold <span>(delta _{textrm{lin}}(kappa ))</span>. We observe a gap between the interpolation threshold <span>(delta _{{textrm{s}}}(kappa ))</span> and the linear programming threshold <span>(delta _{textrm{lin}}(kappa ))</span>, raising the question of the behavior of other algorithms.\u0000</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139558969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"W-entropy and Langevin deformation on Wasserstein space over Riemannian manifolds","authors":"Songzi Li, Xiang-Dong Li","doi":"10.1007/s00440-023-01256-y","DOIUrl":"https://doi.org/10.1007/s00440-023-01256-y","url":null,"abstract":"<p>We prove the Perelman type <i>W</i>-entropy formula for the geodesic flow on the <span>(L^2)</span>-Wasserstein space over a complete Riemannian manifold equipped with Otto’s infinite dimensional Riemannian metric. To better understand the similarity between the <i>W</i>-entropy formula for the geodesic flow on the Wasserstein space and the <i>W</i>-entropy formula for the heat flow of the Witten Laplacian on the underlying manifold, we introduce the Langevin deformation of flows on the Wasserstein space over a Riemannian manifold, which interpolates the gradient flow and the geodesic flow on the Wasserstein space over a Riemannian manifold, and can be regarded as the potential flow of the compressible Euler equation with damping on a Riemannian manifold. We prove the existence, uniqueness and regularity of the Langevin deformation on the Wasserstein space over the Euclidean space and a compact Riemannian manifold, and prove the convergence of the Langevin deformation for <span>(crightarrow 0)</span> and <span>(crightarrow infty )</span> respectively. Moreover, we prove the <i>W</i>-entropy-information formula along the Langevin deformation on the Wasserstein space on Riemannian manifolds. The rigidity theorems are proved for the <i>W</i>-entropy for the geodesic flow and the Langevin deformation on the Wasserstein space over complete Riemannian manifolds with the CD(0, <i>m</i>)-condition. Our results are new even in the case of Euclidean spaces and complete Riemannian manifolds with non-negative Ricci curvature.</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139500645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Alice Contat, Nicolas Curien, Perrine Lacroix, Etienne Lasalle, Vincent Rivoirard
{"title":"Eve, Adam and the preferential attachment tree","authors":"Alice Contat, Nicolas Curien, Perrine Lacroix, Etienne Lasalle, Vincent Rivoirard","doi":"10.1007/s00440-023-01253-1","DOIUrl":"https://doi.org/10.1007/s00440-023-01253-1","url":null,"abstract":"<p>We consider the problem of finding the initial vertex (Adam) in a Barabási–Albert tree process <span>( (mathcal {T}(n): n ge 1))</span> at large times. More precisely, given <span>( varepsilon >0)</span>, one wants to output a subset <span>( mathcal {P}_{ varepsilon }(n))</span> of vertices of <span>( mathcal {T}(n))</span> so that the initial vertex belongs to <span>( mathcal {P}_ varepsilon (n))</span> with probability at least <span>(1- varepsilon )</span> when <i>n</i> is large. It has been shown by Bubeck, Devroye and Lugosi, refined later by Banerjee and Huang, that one needs to output at least <span>( varepsilon ^{-1 + o(1)})</span> and at most <span>(varepsilon ^{-2 + o(1)})</span> vertices to succeed. We prove that the exponent in the lower bound is sharp and the key idea is that Adam is either a “large degree\" vertex or is a neighbor of a “large degree\" vertex (Eve).\u0000</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139459075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Superconcentration for minimal surfaces in first passage percolation and disordered Ising ferromagnets","authors":"Barbara Dembin, Christophe Garban","doi":"10.1007/s00440-023-01252-2","DOIUrl":"https://doi.org/10.1007/s00440-023-01252-2","url":null,"abstract":"<p>We consider the standard first passage percolation model on <span>({mathbb {Z}}^ d)</span> with a distribution <i>G</i> taking two values <span>(0<a<b)</span>. We study the maximal flow through the cylinder <span>([0,n]^ {d-1}times [0,hn])</span> between its top and bottom as well as its associated minimal surface(s). We prove that the variance of the maximal flow is superconcentrated, i.e. in <span>(O(frac{n^{d-1}}{log n}))</span>, for <span>(hge h_0)</span> (for a large enough constant <span>(h_0=h_0(a,b))</span>). Equivalently, we obtain that the ground state energy of a disordered Ising ferromagnet in a cylinder <span>([0,n]^ {d-1}times [0,hn])</span> is superconcentrated when opposite boundary conditions are applied at the top and bottom faces and for a large enough constant <span>(hge h_0)</span> (which depends on the law of the coupling constants). Our proof is inspired by the proof of Benjamini–Kalai–Schramm (Ann Probab 31:1970–1978, 2003). Yet, one major difficulty in this setting is to control the influence of the edges since the averaging trick used in Benjamini et al. (Ann Probab 31:1970–1978, 2003) fails for surfaces. Of independent interest, we prove that minimal surfaces (in the present discrete setting) cannot have long thin chimneys.</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139103070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On the Wiener chaos expansion of the signature of a Gaussian process","authors":"Thomas Cass, Emilio Ferrucci","doi":"10.1007/s00440-023-01255-z","DOIUrl":"https://doi.org/10.1007/s00440-023-01255-z","url":null,"abstract":"<p>We compute the Wiener chaos decomposition of the signature for a class of Gaussian processes, which contains fractional Brownian motion (fBm) with Hurst parameter <span>(H in (1/4,1))</span>. At level 0, our result yields an expression for the expected signature of such processes, which determines their law (Chevyrev and Lyons in Ann Probab 44(6):4049–4082, 2016). In particular, this formula simultaneously extends both the one for <span>(1/2 < H)</span>-fBm (Baudoin and Coutin in Stochast Process Appl 117(5):550–574, 2007) and the one for Brownian motion (<span>(H = 1/2)</span>) (Fawcett 2003), to the general case <span>(H > 1/4)</span>, thereby resolving an established open problem. Other processes studied include continuous and centred Gaussian semimartingales.</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139103124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Annealed quantitative estimates for the quadratic 2D-discrete random matching problem","authors":"Nicolas Clozeau, Francesco Mattesini","doi":"10.1007/s00440-023-01254-0","DOIUrl":"https://doi.org/10.1007/s00440-023-01254-0","url":null,"abstract":"<p>We study a random matching problem on closed compact 2-dimensional Riemannian manifolds (with respect to the squared Riemannian distance), with samples of random points whose common law is absolutely continuous with respect to the volume measure with strictly positive and bounded density. We show that given two sequences of numbers <i>n</i> and <span>(m=m(n))</span> of points, asymptotically equivalent as <i>n</i> goes to infinity, the optimal transport plan between the two empirical measures <span>(mu ^n)</span> and <span>(nu ^{m})</span> is quantitatively well-approximated by <span>(big (text {Id},exp (nabla h^{n})big )_#mu ^n)</span> where <span>(h^{n})</span> solves a linear elliptic PDE obtained by a regularized first-order linearization of the Monge–Ampère equation. This is obtained in the case of samples of correlated random points for which a stretched exponential decay of the <span>(alpha )</span>-mixing coefficient holds and for a class of discrete-time sub-geometrically ergodic Markov chains having a unique absolutely continuous invariant measure with respect to the volume measure.</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139103417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}