{"title":"New results for the random nearest neighbor tree","authors":"Lyuben Lichev, Dieter Mitsche","doi":"10.1007/s00440-024-01268-2","DOIUrl":null,"url":null,"abstract":"<p>In this paper, we study the online nearest neighbor random tree in dimension <span>\\(d\\in {\\mathbb {N}}\\)</span> (called <i>d</i>-NN tree for short) defined as follows. We fix the torus <span>\\({\\mathbb {T}}^d_n\\)</span> of dimension <i>d</i> and area <i>n</i> and equip it with the metric inherited from the Euclidean metric in <span>\\({\\mathbb {R}}^d\\)</span>. Then, embed consecutively <i>n</i> vertices in <span>\\({\\mathbb {T}}^d_n\\)</span> uniformly at random and independently, and let each vertex but the first one connect to its (already embedded) nearest neighbor. Call the resulting graph <span>\\(G_n\\)</span>. We show multiple results concerning the degree sequence of <span>\\(G_n\\)</span>. First, we prove that typically the number of vertices of degree at least <span>\\(k\\in {\\mathbb {N}}\\)</span> in the <i>d</i>-NN tree decreases exponentially with <i>k</i> and is tightly concentrated by a new Lipschitz-type concentration inequality that may be of independent interest. Second, we obtain that the maximum degree of <span>\\(G_n\\)</span> is of logarithmic order. Third, we give explicit bounds for the number of leaves that are independent of the dimension and also give estimates for the number of paths of length two. Moreover, we show that typically the height of a uniformly chosen vertex in <span>\\(G_n\\)</span> is <span>\\((1+o(1))\\log n\\)</span> and the diameter of <span>\\({\\mathbb {T}}^d_n\\)</span> is <span>\\((2e+o(1))\\log n\\)</span>, independently of the dimension. Finally, we define a natural infinite analog <span>\\(G_{\\infty }\\)</span> of <span>\\(G_n\\)</span> and show that it corresponds to the local limit of the sequence of finite graphs <span>\\((G_n)_{n \\ge 1}\\)</span>. Furthermore, we prove almost surely that <span>\\(G_{\\infty }\\)</span> is locally finite, that the simple random walk on <span>\\(G_{\\infty }\\)</span> is recurrent, and that <span>\\(G_{\\infty }\\)</span> is connected.</p>","PeriodicalId":20527,"journal":{"name":"Probability Theory and Related Fields","volume":"19 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Probability Theory and Related Fields","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s00440-024-01268-2","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we study the online nearest neighbor random tree in dimension \(d\in {\mathbb {N}}\) (called d-NN tree for short) defined as follows. We fix the torus \({\mathbb {T}}^d_n\) of dimension d and area n and equip it with the metric inherited from the Euclidean metric in \({\mathbb {R}}^d\). Then, embed consecutively n vertices in \({\mathbb {T}}^d_n\) uniformly at random and independently, and let each vertex but the first one connect to its (already embedded) nearest neighbor. Call the resulting graph \(G_n\). We show multiple results concerning the degree sequence of \(G_n\). First, we prove that typically the number of vertices of degree at least \(k\in {\mathbb {N}}\) in the d-NN tree decreases exponentially with k and is tightly concentrated by a new Lipschitz-type concentration inequality that may be of independent interest. Second, we obtain that the maximum degree of \(G_n\) is of logarithmic order. Third, we give explicit bounds for the number of leaves that are independent of the dimension and also give estimates for the number of paths of length two. Moreover, we show that typically the height of a uniformly chosen vertex in \(G_n\) is \((1+o(1))\log n\) and the diameter of \({\mathbb {T}}^d_n\) is \((2e+o(1))\log n\), independently of the dimension. Finally, we define a natural infinite analog \(G_{\infty }\) of \(G_n\) and show that it corresponds to the local limit of the sequence of finite graphs \((G_n)_{n \ge 1}\). Furthermore, we prove almost surely that \(G_{\infty }\) is locally finite, that the simple random walk on \(G_{\infty }\) is recurrent, and that \(G_{\infty }\) is connected.
期刊介绍:
Probability Theory and Related Fields publishes research papers in modern probability theory and its various fields of application. Thus, subjects of interest include: mathematical statistical physics, mathematical statistics, mathematical biology, theoretical computer science, and applications of probability theory to other areas of mathematics such as combinatorics, analysis, ergodic theory and geometry. Survey papers on emerging areas of importance may be considered for publication. The main languages of publication are English, French and German.