The trace-reinforced ants process does not find shortest paths

Daniel Kious, Cécile Mailler, Bruno Schapira
{"title":"The trace-reinforced ants process does not find shortest paths","authors":"Daniel Kious, Cécile Mailler, Bruno Schapira","doi":"10.5802/jep.188","DOIUrl":null,"url":null,"abstract":"In this paper, we study a probabilistic reinforcement-learning model for ants searching for the shortest path(s) between their nest and a source of food. In this model, the nest and the source of food are two distinguished nodes $N$ and $F$ in a finite graph $\\mathcal G$. The ants perform a sequence of random walks on this graph, starting from the nest and stopped when first hitting the source of food. At each step of its random walk, the $n$-th ant chooses to cross a neighbouring edge with probability proportional to the number of preceding ants that crossed that edge at least once. We say that {\\it the ants find the shortest path} if, almost surely as the number of ants grow to infinity, almost all the ants go from the nest to the source of food through one of the shortest paths, without loosing time on other edges of the graph. \nOur contribution is three-fold: (1) We prove that, if $\\mathcal G$ is a tree rooted at $N$ whose leaves have been merged into node $F$, and with one edge between $N$ and $F$, then the ants indeed find the shortest path. (2) In contrast, we provide three examples of graphs on which the ants do not find the shortest path, suggesting that in this model and in most graphs, ants do not find the shortest path. (3) In all these cases, we show that the sequence of normalised edge-weights converge to a {\\it deterministic} limit, despite a linear-reinforcement mechanism, and we conjecture that this is a general fact which is valid on all finite graphs. To prove these results, we use stochastic approximation methods, and in particular the ODE method. One difficulty comes from the fact that this method relies on understanding the behaviour at large times of the solution of a non-linear, multi-dimensional ODE.","PeriodicalId":106406,"journal":{"name":"Journal de l’École polytechnique — Mathématiques","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal de l’École polytechnique — Mathématiques","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5802/jep.188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

In this paper, we study a probabilistic reinforcement-learning model for ants searching for the shortest path(s) between their nest and a source of food. In this model, the nest and the source of food are two distinguished nodes $N$ and $F$ in a finite graph $\mathcal G$. The ants perform a sequence of random walks on this graph, starting from the nest and stopped when first hitting the source of food. At each step of its random walk, the $n$-th ant chooses to cross a neighbouring edge with probability proportional to the number of preceding ants that crossed that edge at least once. We say that {\it the ants find the shortest path} if, almost surely as the number of ants grow to infinity, almost all the ants go from the nest to the source of food through one of the shortest paths, without loosing time on other edges of the graph. Our contribution is three-fold: (1) We prove that, if $\mathcal G$ is a tree rooted at $N$ whose leaves have been merged into node $F$, and with one edge between $N$ and $F$, then the ants indeed find the shortest path. (2) In contrast, we provide three examples of graphs on which the ants do not find the shortest path, suggesting that in this model and in most graphs, ants do not find the shortest path. (3) In all these cases, we show that the sequence of normalised edge-weights converge to a {\it deterministic} limit, despite a linear-reinforcement mechanism, and we conjecture that this is a general fact which is valid on all finite graphs. To prove these results, we use stochastic approximation methods, and in particular the ODE method. One difficulty comes from the fact that this method relies on understanding the behaviour at large times of the solution of a non-linear, multi-dimensional ODE.
踪迹强化蚂蚁的过程不会找到最短路径
在本文中,我们研究了蚂蚁寻找巢穴和食物来源之间最短路径的概率强化学习模型。在这个模型中,巢和食物来源是有限图中两个不同的节点$N$和$F$。蚂蚁在这张图上执行一系列随机行走,从巢穴开始,当第一次到达食物来源时停止。在随机行走的每一步中,第n只蚂蚁选择穿过相邻的边,其概率与之前至少一次穿过该边的蚂蚁的数量成正比。我们说,如果蚂蚁找到了最短路径,几乎可以肯定,当蚂蚁的数量增长到无穷大时,几乎所有的蚂蚁都通过一条最短路径从巢穴到食物来源,而不会在图的其他边损失时间。我们的贡献有三方面:(1)我们证明了,如果$\mathcal G$是一棵扎根于$N$的树,它的叶子被合并到$F$节点中,并且在$N$和$F$之间有一条边,那么蚂蚁确实找到了最短路径。(2)相比之下,我们提供了三个蚂蚁不能找到最短路径的图的例子,这表明在这个模型和大多数图中,蚂蚁不能找到最短路径。(3)在所有这些情况下,我们证明了归一化边权序列收敛于{\it确定性}极限,尽管存在线性强化机制,并且我们推测这是一个在所有有限图上有效的一般事实。为了证明这些结果,我们使用了随机逼近方法,特别是ODE方法。一个困难来自于这样一个事实,即这种方法依赖于对非线性、多维ODE解的大时间行为的理解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信