A Reinforcement Learning Approach to Find Optimal Propulsion Strategy for Microrobots Swimming at Low Reynolds Number

IF 4.3 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Robotics and Autonomous Systems Pub Date : 2024-02-08 DOI:10.1016/j.robot.2024.104659

Iman Jebellat , Ehsan Jebellat , Alireza Amiri-Margavi , Amin Vahidi-Moghaddam , Hossein Nejat Pishkenari

{"title":"A Reinforcement Learning Approach to Find Optimal Propulsion Strategy for Microrobots Swimming at Low Reynolds Number","authors":"Iman Jebellat , Ehsan Jebellat , Alireza Amiri-Margavi , Amin Vahidi-Moghaddam , Hossein Nejat Pishkenari","doi":"10.1016/j.robot.2024.104659","DOIUrl":null,"url":null,"abstract":"<div><p>The development of artificial microscopic robots, like synthetic microswimmers, is one of the state-of-the-art research topics due to their promising biomedical applications. The movement of microswimmers is affected by stringent constraints because of the low Reynolds number of the surrounding environment. Researchers have been working on enhancing microrobots’ propulsion tactics and figuring out new approaches to find optimal propulsion strategies. In this research, we employ a Reinforcement Learning (RL) algorithm, specifically Q-Learning, to train linear-shaped microrobots, comprised of spheres and rods, by introducing an innovative and pioneer coding approach, termed “Basic Coding”, which is utilized to specify states and actions within the RL framework. Basic Coding is a powerful general method that can be employed for different agents in any discrete RL environment. We show how to apply Basic Coding to various microrobots with different geometrical configurations, like a triangular one. Our smart microswimmers, with different numbers of spheres, acquire the knowledge of the optimal propulsion cycle to accomplish large net mechanical displacement without relying on any pre-existing locomotion expertise. The three-sphere linear microrobot recovers the cycle Najafi and Golestanian suggested. The N-sphere microrobots with higher degrees of freedom can find the optimal propulsion cycle within a reasonable number of learning steps and low computational cost utilizing our RL and Basic Coding approach, while the learning step number significantly increases using other methods like Brute-force search. For example, we show this number for the 5-sphere microrobot is 1.19E03 and 8.97E10 steps using our methodology and Brute-force, respectively. Furthermore, our intelligent microrobots can successfully and adaptively find new optimal strategies in indeterministic environments in the presence of uncertainty. Moreover, the effects of learning parameters on our RL agents are investigated in this work.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889024000423","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

The development of artificial microscopic robots, like synthetic microswimmers, is one of the state-of-the-art research topics due to their promising biomedical applications. The movement of microswimmers is affected by stringent constraints because of the low Reynolds number of the surrounding environment. Researchers have been working on enhancing microrobots’ propulsion tactics and figuring out new approaches to find optimal propulsion strategies. In this research, we employ a Reinforcement Learning (RL) algorithm, specifically Q-Learning, to train linear-shaped microrobots, comprised of spheres and rods, by introducing an innovative and pioneer coding approach, termed “Basic Coding”, which is utilized to specify states and actions within the RL framework. Basic Coding is a powerful general method that can be employed for different agents in any discrete RL environment. We show how to apply Basic Coding to various microrobots with different geometrical configurations, like a triangular one. Our smart microswimmers, with different numbers of spheres, acquire the knowledge of the optimal propulsion cycle to accomplish large net mechanical displacement without relying on any pre-existing locomotion expertise. The three-sphere linear microrobot recovers the cycle Najafi and Golestanian suggested. The N-sphere microrobots with higher degrees of freedom can find the optimal propulsion cycle within a reasonable number of learning steps and low computational cost utilizing our RL and Basic Coding approach, while the learning step number significantly increases using other methods like Brute-force search. For example, we show this number for the 5-sphere microrobot is 1.19E03 and 8.97E10 steps using our methodology and Brute-force, respectively. Furthermore, our intelligent microrobots can successfully and adaptively find new optimal strategies in indeterministic environments in the presence of uncertainty. Moreover, the effects of learning parameters on our RL agents are investigated in this work.

Abstract Image

查看原文本刊更多论文

为在低雷诺数条件下游泳的微型机器人寻找最佳推进策略的强化学习方法

人工微观机器人（如合成微泳器）具有广阔的生物医学应用前景，因此其开发是最前沿的研究课题之一。由于周围环境的雷诺数较低，微型游泳者的运动受到严格的限制。研究人员一直致力于加强微型机器人的推进策略，并找出新的方法来寻找最佳推进策略。在这项研究中，我们采用了一种强化学习（RL）算法，特别是 Q-Learning 算法，来训练由球体和棒体组成的线型微机器人，并引入了一种创新的先驱编码方法，即 "基本编码"，用于在 RL 框架内指定状态和行动。基本编码是一种强大的通用方法，可用于任何离散 RL 环境中的不同代理。我们展示了如何将 "基本编码 "应用于各种具有不同几何结构（如三角形）的微型机器人。我们的智能微型游泳机器人拥有不同数量的球体，可以获得最佳推进周期的知识，从而完成较大的净机械位移，而无需依赖任何已有的运动专业知识。三球线性微机器人恢复了纳杰菲和戈勒斯坦建议的循环。自由度较高的 N 球微型机器人可以利用我们的 RL 和基本编码方法，在合理的学习步数和较低的计算成本内找到最佳推进循环，而使用其他方法（如蛮力搜索），学习步数会显著增加。例如，我们的研究表明，使用我们的方法和蛮力搜索，5 球微型机器人的学习步数分别为 1.19E03 和 8.97E10。此外，在存在不确定性的情况下，我们的智能微型机器人能够在不确定环境中成功地、自适应地找到新的最优策略。此外，这项工作还研究了学习参数对我们的 RL 代理的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Robotics and Autonomous Systems 工程技术-机器人学

CiteScore

9.00

自引率

7.00%

发文量

164

审稿时长

4.5 months

期刊介绍： Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems. Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.