基于强化学习的类鱼智能体在不同流体环境中的鲁棒导航。

IF 3 3区计算机科学 Q1 ENGINEERING, MULTIDISCIPLINARY

Bioinspiration & Biomimetics Pub Date : 2025-09-30 DOI:10.1088/1748-3190/ae0dd1

Jin Zhang, Xiaolong Chen, Bochao Cao

{"title":"基于强化学习的类鱼智能体在不同流体环境中的鲁棒导航。","authors":"Jin Zhang, Xiaolong Chen, Bochao Cao","doi":"10.1088/1748-3190/ae0dd1","DOIUrl":null,"url":null,"abstract":"Achieving robust and energy-efficient navigation in unknown fluid environments remains a key challenge for bioinspired underwater robots. In this study, we develop a reinforcement learning (RL)-based control framework that enables a fish-like swimmer to autonomously acquire effective navigation strategies within a high-fidelity computational fluid dynamics (CFD) environment. By shaping the reward function to favor energy efficiency, the agent spontaneously discovers different locomotion patterns, ranging from continuous bursting to burst-and-coast gaits, all without prior knowledge of fluid mechanics. Although the agent is trained in a quiescent fluid environment, the learned swimming policies are generalized well in various navigation tasks and remain robust under complex flow perturbations, including uniform currents and unsteady vortex wakes. In all test scenarios, the agent achieves a 100$\\%$ navigation success rate. These findings highlight the potential of integrating physics-based simulation with learning-based control strategy to advance the design of adaptive, efficient, and resilient aquatic robots inspired by biological swimmers.","PeriodicalId":55377,"journal":{"name":"Bioinspiration & Biomimetics","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning for Robust Navigation of Fish-Like Agents in Various Fluid Environments.\",\"authors\":\"Jin Zhang, Xiaolong Chen, Bochao Cao\",\"doi\":\"10.1088/1748-3190/ae0dd1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Achieving robust and energy-efficient navigation in unknown fluid environments remains a key challenge for bioinspired underwater robots. In this study, we develop a reinforcement learning (RL)-based control framework that enables a fish-like swimmer to autonomously acquire effective navigation strategies within a high-fidelity computational fluid dynamics (CFD) environment. By shaping the reward function to favor energy efficiency, the agent spontaneously discovers different locomotion patterns, ranging from continuous bursting to burst-and-coast gaits, all without prior knowledge of fluid mechanics. Although the agent is trained in a quiescent fluid environment, the learned swimming policies are generalized well in various navigation tasks and remain robust under complex flow perturbations, including uniform currents and unsteady vortex wakes. In all test scenarios, the agent achieves a 100$\\\\%$ navigation success rate. These findings highlight the potential of integrating physics-based simulation with learning-based control strategy to advance the design of adaptive, efficient, and resilient aquatic robots inspired by biological swimmers.\",\"PeriodicalId\":55377,\"journal\":{\"name\":\"Bioinspiration & Biomimetics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinspiration & Biomimetics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1088/1748-3190/ae0dd1\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinspiration & Biomimetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1088/1748-3190/ae0dd1","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

在未知流体环境中实现鲁棒和节能导航仍然是仿生水下机器人面临的关键挑战。在本研究中，我们开发了一种基于强化学习（RL）的控制框架，使鱼状游泳者能够在高保真计算流体动力学（CFD）环境中自主获取有效的导航策略。通过塑造有利于能量效率的奖励函数，智能体自发地发现了不同的运动模式，从连续爆发到爆发-海岸步态，所有这些都不需要事先了解流体力学。尽管智能体是在静止的流体环境中训练的，但学习到的游泳策略在各种导航任务中都能很好地推广，并且在复杂的流体扰动下（包括均匀流和非定常涡尾迹）仍能保持鲁棒性。在所有测试场景中，代理实现了100%的导航成功率。这些发现强调了将基于物理的模拟与基于学习的控制策略相结合的潜力，以推进受生物游泳者启发的自适应、高效和弹性水生机器人的设计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Reinforcement Learning for Robust Navigation of Fish-Like Agents in Various Fluid Environments.

Achieving robust and energy-efficient navigation in unknown fluid environments remains a key challenge for bioinspired underwater robots. In this study, we develop a reinforcement learning (RL)-based control framework that enables a fish-like swimmer to autonomously acquire effective navigation strategies within a high-fidelity computational fluid dynamics (CFD) environment. By shaping the reward function to favor energy efficiency, the agent spontaneously discovers different locomotion patterns, ranging from continuous bursting to burst-and-coast gaits, all without prior knowledge of fluid mechanics. Although the agent is trained in a quiescent fluid environment, the learned swimming policies are generalized well in various navigation tasks and remain robust under complex flow perturbations, including uniform currents and unsteady vortex wakes. In all test scenarios, the agent achieves a 100$\%$ navigation success rate. These findings highlight the potential of integrating physics-based simulation with learning-based control strategy to advance the design of adaptive, efficient, and resilient aquatic robots inspired by biological swimmers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Bioinspiration & Biomimetics 工程技术-材料科学：生物材料

CiteScore

5.90

自引率

14.70%

发文量

132

审稿时长

3 months

期刊介绍： Bioinspiration & Biomimetics publishes research involving the study and distillation of principles and functions found in biological systems that have been developed through evolution, and application of this knowledge to produce novel and exciting basic technologies and new approaches to solving scientific problems. It provides a forum for interdisciplinary research which acts as a pipeline, facilitating the two-way flow of ideas and understanding between the extensive bodies of knowledge of the different disciplines. It has two principal aims: to draw on biology to enrich engineering and to draw from engineering to enrich biology. The journal aims to include input from across all intersecting areas of both fields. In biology, this would include work in all fields from physiology to ecology, with either zoological or botanical focus. In engineering, this would include both design and practical application of biomimetic or bioinspired devices and systems. Typical areas of interest include: Systems, designs and structure Communication and navigation Cooperative behaviour Self-organizing biological systems Self-healing and self-assembly Aerial locomotion and aerospace applications of biomimetics Biomorphic surface and subsurface systems Marine dynamics: swimming and underwater dynamics Applications of novel materials Biomechanics; including movement, locomotion, fluidics Cellular behaviour Sensors and senses Biomimetic or bioinformed approaches to geological exploration.