{"title":"基于强化学习的类鱼智能体在不同流体环境中的鲁棒导航。","authors":"Jin Zhang, Xiaolong Chen, Bochao Cao","doi":"10.1088/1748-3190/ae0dd1","DOIUrl":null,"url":null,"abstract":"<p><p>Achieving robust and energy-efficient navigation in unknown fluid environments remains a key challenge for bioinspired underwater robots. In this study, we develop a reinforcement learning (RL)-based control framework that enables a fish-like swimmer to autonomously acquire effective navigation strategies within a high-fidelity computational fluid dynamics (CFD) environment. By shaping the reward function to favor energy efficiency, the agent spontaneously discovers different locomotion patterns, ranging from continuous bursting to burst-and-coast gaits, all without prior knowledge of fluid mechanics. Although the agent is trained in a quiescent fluid environment, the learned swimming policies are generalized well in various navigation tasks and remain robust under complex flow perturbations, including uniform currents and unsteady vortex wakes. In all test scenarios, the agent achieves a 100$\\%$ navigation success rate. These findings highlight the potential of integrating physics-based simulation with learning-based control strategy to advance the design of adaptive, efficient, and resilient aquatic robots inspired by biological swimmers.</p>","PeriodicalId":55377,"journal":{"name":"Bioinspiration & Biomimetics","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning for Robust Navigation of Fish-Like Agents in Various Fluid Environments.\",\"authors\":\"Jin Zhang, Xiaolong Chen, Bochao Cao\",\"doi\":\"10.1088/1748-3190/ae0dd1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Achieving robust and energy-efficient navigation in unknown fluid environments remains a key challenge for bioinspired underwater robots. In this study, we develop a reinforcement learning (RL)-based control framework that enables a fish-like swimmer to autonomously acquire effective navigation strategies within a high-fidelity computational fluid dynamics (CFD) environment. By shaping the reward function to favor energy efficiency, the agent spontaneously discovers different locomotion patterns, ranging from continuous bursting to burst-and-coast gaits, all without prior knowledge of fluid mechanics. Although the agent is trained in a quiescent fluid environment, the learned swimming policies are generalized well in various navigation tasks and remain robust under complex flow perturbations, including uniform currents and unsteady vortex wakes. In all test scenarios, the agent achieves a 100$\\\\%$ navigation success rate. These findings highlight the potential of integrating physics-based simulation with learning-based control strategy to advance the design of adaptive, efficient, and resilient aquatic robots inspired by biological swimmers.</p>\",\"PeriodicalId\":55377,\"journal\":{\"name\":\"Bioinspiration & Biomimetics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinspiration & Biomimetics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1088/1748-3190/ae0dd1\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinspiration & Biomimetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1088/1748-3190/ae0dd1","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Reinforcement Learning for Robust Navigation of Fish-Like Agents in Various Fluid Environments.
Achieving robust and energy-efficient navigation in unknown fluid environments remains a key challenge for bioinspired underwater robots. In this study, we develop a reinforcement learning (RL)-based control framework that enables a fish-like swimmer to autonomously acquire effective navigation strategies within a high-fidelity computational fluid dynamics (CFD) environment. By shaping the reward function to favor energy efficiency, the agent spontaneously discovers different locomotion patterns, ranging from continuous bursting to burst-and-coast gaits, all without prior knowledge of fluid mechanics. Although the agent is trained in a quiescent fluid environment, the learned swimming policies are generalized well in various navigation tasks and remain robust under complex flow perturbations, including uniform currents and unsteady vortex wakes. In all test scenarios, the agent achieves a 100$\%$ navigation success rate. These findings highlight the potential of integrating physics-based simulation with learning-based control strategy to advance the design of adaptive, efficient, and resilient aquatic robots inspired by biological swimmers.
期刊介绍:
Bioinspiration & Biomimetics publishes research involving the study and distillation of principles and functions found in biological systems that have been developed through evolution, and application of this knowledge to produce novel and exciting basic technologies and new approaches to solving scientific problems. It provides a forum for interdisciplinary research which acts as a pipeline, facilitating the two-way flow of ideas and understanding between the extensive bodies of knowledge of the different disciplines. It has two principal aims: to draw on biology to enrich engineering and to draw from engineering to enrich biology.
The journal aims to include input from across all intersecting areas of both fields. In biology, this would include work in all fields from physiology to ecology, with either zoological or botanical focus. In engineering, this would include both design and practical application of biomimetic or bioinspired devices and systems. Typical areas of interest include:
Systems, designs and structure
Communication and navigation
Cooperative behaviour
Self-organizing biological systems
Self-healing and self-assembly
Aerial locomotion and aerospace applications of biomimetics
Biomorphic surface and subsurface systems
Marine dynamics: swimming and underwater dynamics
Applications of novel materials
Biomechanics; including movement, locomotion, fluidics
Cellular behaviour
Sensors and senses
Biomimetic or bioinformed approaches to geological exploration.