利用深度强化学习在复杂的珊瑚礁环境中学习避障和捕食。

IF 3.1 3区计算机科学 Q1 ENGINEERING, MULTIDISCIPLINARY

Bioinspiration & Biomimetics Pub Date : 2024-08-07 DOI:10.1088/1748-3190/ad6544

Ji Hou, Changling He, Tao Li, Chunze Zhang, Qin Zhou

{"title":"利用深度强化学习在复杂的珊瑚礁环境中学习避障和捕食。","authors":"Ji Hou, Changling He, Tao Li, Chunze Zhang, Qin Zhou","doi":"10.1088/1748-3190/ad6544","DOIUrl":null,"url":null,"abstract":"The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.","PeriodicalId":55377,"journal":{"name":"Bioinspiration & Biomimetics","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning.\",\"authors\":\"Ji Hou, Changling He, Tao Li, Chunze Zhang, Qin Zhou\",\"doi\":\"10.1088/1748-3190/ad6544\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.\",\"PeriodicalId\":55377,\"journal\":{\"name\":\"Bioinspiration & Biomimetics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2024-08-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinspiration & Biomimetics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1088/1748-3190/ad6544\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinspiration & Biomimetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1088/1748-3190/ad6544","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

珊瑚礁生态系统作为游泳能力有限的鱼类的栖息地发挥着至关重要的作用，它不仅是鱼类的避难所和食物来源，还影响着鱼类的行为倾向。在复杂的水流中，鱼类通过复杂的机制在珊瑚礁环境中巧妙地导航移动目标，同时躲避障碍物并保持稳定的姿态，这在鱼类行为学、生态学和生物仿生学领域一直是一个具有挑战性的重要课题。该研究采用了一个综合模拟框架，将深度强化学习算法（DRL）与高精度流固耦合数值方法--浸没边界晶格玻尔兹曼法（lB-LBM）相结合，研究复杂环境中的鱼类捕食问题。软行为批判（SAC）算法用于提高智能鱼的随机探索能力，以应对现实世界场景中固有的多目标稀疏奖励挑战。此外，还开发了一种针对其行动目的的奖励塑造方法，能够有效捕捉结果和趋势特征。本文通过两个案例研究展示了该方法的收敛性和鲁棒性优势：一个案例研究了鱼类在静态流场中捕捉随机移动目标的情况，另一个案例研究了鱼类在珊瑚礁环境中逆流觅食捕捉漂流食物的情况。本文全面分析了各种奖励类型对复杂环境中智能鱼类决策过程的影响和意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning.

The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Bioinspiration & Biomimetics 工程技术-材料科学：生物材料

CiteScore

5.90

自引率

14.70%

发文量

132

审稿时长

3 months

期刊介绍： Bioinspiration & Biomimetics publishes research involving the study and distillation of principles and functions found in biological systems that have been developed through evolution, and application of this knowledge to produce novel and exciting basic technologies and new approaches to solving scientific problems. It provides a forum for interdisciplinary research which acts as a pipeline, facilitating the two-way flow of ideas and understanding between the extensive bodies of knowledge of the different disciplines. It has two principal aims: to draw on biology to enrich engineering and to draw from engineering to enrich biology. The journal aims to include input from across all intersecting areas of both fields. In biology, this would include work in all fields from physiology to ecology, with either zoological or botanical focus. In engineering, this would include both design and practical application of biomimetic or bioinspired devices and systems. Typical areas of interest include: Systems, designs and structure Communication and navigation Cooperative behaviour Self-organizing biological systems Self-healing and self-assembly Aerial locomotion and aerospace applications of biomimetics Biomorphic surface and subsurface systems Marine dynamics: swimming and underwater dynamics Applications of novel materials Biomechanics; including movement, locomotion, fluidics Cellular behaviour Sensors and senses Biomimetic or bioinformed approaches to geological exploration.