{"title":"Encoding flexible gait strategies in stick insects through data-driven inverse reinforcement learning.","authors":"Yuchen Wang, Mitsuhiro Hayashibe, Dai Owaki","doi":"10.1088/1748-3190/addc26","DOIUrl":null,"url":null,"abstract":"<p><p>Stick insects exhibit remarkable adaptive walking capabilities across diverse environments; however, the mechanisms underlying their gait transitions remain poorly understood. Although reinforcement learning (RL) has been employed to generate insect-like gaits, the design of an appropriate reward function presents a challenge due to the probabilistic and continuous nature of gait transitions. This study utilized maximum entropy inverse RL to infer the reward function that governs stick insect gait selection, incorporating walking dynamic parameters-namely, velocity, direction, and acceleration-alongside antenna joint movements as state variables. By analyzing the inferred reward structures, we clarified the underlying principles that drive gait transitions and emphasized the role of sensory feedback in gait modulation. The efficacy of the inferred policies was validated through an assessment of their ability to reproduce expert trajectories, demonstrating that stick insect gaits can be learned from observable states during locomotion. Furthermore, interspecies variations and noncanonical gait patterns were examined, providing insights into the flexibility and adaptability of insect locomotion. This data-driven approach offers a biologically interpretable framework for gait modeling and contributes to bioinspired robotic design by facilitating adaptive control strategies for hexapod robots.</p>","PeriodicalId":55377,"journal":{"name":"Bioinspiration & Biomimetics","volume":" ","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinspiration & Biomimetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1088/1748-3190/addc26","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Stick insects exhibit remarkable adaptive walking capabilities across diverse environments; however, the mechanisms underlying their gait transitions remain poorly understood. Although reinforcement learning (RL) has been employed to generate insect-like gaits, the design of an appropriate reward function presents a challenge due to the probabilistic and continuous nature of gait transitions. This study utilized maximum entropy inverse RL to infer the reward function that governs stick insect gait selection, incorporating walking dynamic parameters-namely, velocity, direction, and acceleration-alongside antenna joint movements as state variables. By analyzing the inferred reward structures, we clarified the underlying principles that drive gait transitions and emphasized the role of sensory feedback in gait modulation. The efficacy of the inferred policies was validated through an assessment of their ability to reproduce expert trajectories, demonstrating that stick insect gaits can be learned from observable states during locomotion. Furthermore, interspecies variations and noncanonical gait patterns were examined, providing insights into the flexibility and adaptability of insect locomotion. This data-driven approach offers a biologically interpretable framework for gait modeling and contributes to bioinspired robotic design by facilitating adaptive control strategies for hexapod robots.
期刊介绍:
Bioinspiration & Biomimetics publishes research involving the study and distillation of principles and functions found in biological systems that have been developed through evolution, and application of this knowledge to produce novel and exciting basic technologies and new approaches to solving scientific problems. It provides a forum for interdisciplinary research which acts as a pipeline, facilitating the two-way flow of ideas and understanding between the extensive bodies of knowledge of the different disciplines. It has two principal aims: to draw on biology to enrich engineering and to draw from engineering to enrich biology.
The journal aims to include input from across all intersecting areas of both fields. In biology, this would include work in all fields from physiology to ecology, with either zoological or botanical focus. In engineering, this would include both design and practical application of biomimetic or bioinspired devices and systems. Typical areas of interest include:
Systems, designs and structure
Communication and navigation
Cooperative behaviour
Self-organizing biological systems
Self-healing and self-assembly
Aerial locomotion and aerospace applications of biomimetics
Biomorphic surface and subsurface systems
Marine dynamics: swimming and underwater dynamics
Applications of novel materials
Biomechanics; including movement, locomotion, fluidics
Cellular behaviour
Sensors and senses
Biomimetic or bioinformed approaches to geological exploration.