Mengyun Wang;Shengde Jia;Yifeng Niu;Yunzhuo Liu;Chao Yan;Chang Wang
{"title":"Agile Flights Through a Moving Narrow Gap for Quadrotors Using Adaptive Curriculum Learning","authors":"Mengyun Wang;Shengde Jia;Yifeng Niu;Yunzhuo Liu;Chao Yan;Chang Wang","doi":"10.1109/TIV.2024.3391384","DOIUrl":null,"url":null,"abstract":"Fast and agile flying through a gap is challenging for a quadrotor if the gap is narrow, tilted, and moving. Due to the strict time-variant position and attitude constraints, collision-free traversal trajectories under the under-actuated quadrotor dynamics are sparse and difficult to solve. To achieve this challenging task in the real world, we propose a Gap-Traversing Adaptive Curriculum Learning (GTACL) approach, which consists of adaptive curriculum reinforcement learning (ACRL) and online thrust updating (OTU). First, ACRL is introduced to improve sample efficiency, and the policy training is accelerated by designing a curriculum adapted to the agent's capability. Second, OTU is proposed to map the acceleration commands to low-level throttle signals by estimating the thrust model during flight, which reduces the intermediate control variables and helps sim2real transfer. We use the prioritized experience replay mechanism that considers both policy update contribution and data acquisition time to adapt to the changing tasks. GTACL is trained entirely in simulation and can be transferred to other quadrotors with different dynamics. Furthermore, we achieve zero-shot transfer to the real-world quadrotor without fine-tuning. The average success rates of 98% and 87.8% in simulation and real-world experiments for different task conditions demonstrate the robustness of the proposed approach. Comparative results with traditional and related learning-based approaches show the advantages of GTACL in terms of learning efficiency, control performance, and generalization.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 11","pages":"6936-6949"},"PeriodicalIF":14.0000,"publicationDate":"2024-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Vehicles","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10505852/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Fast and agile flying through a gap is challenging for a quadrotor if the gap is narrow, tilted, and moving. Due to the strict time-variant position and attitude constraints, collision-free traversal trajectories under the under-actuated quadrotor dynamics are sparse and difficult to solve. To achieve this challenging task in the real world, we propose a Gap-Traversing Adaptive Curriculum Learning (GTACL) approach, which consists of adaptive curriculum reinforcement learning (ACRL) and online thrust updating (OTU). First, ACRL is introduced to improve sample efficiency, and the policy training is accelerated by designing a curriculum adapted to the agent's capability. Second, OTU is proposed to map the acceleration commands to low-level throttle signals by estimating the thrust model during flight, which reduces the intermediate control variables and helps sim2real transfer. We use the prioritized experience replay mechanism that considers both policy update contribution and data acquisition time to adapt to the changing tasks. GTACL is trained entirely in simulation and can be transferred to other quadrotors with different dynamics. Furthermore, we achieve zero-shot transfer to the real-world quadrotor without fine-tuning. The average success rates of 98% and 87.8% in simulation and real-world experiments for different task conditions demonstrate the robustness of the proposed approach. Comparative results with traditional and related learning-based approaches show the advantages of GTACL in terms of learning efficiency, control performance, and generalization.
期刊介绍:
The IEEE Transactions on Intelligent Vehicles (T-IV) is a premier platform for publishing peer-reviewed articles that present innovative research concepts, application results, significant theoretical findings, and application case studies in the field of intelligent vehicles. With a particular emphasis on automated vehicles within roadway environments, T-IV aims to raise awareness of pressing research and application challenges.
Our focus is on providing critical information to the intelligent vehicle community, serving as a dissemination vehicle for IEEE ITS Society members and others interested in learning about the state-of-the-art developments and progress in research and applications related to intelligent vehicles. Join us in advancing knowledge and innovation in this dynamic field.