Kota Kondo, Andrea Tagliabue, Xiaoyi Cai, Claudius Tewari, Olivia Garcia, Marcos Espitia-Alvarez, Jonathan P. How
{"title":"CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning","authors":"Kota Kondo, Andrea Tagliabue, Xiaoyi Cai, Claudius Tewari, Olivia Garcia, Marcos Espitia-Alvarez, Jonathan P. How","doi":"arxiv-2405.01758","DOIUrl":null,"url":null,"abstract":"Traditional optimization-based planners, while effective, suffer from high\ncomputational costs, resulting in slow trajectory generation. A successful\nstrategy to reduce computation time involves using Imitation Learning (IL) to\ndevelop fast neural network (NN) policies from those planners, which are\ntreated as expert demonstrators. Although the resulting NN policies are\neffective at quickly generating trajectories similar to those from the expert,\n(1) their output does not explicitly account for dynamic feasibility, and (2)\nthe policies do not accommodate changes in the constraints different from those\nused during training. To overcome these limitations, we propose Constraint-Guided Diffusion (CGD),\na novel IL-based approach to trajectory planning. CGD leverages a hybrid\nlearning/online optimization scheme that combines diffusion policies with a\nsurrogate efficient optimization problem, enabling the generation of\ncollision-free, dynamically feasible trajectories. The key ideas of CGD include\ndividing the original challenging optimization problem solved by the expert\ninto two more manageable sub-problems: (a) efficiently finding collision-free\npaths, and (b) determining a dynamically-feasible time-parametrization for\nthose paths to obtain a trajectory. Compared to conventional neural network\narchitectures, we demonstrate through numerical evaluations significant\nimprovements in performance and dynamic feasibility under scenarios with new\nconstraints never encountered during training.","PeriodicalId":501062,"journal":{"name":"arXiv - CS - Systems and Control","volume":"35 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Systems and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.01758","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Traditional optimization-based planners, while effective, suffer from high
computational costs, resulting in slow trajectory generation. A successful
strategy to reduce computation time involves using Imitation Learning (IL) to
develop fast neural network (NN) policies from those planners, which are
treated as expert demonstrators. Although the resulting NN policies are
effective at quickly generating trajectories similar to those from the expert,
(1) their output does not explicitly account for dynamic feasibility, and (2)
the policies do not accommodate changes in the constraints different from those
used during training. To overcome these limitations, we propose Constraint-Guided Diffusion (CGD),
a novel IL-based approach to trajectory planning. CGD leverages a hybrid
learning/online optimization scheme that combines diffusion policies with a
surrogate efficient optimization problem, enabling the generation of
collision-free, dynamically feasible trajectories. The key ideas of CGD include
dividing the original challenging optimization problem solved by the expert
into two more manageable sub-problems: (a) efficiently finding collision-free
paths, and (b) determining a dynamically-feasible time-parametrization for
those paths to obtain a trajectory. Compared to conventional neural network
architectures, we demonstrate through numerical evaluations significant
improvements in performance and dynamic feasibility under scenarios with new
constraints never encountered during training.