{"title":"Learning the search heuristic for combined task and motion planning","authors":"Vektor Dewanto","doi":"10.1109/ICACSIS.2015.7415160","DOIUrl":null,"url":null,"abstract":"Autonomous robots have to plan two intricately dependent levels: task and motion. One promising approach is to plan task and motion simultaneously, yielding a sequence of high level actions that is guaranteed to have valid motion plans. In this paper, we present our work on such planning system whose backbone is the ability to estimate the cost of action sequences. This cost essentially encodes information about motion feasibility and optimality criteria. Concretely, the cost prediction serves as the heuristic for search over a task motion multigraph. The experiment results show that the proposed approach makes the planning progressively more efficient as well as ε-optimal. It means that the wasted computations are more and more reduced over planning attempts and that the complete plans found are guaranteed to have costs no more than a factor of (1 +ε) greater than the optimal. This suggests that the heuristic along with its learning formulation are justifiable and that the designed feature vector is sufficient for learning. In addition, we found that online learning during search offers better utility than the offline.","PeriodicalId":325539,"journal":{"name":"2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS)","volume":"298 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACSIS.2015.7415160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Autonomous robots have to plan two intricately dependent levels: task and motion. One promising approach is to plan task and motion simultaneously, yielding a sequence of high level actions that is guaranteed to have valid motion plans. In this paper, we present our work on such planning system whose backbone is the ability to estimate the cost of action sequences. This cost essentially encodes information about motion feasibility and optimality criteria. Concretely, the cost prediction serves as the heuristic for search over a task motion multigraph. The experiment results show that the proposed approach makes the planning progressively more efficient as well as ε-optimal. It means that the wasted computations are more and more reduced over planning attempts and that the complete plans found are guaranteed to have costs no more than a factor of (1 +ε) greater than the optimal. This suggests that the heuristic along with its learning formulation are justifiable and that the designed feature vector is sufficient for learning. In addition, we found that online learning during search offers better utility than the offline.