A. Rutherford, Paul Duckworth, N. Hawes, Bruno Lacerda
{"title":"不确定环境下快速探索随机马尔可夫决策过程的运动规划","authors":"A. Rutherford, Paul Duckworth, N. Hawes, Bruno Lacerda","doi":"10.1109/ecmr50962.2021.9568849","DOIUrl":null,"url":null,"abstract":"We propose rapidly-exploring random Markov decision processes (RRMDPs), a novel sampling-based motion planning approach for situations where the environment parameters are not fully known a priori, but a prior distribution over such parameters is available. Our algorithm combines ideas from established motion planning algorithms to achieve motion policies that are able to robustly drive the robot to its goal in the presence of uncertain action outcomes. We evaluate RRMDP in two domains, showing that it can synthesise motion policies that are more robust than the motion plans obtained by particle rapidly-exploring random trees (pRRT), a widely used algorithm for motion planning under uncertainty which RRMDP builds upon.","PeriodicalId":200521,"journal":{"name":"2021 European Conference on Mobile Robots (ECMR)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Motion Planning in Uncertain Environments with Rapidly-Exploring Random Markov Decision Processes\",\"authors\":\"A. Rutherford, Paul Duckworth, N. Hawes, Bruno Lacerda\",\"doi\":\"10.1109/ecmr50962.2021.9568849\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose rapidly-exploring random Markov decision processes (RRMDPs), a novel sampling-based motion planning approach for situations where the environment parameters are not fully known a priori, but a prior distribution over such parameters is available. Our algorithm combines ideas from established motion planning algorithms to achieve motion policies that are able to robustly drive the robot to its goal in the presence of uncertain action outcomes. We evaluate RRMDP in two domains, showing that it can synthesise motion policies that are more robust than the motion plans obtained by particle rapidly-exploring random trees (pRRT), a widely used algorithm for motion planning under uncertainty which RRMDP builds upon.\",\"PeriodicalId\":200521,\"journal\":{\"name\":\"2021 European Conference on Mobile Robots (ECMR)\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 European Conference on Mobile Robots (ECMR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ecmr50962.2021.9568849\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 European Conference on Mobile Robots (ECMR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ecmr50962.2021.9568849","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Motion Planning in Uncertain Environments with Rapidly-Exploring Random Markov Decision Processes
We propose rapidly-exploring random Markov decision processes (RRMDPs), a novel sampling-based motion planning approach for situations where the environment parameters are not fully known a priori, but a prior distribution over such parameters is available. Our algorithm combines ideas from established motion planning algorithms to achieve motion policies that are able to robustly drive the robot to its goal in the presence of uncertain action outcomes. We evaluate RRMDP in two domains, showing that it can synthesise motion policies that are more robust than the motion plans obtained by particle rapidly-exploring random trees (pRRT), a widely used algorithm for motion planning under uncertainty which RRMDP builds upon.