{"title":"潜在技能规划的分层蒙特卡罗树搜索","authors":"Yue Pei","doi":"10.1145/3590003.3590005","DOIUrl":null,"url":null,"abstract":"Monte Carlo Tree Search (MCTS) continues to confront the issue of exponential complexity growth in certain tasks when the planning horizon is excessively long, causing the trajectory’s past to grow exponentially. Our study presents Hierarchical MCTS Latent Skill Planner, an algorithm based on skill discovery that automatically identifies skills based on intrinsic rewards and integrates them with MCTS, enabling efficient decision-making at a higher level. In the grid world maze domain, we found that latent skill search outperformed the standard MCTS approach that do not contain skills in terms of efficiency and performance.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hierarchical Monte Carlo Tree Search for Latent Skill Planning\",\"authors\":\"Yue Pei\",\"doi\":\"10.1145/3590003.3590005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Monte Carlo Tree Search (MCTS) continues to confront the issue of exponential complexity growth in certain tasks when the planning horizon is excessively long, causing the trajectory’s past to grow exponentially. Our study presents Hierarchical MCTS Latent Skill Planner, an algorithm based on skill discovery that automatically identifies skills based on intrinsic rewards and integrates them with MCTS, enabling efficient decision-making at a higher level. In the grid world maze domain, we found that latent skill search outperformed the standard MCTS approach that do not contain skills in terms of efficiency and performance.\",\"PeriodicalId\":340225,\"journal\":{\"name\":\"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3590003.3590005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3590003.3590005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hierarchical Monte Carlo Tree Search for Latent Skill Planning
Monte Carlo Tree Search (MCTS) continues to confront the issue of exponential complexity growth in certain tasks when the planning horizon is excessively long, causing the trajectory’s past to grow exponentially. Our study presents Hierarchical MCTS Latent Skill Planner, an algorithm based on skill discovery that automatically identifies skills based on intrinsic rewards and integrates them with MCTS, enabling efficient decision-making at a higher level. In the grid world maze domain, we found that latent skill search outperformed the standard MCTS approach that do not contain skills in terms of efficiency and performance.