Predictive World Models from Real-World Partial Observations

Robin Karlsson, Alexander Carballo, Keisuke Fujii, Kento Ohtani, K. Takeda
{"title":"Predictive World Models from Real-World Partial Observations","authors":"Robin Karlsson, Alexander Carballo, Keisuke Fujii, Kento Ohtani, K. Takeda","doi":"10.1109/MOST57249.2023.00024","DOIUrl":null,"url":null,"abstract":"Cognitive scientists believe adaptable intelligent agents like humans perform reasoning through learned causal mental simulations of agents and environments. The problem of learning such simulations is called predictive world modeling. Recently, reinforcement learning (RL) agents leveraging world models have achieved SOTA performance in game environments. However, understanding how to apply the world modeling approach in complex real-world environments relevant to mobile robots remains an open question. In this paper, we present a framework for learning a probabilistic predictive world model for real-world road environments. We implement the model using a hierarchical VAE (HVAE) capable of predicting a diverse set of fully observed plausible worlds from accumulated sensor observations. While prior HVAE methods require complete states as ground truth for learning, we present a novel sequential training method to allow HVAEs to learn to predict complete states from partially observed states only. We experimentally demonstrate accurate spatial structure prediction of deterministic regions achieving 96.21 IoU, and close the gap to perfect prediction by 62 % for stochastic regions using the best prediction. By extending HVAEs to cases where complete ground truth states do not exist, we facilitate continual learning of spatial prediction as a step towards realizing explainable and comprehensive predictive world models for real-world mobile robotics applications. Code is available at https://github.com/robin-karlsson0/predictive-world-models.","PeriodicalId":338621,"journal":{"name":"2023 IEEE International Conference on Mobility, Operations, Services and Technologies (MOST)","volume":"91 9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Mobility, Operations, Services and Technologies (MOST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MOST57249.2023.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Cognitive scientists believe adaptable intelligent agents like humans perform reasoning through learned causal mental simulations of agents and environments. The problem of learning such simulations is called predictive world modeling. Recently, reinforcement learning (RL) agents leveraging world models have achieved SOTA performance in game environments. However, understanding how to apply the world modeling approach in complex real-world environments relevant to mobile robots remains an open question. In this paper, we present a framework for learning a probabilistic predictive world model for real-world road environments. We implement the model using a hierarchical VAE (HVAE) capable of predicting a diverse set of fully observed plausible worlds from accumulated sensor observations. While prior HVAE methods require complete states as ground truth for learning, we present a novel sequential training method to allow HVAEs to learn to predict complete states from partially observed states only. We experimentally demonstrate accurate spatial structure prediction of deterministic regions achieving 96.21 IoU, and close the gap to perfect prediction by 62 % for stochastic regions using the best prediction. By extending HVAEs to cases where complete ground truth states do not exist, we facilitate continual learning of spatial prediction as a step towards realizing explainable and comprehensive predictive world models for real-world mobile robotics applications. Code is available at https://github.com/robin-karlsson0/predictive-world-models.
来自真实世界部分观测的预测世界模型
认知科学家认为,像人类这样适应性强的智能主体通过对主体和环境的因果心理模拟来进行推理。学习这种模拟的问题被称为预测世界建模。最近,利用世界模型的强化学习(RL)智能体在游戏环境中实现了SOTA性能。然而,了解如何将世界建模方法应用于与移动机器人相关的复杂现实环境中仍然是一个悬而未决的问题。在本文中,我们提出了一个用于学习现实世界道路环境的概率预测世界模型的框架。我们使用分层VAE (HVAE)来实现该模型,该模型能够从累积的传感器观测中预测各种完全观察到的可信世界。虽然以前的HVAE方法需要完整状态作为学习的基础真理,但我们提出了一种新的顺序训练方法,允许HVAE仅从部分观察状态中学习预测完整状态。实验证明,确定性区域的空间结构预测精度达到96.21 IoU,而随机区域的最佳预测精度与完美预测的差距缩小了62%。通过将HVAEs扩展到不存在完整基本真态的情况,我们促进了空间预测的持续学习,作为实现现实世界移动机器人应用的可解释和全面预测世界模型的一步。代码可从https://github.com/robin-karlsson0/predictive-world-models获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信