{"title":"Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus via Model-Based RL","authors":"Jihwan Lee, Woochang Sim, Sejin Kim, Sundong Kim","doi":"arxiv-2408.14855","DOIUrl":null,"url":null,"abstract":"This paper demonstrates that model-based reinforcement learning (model-based\nRL) is a suitable approach for the task of analogical reasoning. We hypothesize\nthat model-based RL can solve analogical reasoning tasks more efficiently\nthrough the creation of internal models. To test this, we compared DreamerV3, a\nmodel-based RL method, with Proximal Policy Optimization, a model-free RL\nmethod, on the Abstraction and Reasoning Corpus (ARC) tasks. Our results\nindicate that model-based RL not only outperforms model-free RL in learning and\ngeneralizing from single tasks but also shows significant advantages in\nreasoning across similar tasks.","PeriodicalId":501208,"journal":{"name":"arXiv - CS - Logic in Computer Science","volume":"28 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Logic in Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.14855","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper demonstrates that model-based reinforcement learning (model-based
RL) is a suitable approach for the task of analogical reasoning. We hypothesize
that model-based RL can solve analogical reasoning tasks more efficiently
through the creation of internal models. To test this, we compared DreamerV3, a
model-based RL method, with Proximal Policy Optimization, a model-free RL
method, on the Abstraction and Reasoning Corpus (ARC) tasks. Our results
indicate that model-based RL not only outperforms model-free RL in learning and
generalizing from single tasks but also shows significant advantages in
reasoning across similar tasks.