Gradient-Free De Novo Learning
Karl Friston, Thomas Parr, Conor Heins, Lancelot Da Costa, Tommaso Salvatori, Alexander Tschantz, Magnus Koudahl, Toon Van de Maele, Christopher Buckley, Tim Verbelen
Entropy 27(9), published 2025-09-22. DOI: 10.3390/e27090992
Abstract
This technical note applies active inference to the problem of learning goal-directed behaviour from scratch, namely, de novo learning. By de novo learning, we mean discovering, directly from observations, the structure and parameters of a discrete generative model for sequential policy optimisation. Concretely, our procedure grows and then reduces a model until it discovers a pullback attractor over (generalised) states; this attracting set supplies paths of least action among goal states while avoiding costly states. The implicit efficiency rests upon reframing the learning problem through the lens of the free energy principle, under which it is sufficient to learn a generative model whose dynamics feature such an attracting set. For context, we briefly relate this perspective to value-based formulations (e.g., Bellman optimality) and then apply the active inference formulation to a small arcade game to illustrate de novo structure learning and ensuing agency.
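The note itself presents no code here, but the grow-then-reduce logic it describes can be caricatured with a toy model-selection loop. The sketch below is a hypothetical illustration in plain NumPy, not the authors' implementation: it scores Markov models of increasing order on a noisy limit cycle, using a BIC-style evidence bound (accuracy minus complexity) as a crude stand-in for variational free energy. The helper names `markov_log_lik` and `evidence_bound` are invented for this sketch. Growth entertains richer dynamics; reduction prunes models whose extra complexity is not paid for by accuracy.

```python
import numpy as np

rng = np.random.default_rng(0)

def markov_log_lik(seq, order, n_symbols, eps=1e-6):
    """Maximum-likelihood log-likelihood of `seq` under a Markov model
    of the given order, estimated from transition counts."""
    if order == 0:
        counts = np.bincount(seq, minlength=n_symbols).astype(float)
        p = counts / counts.sum()
        return float(np.log(p[seq] + eps).sum())
    counts = {}
    for t in range(order, len(seq)):
        ctx = tuple(seq[t - order:t])                 # conditioning context
        counts.setdefault(ctx, np.zeros(n_symbols))[seq[t]] += 1
    ll = 0.0
    for c in counts.values():
        p = c / c.sum()
        ll += float((c * np.log(p + eps)).sum())      # eps guards log(0)
    return ll

def evidence_bound(seq, order, n_symbols):
    """BIC-style bound: accuracy (log-likelihood) minus a complexity
    penalty, a coarse proxy for the free-energy score in the paper."""
    ll = markov_log_lik(seq, order, n_symbols)
    k = (n_symbols ** order) * (n_symbols - 1)        # free parameters
    return ll - 0.5 * k * np.log(len(seq))

# Toy data: a noisy limit cycle over three symbols (0 -> 1 -> 2 -> 0),
# a minimal analogue of an attracting set over states.
T_mat = np.array([[0.05, 0.90, 0.05],
                  [0.05, 0.05, 0.90],
                  [0.90, 0.05, 0.05]])
seq = [0]
for _ in range(999):
    seq.append(rng.choice(3, p=T_mat[seq[-1]]))
seq = np.array(seq)

# Growth phase: entertain increasingly expressive models (higher order).
# Reduction phase: the complexity penalty prunes orders that do not pay
# for themselves, leaving the simplest model that captures the cycle.
scores = {order: evidence_bound(seq, order, 3) for order in range(3)}
best = max(scores, key=scores.get)
print({o: round(f, 1) for o, f in scores.items()})
print("selected Markov order:", best)
```

On this data the order-1 model wins: its recurrent cycle is a minimal analogue of the attracting set the paper seeks. The actual procedure differs in kind, operating on latent (generalised) states of a discrete generative model under active inference, rather than on raw observation histories.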
About the Journal
Entropy (ISSN 1099-4300) is an international and interdisciplinary journal of entropy and information studies that publishes reviews, regular research papers, and short notes. Our aim is to encourage scientists to publish their theoretical and experimental work in as much detail as possible; there is no restriction on the length of papers. Where computations or experiments are involved, sufficient detail must be provided so that the results can be reproduced.