pith. sign in

Neural policy com- position from free energy minimization

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it
abstract

The ability to flexibly compose previously acquired skills to execute intelligent behaviors is a hallmark of natural intelligence. Such compositional flexibility is often attributed to context-dependent gating mechanisms that determine how multiple policies or behavioral primitives are combined. Yet, despite remarkable efforts, the normative objective from which such gating rules should arise, and the neural computations capable of implementing them, remain unclear. Existing approaches typically rely on prespecified design choices for the gating rules, and remain tied to specific architectures, learning paradigms, or datasets. Here, we introduce a normative framework in which policy composition emerges from the minimization of a variational free energy, providing a principled and broadly applicable objective for gating. Based on this framework, we derive a continuous-time gradient flow whose trajectories are guaranteed to converge, with explicit rate, to the optimal composition of primitives. We further show that this dynamics admits a mechanistic neural implementation as a soft-competitive recurrent circuit with context-sensitive local interactions. We evaluate the model on emerging flocking behaviors in multi-agent systems, human decision-making in bandit tasks, and control benchmarks in layered architectures. Across these settings, the model provides interpretable mechanistic accounts of policy composition, reproduces key behavioral signatures, yields insights into data, and matches or outperforms established models.

citation-role summary

background 1

citation-polarity summary

years

2026 2

verdicts

UNVERDICTED 2

roles

background 1

polarities

background 1

representative citing papers

Predictive Coding with Bayesian Priors via Proximal Gradients

eess.SY · 2026-06-06 · unverdicted · novelty 7.0

Predictive coding equals proximal gradient descent on MAP problems, with priors setting nonlinearities via proximal operators and yielding leaky firing-rate networks plus hierarchical MRFs.

citing papers explorer

Showing 2 of 2 citing papers.