Stable Deep Reinforcement Learning via Isotropic Gaussian Representations

Aaron Courville; Ali Saheb Pasand; Johan Obando-Ceron; Pablo Samuel Castro; Pouya Bashivan

arxiv: 2602.19373 · v3 · pith:NPUXPVLGnew · submitted 2026-02-22 · 💻 cs.LG · cs.AI


Ali Saheb Pasand, Johan Obando-Ceron, Aaron Courville, Pouya Bashivan, Pablo Samuel Castro

classification: cs.LG, cs.AI

keywords: gaussian, isotropic, learning, stable, training, under, deep, non-stationarity


Deep reinforcement learning systems often suffer from unstable training dynamics due to non-stationarity, where learning objectives and data distributions evolve over time. We show that under non-stationary targets, isotropic Gaussian embeddings are provably advantageous. In particular, they induce stable tracking of time-varying targets for linear readouts, achieve maximal entropy under a fixed variance budget, and encourage a balanced use of all representational dimensions--all of which enable agents to be more adaptive and stable. Building on this insight, we propose the use of Sketched Isotropic Gaussian Regularization for shaping representations toward an isotropic Gaussian distribution during training. We demonstrate empirically, over a variety of domains, that this simple and computationally inexpensive method improves performance under non-stationarity while reducing representation collapse, neuron dormancy, and training instability.
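The abstract does not spell out the form of the regularizer, so the following is only a minimal sketch of the general idea of "sketching": project the embeddings onto a few random unit directions and penalize each one-dimensional projection for deviating from a standard normal. The function name `sketched_isotropy_penalty`, the moment-matching criterion (mean, variance, skewness, excess kurtosis), and the parameter `num_directions` are illustrative assumptions, not the paper's formulation, which may use a different statistical criterion.

```python
import numpy as np


def sketched_isotropy_penalty(z, num_directions=64, rng=None):
    """Hypothetical stand-in for a sketched isotropic-Gaussian penalty.

    z: array of shape (batch, dim) holding a batch of embeddings.
    Returns a non-negative float that is small when every random 1-D
    projection of z looks like a standard normal in its first four moments.
    """
    rng = np.random.default_rng() if rng is None else rng
    _, dim = z.shape

    # The "sketch": random directions, normalized to unit length.
    directions = rng.standard_normal((dim, num_directions))
    directions /= np.linalg.norm(directions, axis=0, keepdims=True)

    proj = z @ directions                      # (batch, num_directions)
    mean = proj.mean(axis=0)
    centered = proj - mean
    var = (centered ** 2).mean(axis=0)
    std = np.sqrt(var + 1e-8)                  # epsilon avoids division by zero
    skew = ((centered / std) ** 3).mean(axis=0)
    excess_kurt = ((centered / std) ** 4).mean(axis=0) - 3.0

    # Squared deviation from the N(0, 1) moments, averaged over directions.
    per_direction = mean ** 2 + (var - 1.0) ** 2 + skew ** 2 + excess_kurt ** 2
    return float(per_direction.mean())


# Compare an isotropic Gaussian batch with a collapsed (rank-1) batch in
# which every dimension copies the same single feature.
rng = np.random.default_rng(0)
gaussian = rng.standard_normal((4096, 32))
collapsed = gaussian[:, :1] * np.ones((1, 32))

penalty_gaussian = sketched_isotropy_penalty(gaussian, rng=rng)
penalty_collapsed = sketched_isotropy_penalty(collapsed, rng=rng)
print(penalty_gaussian, penalty_collapsed)
```

Under these assumptions the collapsed batch should receive the larger penalty, because its projected variance depends strongly on the direction chosen. In an actual training loop a term of this kind would be written in an autodiff framework and added to the agent's loss with a weighting coefficient; NumPy is used here only to keep the sketch self-contained.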
