Remember to be Curious: Episodic Context and Persistent Worlds for 3D Exploration

· 2026 · cs.LG · arXiv 2605.22814

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

Exploration is a prerequisite for learning useful behaviors in sparse-reward, long-horizon tasks, particularly within 3D environments. Curiosity-driven reinforcement learning addresses this via intrinsic rewards derived from the mismatch between the agent's predictive model of the world and reality. However, translating this intrinsic motivation to complex, photorealistic environments remains difficult, as agents can become trapped in local loops and receive fresh rewards for revisiting forgotten states. In this work, we demonstrate that this failure stems from a lack of spatial persistence and episodic context. We show that effective curiosity requires a model of the world that is persistent and continuously updated, paired with an agent that maintains an episodic trajectory history to navigate toward novel regions. We achieve this using an online 3D reconstruction as a persistent model of the world, while the agent policy is parameterized as a sequence model over RGB observations to maintain episodic context. This design enables effective exploration during training while allowing the agent to navigate using solely RGB frames at deployment. Trained purely via curiosity on HM3D, our agent outperforms RL-based active mapping baselines and generalizes zero-shot to Gibson and AI-generated worlds. Our end-to-end policy enables efficient adaptation to downstream tasks, such as apple picking and image-goal navigation, outperforming from-scratch baselines. Please see video results at https://recuriosity.github.io/.
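
The failure mode described in the abstract, where an agent earns fresh rewards for revisiting forgotten states, can be illustrated with a toy sketch. This is not the paper's method: the grid cells, the `rollout_rewards` helper, and the `memory_size` parameter are hypothetical stand-ins, with a set of visited cells playing the role of the persistent 3D reconstruction and a bounded buffer playing the role of a model that forgets.

```python
from collections import deque


def rollout_rewards(trajectory, memory_size=None):
    """Return per-step novelty rewards for a sequence of visited cells.

    memory_size=None keeps every visited cell (a persistent map);
    an integer keeps only the most recent cells (a forgetful model).
    Both are illustrative assumptions, not the paper's reward.
    """
    seen = deque(maxlen=memory_size)  # maxlen=None means unbounded
    rewards = []
    for cell in trajectory:
        # Reward 1.0 only when the cell is absent from what is remembered.
        rewards.append(0.0 if cell in seen else 1.0)
        seen.append(cell)
    return rewards


# An agent circling the same four-cell loop three times.
loop = [(0, 0), (0, 1), (1, 1), (1, 0)] * 3

persistent = sum(rollout_rewards(loop))                # 4.0
forgetful = sum(rollout_rewards(loop, memory_size=2))  # 12.0
```

With a persistent record, the loop pays out only on the first lap, so the agent has no incentive to keep circling. With a two-cell memory, every step looks novel again and the loop is rewarded indefinitely.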

representative citing papers

Joint Agent Memory and Exploration Learning via Novelty Signals

cs.AI · 2026-06-01 · unverdicted · novelty 6.0 · ref 47

JAMEL jointly learns agent memory and exploration via novelty-driven interaction, generalizing to unseen environments while outperforming open baselines and reducing token use.
