Derives a log-determinant CIG reward from ensemble disagreement kernel with Cholesky factorization to produce causal per-step exploration signals that scale to high-dimensional spaces.
What is Intrinsic Motivation? A Typol- ogy of Computational Approaches
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 2verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
QDTraj uses Quality-Diversity algorithms with sparse rewards to produce at least five times more diverse high-performing trajectories for articulated object manipulation than compared methods, validated across 30 objects with hundreds of trajectories per task.
citing papers explorer
-
CIG: Exploration via Conditional Information Gain
Derives a log-determinant CIG reward from ensemble disagreement kernel with Cholesky factorization to produce causal per-step exploration signals that scale to high-dimensional spaces.
-
QDTraj: Exploration of Diverse Trajectory Primitives for Articulated Objects Robotic Manipulation
QDTraj uses Quality-Diversity algorithms with sparse rewards to produce at least five times more diverse high-performing trajectories for articulated object manipulation than compared methods, validated across 30 objects with hundreds of trajectories per task.