arXiv preprint arXiv:2012.03548 , year=

Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch · 2012 · arXiv 2012.03548

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Decision Transformer: Reinforcement Learning via Sequence Modeling

cs.LG · 2021-06-02 · accept · novelty 8.0

Decision Transformer casts RL as autoregressive sequence modeling conditioned on desired returns, past states and actions, matching or exceeding offline RL baselines on Atari, Gym and Key-to-Door tasks.

Learning to Theorize the World from Observation

cs.LG · 2026-05-05 · unverdicted · novelty 6.0

NEO induces compositional latent programs as world theories from observations and executes them to enable explanation-driven generalization.

citing papers explorer

Showing 2 of 2 citing papers.

Decision Transformer: Reinforcement Learning via Sequence Modeling cs.LG · 2021-06-02 · accept · none · ref 39
Decision Transformer casts RL as autoregressive sequence modeling conditioned on desired returns, past states and actions, matching or exceeding offline RL baselines on Atari, Gym and Key-to-Door tasks.
Learning to Theorize the World from Observation cs.LG · 2026-05-05 · unverdicted · none · ref 79
NEO induces compositional latent programs as world theories from observations and executes them to enable explanation-driven generalization.

arXiv preprint arXiv:2012.03548 , year=

fields

years

verdicts

representative citing papers

citing papers explorer