International Conference on Learning Representations , year=

Mastering Atari with Discrete World Models , author=

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

browse 5 citing papers

representative citing papers

Counterfactual identifiability beyond global monotonicity: non-monotone triangular structural causal models

cs.LG · 2026-05-06 · unverdicted · novelty 7.0

Non-monotone triangular SCMs with mechanism-wise invertibility and context-independent inverse transport are equivalent to exogenous isomorphism and achieve complete counterfactual identifiability, with supporting experiments on synthetic data and MuJoCo tasks.

TD-MPC2: Scalable, Robust World Models for Continuous Control

cs.LG · 2023-10-25 · conditional · novelty 6.0

TD-MPC2 scales an implicit world-model RL method to a 317M-parameter agent that masters 80 tasks across four domains with a single hyperparameter configuration.

Is Conditional Generative Modeling all you need for Decision-Making?

cs.LG · 2022-11-28 · unverdicted · novelty 6.0

Return-conditional diffusion models for policies outperform offline RL on benchmarks by circumventing dynamic programming and enable constraint or skill composition.

PROWL: Prioritized Regret-Driven Optimization for World Model Learning

cs.LG · 2026-05-11 · unverdicted · novelty 5.0

PROWL introduces a KL-constrained adversarial curriculum and prioritized adversarial trajectory buffer to actively discover and correct rare failure modes in action-conditioned video world models.

LASER: Learning Active Sensing for Continuum Field Reconstruction

cs.LG · 2026-04-21

citing papers explorer

Showing 5 of 5 citing papers.

Counterfactual identifiability beyond global monotonicity: non-monotone triangular structural causal models cs.LG · 2026-05-06 · unverdicted · none · ref 50
Non-monotone triangular SCMs with mechanism-wise invertibility and context-independent inverse transport are equivalent to exogenous isomorphism and achieve complete counterfactual identifiability, with supporting experiments on synthetic data and MuJoCo tasks.
TD-MPC2: Scalable, Robust World Models for Continuous Control cs.LG · 2023-10-25 · conditional · none · ref 125
TD-MPC2 scales an implicit world-model RL method to a 317M-parameter agent that masters 80 tasks across four domains with a single hyperparameter configuration.
Is Conditional Generative Modeling all you need for Decision-Making? cs.LG · 2022-11-28 · unverdicted · none · ref 294
Return-conditional diffusion models for policies outperform offline RL on benchmarks by circumventing dynamic programming and enable constraint or skill composition.
PROWL: Prioritized Regret-Driven Optimization for World Model Learning cs.LG · 2026-05-11 · unverdicted · none · ref 4
PROWL introduces a KL-constrained adversarial curriculum and prioritized adversarial trajectory buffer to actively discover and correct rare failure modes in action-conditioned video world models.
LASER: Learning Active Sensing for Continuum Field Reconstruction cs.LG · 2026-04-21 · unreviewed · ref 26

International Conference on Learning Representations , year=

fields

years

verdicts

representative citing papers

citing papers explorer