Mujoco: A physics engine for model-based control

Emanuel Todorov, Tom Erez, Yuval Tassa · 2012

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

representative citing papers

Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

TCE bridges domain gaps in offline RL by selectively using source data or generating target-aligned transitions via a dual score-based model, outperforming baselines in experiments.

Target-Aligned Bellman Backup for Cross-domain Offline Reinforcement Learning

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

Target-Aligned Bellman Backup (TABB) improves cross-domain offline RL by selecting source transitions according to their contribution to accurate target-domain Bellman target estimation.

Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning

cs.LG · 2025-06-26 · unverdicted · novelty 5.0

SSE improves long-horizon goal-conditioned RL by using failure and partial-success transitions to identify unreliable subgoals, streamline high-level planning, and outperform prior hierarchical methods on benchmarks.

Coding Agent Is Good As World Simulator

cs.AI · 2026-05-14

citing papers explorer

Showing 4 of 4 citing papers.

Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning cs.LG · 2026-05-13 · unverdicted · none · ref 40
TCE bridges domain gaps in offline RL by selectively using source data or generating target-aligned transitions via a dual score-based model, outperforming baselines in experiments.
Target-Aligned Bellman Backup for Cross-domain Offline Reinforcement Learning cs.LG · 2026-05-21 · unverdicted · none · ref 31
Target-Aligned Bellman Backup (TABB) improves cross-domain offline RL by selecting source transitions according to their contribution to accurate target-domain Bellman target estimation.
Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning cs.LG · 2025-06-26 · unverdicted · none · ref 43
SSE improves long-horizon goal-conditioned RL by using failure and partial-success transitions to identify unreliable subgoals, streamline high-level planning, and outperform prior hierarchical methods on benchmarks.
Coding Agent Is Good As World Simulator cs.AI · 2026-05-14 · unreviewed · ref 17

Mujoco: A physics engine for model-based control

fields

years

verdicts

representative citing papers

citing papers explorer