Mujoco: A physics engine for model-based control

Emanuel Todorov, Tom Erez, Yuval Tassa · 2012

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

representative citing papers

Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

TCE bridges domain gaps in offline RL by selectively using source data or generating target-aligned transitions via a dual score-based model, outperforming baselines in experiments.

Target-Aligned Bellman Backup for Cross-domain Offline Reinforcement Learning

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

Target-Aligned Bellman Backup (TABB) improves cross-domain offline RL by selecting source transitions according to their contribution to accurate target-domain Bellman target estimation.

Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning

cs.LG · 2025-06-26 · unverdicted · novelty 5.0

SSE improves long-horizon goal-conditioned RL by using failure and partial-success transitions to identify unreliable subgoals, streamline high-level planning, and outperform prior hierarchical methods on benchmarks.

Coding Agent Is Good As World Simulator

cs.AI · 2026-05-14 · unverdicted · novelty 4.0

An agentic framework generates executable physics simulation code from text prompts via coordinated planning, coding, visual, and physics agents that iterate to satisfy both prompt fidelity and physical constraints.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning cs.LG · 2026-05-13 · unverdicted · none · ref 40
TCE bridges domain gaps in offline RL by selectively using source data or generating target-aligned transitions via a dual score-based model, outperforming baselines in experiments.
Target-Aligned Bellman Backup for Cross-domain Offline Reinforcement Learning cs.LG · 2026-05-21 · unverdicted · none · ref 31
Target-Aligned Bellman Backup (TABB) improves cross-domain offline RL by selecting source transitions according to their contribution to accurate target-domain Bellman target estimation.
Coding Agent Is Good As World Simulator cs.AI · 2026-05-14 · unverdicted · none · ref 17
An agentic framework generates executable physics simulation code from text prompts via coordinated planning, coding, visual, and physics agents that iterate to satisfy both prompt fidelity and physical constraints.

Mujoco: A physics engine for model-based control

fields

years

verdicts

representative citing papers

citing papers explorer