GS-Playground: A High-Throughput Photorealistic Simulator for Vision-Informed Robot Learning

· 2026 · cs.RO · arXiv 2604.25459

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Embodied AI research is undergoing a shift toward vision-centric perceptual paradigms. While massively parallel simulators have catalyzed breakthroughs in proprioception-based locomotion, their potential remains largely untapped for vision-informed tasks due to the prohibitive computational overhead of large-scale photorealistic rendering. Furthermore, the creation of simulation-ready 3D assets heavily relies on labor-intensive manual modeling, while the significant sim-to-real physical gap hinders the transfer of contact-rich manipulation policies. To address these bottlenecks, we propose GS-Playground, a multi-modal simulation framework designed to accelerate end-to-end perceptual learning. We develop a novel high-performance parallel physics engine, specifically designed to integrate with a batch 3D Gaussian Splatting (3DGS) rendering pipeline to ensure high-fidelity synchronization. Our system achieves a breakthrough throughput of 10^4 FPS at 640x480 resolution, significantly lowering the barrier for large-scale visual RL. Additionally, we introduce an automated Real2Sim workflow that reconstructs photorealistic, physically consistent, and memory-efficient environments, streamlining the generation of complex simulation-ready scenes. Extensive experiments on locomotion, navigation, and manipulation demonstrate that GS-Playground effectively bridges the perceptual and physical gaps across diverse embodied tasks. Project homepage: https://gsplayground.github.io.

representative citing papers

LEGS: Fine-Tuning Teleop-Free VLAs for Humanoid Loco-manipulation in an Embodied Gaussian Splatting World

cs.RO · 2026-05-31 · unverdicted · novelty 6.0

LEGS shows synthetic data from a 3DGS-mesh hybrid simulator trains VLA policies for humanoid pick-and-place that match or exceed human teleoperation performance across multiple backbones and tasks while enabling low-cost robustness to appearance shifts.

MuJoCoUni:Persistent Batched Runtime Primitives for MuJoCo

cs.RO · 2026-05-24 · unverdicted · novelty 5.0

MuJoCoUni introduces BatchEnvPool, a C++/pybind11-based executor providing persistent batched stateful MuJoCo environments with domain randomization, sensor queries, and short stepping.

citing papers explorer

Showing 2 of 2 citing papers.

LEGS: Fine-Tuning Teleop-Free VLAs for Humanoid Loco-manipulation in an Embodied Gaussian Splatting World cs.RO · 2026-05-31 · unverdicted · none · ref 26 · internal anchor
LEGS shows synthetic data from a 3DGS-mesh hybrid simulator trains VLA policies for humanoid pick-and-place that match or exceed human teleoperation performance across multiple backbones and tasks while enabling low-cost robustness to appearance shifts.
MuJoCoUni:Persistent Batched Runtime Primitives for MuJoCo cs.RO · 2026-05-24 · unverdicted · none · ref 5 · internal anchor
MuJoCoUni introduces BatchEnvPool, a C++/pybind11-based executor providing persistent batched stateful MuJoCo environments with domain randomization, sensor queries, and short stepping.

GS-Playground: A High-Throughput Photorealistic Simulator for Vision-Informed Robot Learning

fields

years

verdicts

representative citing papers

citing papers explorer