ORRB -- OpenAI Remote Rendering Backend

Lilian Weng; Maciek Chociej; Peter Welinder

arxiv: 1906.11633 · v1 · pith:FCNC4DWVnew · submitted 2019-06-26 · 💻 cs.GR · cs.LG· stat.ML

ORRB -- OpenAI Remote Rendering Backend

Maciek Chociej , Peter Welinder , Lilian Weng This is my paper

Pith reviewed 2026-05-25 14:56 UTC · model grok-4.3

classification 💻 cs.GR cs.LGstat.ML

keywords remote renderingrobotics environmentsvisual domain randomizationUnity3dMuJoCocloud deploymentsimulation backendhigh throughput rendering

0 comments

The pith

ORRB connects Unity3d to MuJoCo to deliver fast customizable rendering for robotics environments.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces ORRB as a dedicated rendering backend for robotics simulations. It links the Unity3d game engine to the MuJoCo physics library to produce visuals on demand. The architecture prioritizes support for visual domain randomization while running at high speed when deployed in the cloud. Releasing the code publicly under an MIT license makes the tool available for building varied training setups.

Core claim

We present the OpenAI Remote Rendering Backend (ORRB), a system that allows fast and customizable rendering of robotics environments. It is based on the Unity3d game engine and interfaces with the MuJoCo physics simulation library. ORRB was designed with visual domain randomization in mind. It is optimized for cloud deployment and high throughput operation. We are releasing it to the public under a liberal MIT license.

What carries the argument

The ORRB interface that links Unity3d rendering to MuJoCo physics simulation to support high-throughput output with arbitrary visual randomization.

If this is right

High-throughput rendering enables larger batches of randomized visuals during policy training.
Cloud-optimized design reduces the time needed to generate diverse simulation data.
Support for arbitrary visual parameters allows direct control over appearance variation without physics changes.
Public release permits direct use and modification of the backend in other robotics projects.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same bridging pattern could be tested with other game engines or physics libraries to compare throughput.
Widespread adoption might standardize how visual randomization is injected into physics-based training loops.
Long-term use could reveal whether the added rendering layer affects overall simulation stability in extended runs.

Load-bearing premise

An interface between Unity3d and MuJoCo can be maintained at high throughput with low latency while supporting arbitrary visual randomization parameters without introducing simulation artifacts or synchronization errors.

What would settle it

A benchmark run that measures rendering latency and visual fidelity when applying multiple randomization parameters at peak cloud throughput and finds either latency spikes or visible artifacts.

Figures

Figures reproduced from arXiv: 1906.11633 by Lilian Weng, Maciek Chociej, Peter Welinder.

**Figure 2.** Figure 2: System overview. diligent care into making sure that the randomizers are executed in the same order, and that the flow of the randomized code is exactly the same – leading to reproducible, identical randomization results. That said, in our current model, with OpenGL rendering and a number of GPU based rendering techniques, some low level pixel discrepancy of the final images is unavoidable. We release ORRB… view at source ↗

read the original abstract

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

ORRB is a released Unity-MuJoCo rendering backend for robotics domain randomization, but the paper is a short descriptive announcement with no benchmarks or claims to evaluate.

read the letter

The main point is that OpenAI has put out ORRB, a rendering system that pairs Unity3D with MuJoCo for fast, customizable visuals in robotics simulations, built around visual domain randomization and cloud scaling, and released under MIT on GitHub. The new element is the public availability of this particular interface and setup, which removes the need for others to wire the two engines together themselves for similar use cases. The description is clear about the design goals and intended workflow, which is useful for anyone already working in sim-to-real pipelines. The paper stays at the level of architecture and release notes, with no equations, derivations, or fitted results. The main limitation is the absence of any timing data, latency figures, throughput measurements, or checks for synchronization issues, so there is no evidence presented on whether the system actually delivers on the speed and reliability claims. This is aimed at practitioners who might download the code and test it in their own environments rather than readers seeking new methods or validated performance. It does not rise to the level of needing peer review, since the contribution is the released software itself and not a testable scientific argument. Skip formal refereeing and just flag the GitHub link for anyone who needs a ready rendering backend.

Referee Report

0 major / 2 minor

Summary. The manuscript presents the OpenAI Remote Rendering Backend (ORRB), a Unity3d-based system interfaced with MuJoCo for fast, customizable rendering of robotics environments. It emphasizes support for visual domain randomization, cloud deployment, high throughput, and public release under the MIT license at the provided GitHub repository.

Significance. The public release of an open-source rendering backend tailored to robotics simulation and domain randomization could provide a practical tool for the computer graphics and robotics communities, potentially supporting reproducibility and experimentation in sim-to-real transfer tasks. Credit is due for the liberal licensing and explicit focus on cloud-optimized throughput.

minor comments (2)

[Abstract] The manuscript is extremely brief (essentially the abstract plus release notice). Consider adding a high-level architecture diagram or pseudocode snippet illustrating the Unity-MuJoCo interface and randomization pipeline.
No usage examples, API overview, or configuration parameters are described; a short 'Getting Started' paragraph would improve accessibility for potential users.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive assessment of the ORRB manuscript, including recognition of its practical utility for the robotics and graphics communities, support for reproducibility, and the liberal MIT licensing. The recommendation for minor revision is noted; however, the report contains no specific major comments requiring response.

Circularity Check

0 steps flagged

No significant circularity: systems description with no derivations or fitted quantities

full rationale

The document is a short software release announcement describing the ORRB system (Unity3d-based renderer interfaced to MuJoCo for visual domain randomization and cloud deployment). It contains no equations, no predictions, no fitted parameters, no uniqueness theorems, and no self-citations that bear load on any claim. The central claim is simply the existence and public availability of the released code; all performance or interface properties are implementation details of the released artifact rather than derived results. No load-bearing step reduces to its own inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is a systems and software release paper; it introduces no mathematical free parameters, axioms, or new physical entities.

pith-pipeline@v0.9.0 · 5596 in / 1013 out tokens · 20120 ms · 2026-05-25T14:56:18.955822+00:00 · methodology

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Solving Rubik's Cube with a Robot Hand
cs.LG 2019-10 accept novelty 7.0

Reinforcement learning models trained only in simulation using automatic domain randomization solve Rubik's cube with a real robot hand.

Reference graph

Works this paper leans on

17 extracted references · 17 canonical work pages · cited by 1 Pith paper · 7 internal anchors

[1]

D. Abel, A. Agarwal, F. Diaz, A. Krishnamurthy, and R. E. Schapire. Exploratory gradient boosting for reinforcement learning in complex domains. CoRR, abs/1603.04119, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[2]

Unreal game engine

Epic Games. Unreal game engine. Available at: https://www.unrealengine.com/

work page
[3]

Google. Grpc. Available at: https://grpc.io/

work page
[4]

Jaderberg, W

M. Jaderberg, W. M. Czarnecki, I. Dunning, L. Marris, G. Lever, A. G. Castañeda, C. Beattie, N. C. Rabinowitz, A. S. Morcos, A. Ruderman, N. Sonnerat, T. Green, L. Deason, J. Z. Leibo, D. Silver, D. Hassabis, K. Kavukcuoglu, and T. Graepel. Human-level performance in 3d multiplayer games with population-based reinforcement learning. Science, 364(6443):859...

work page 2019
[5]

Juliani, V .-P

A. Juliani, V .-P. Berges, E. Vckay, Y . Gao, H. Henry, M. Mattar, and D. Lange. Unity: A general platform for intelligent agents. CoRR, abs/1809.02627, 2018

work page arXiv 2018
[6]

Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning

A. Juliani, A. Khalifa, V . Berges, J. Harper, H. Henry, A. Crespi, J. Togelius, and D. Lange. Obstacle tower: A generalization challenge in vision, control, and planning. CoRR, abs/1902.01378, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1902
[7]

ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning

M. Kempka, M. Wydmuch, G. Runc, J. Toczek, and W. Jaskowski. Vizdoom: A doom-based AI research platform for visual reinforcement learning. CoRR, abs/1605.02097, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[8]

V . Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. A. Riedmiller. Playing atari with deep reinforcement learning. CoRR, abs/1312.5602, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013
[9]

Openai Five, 2019

OpenAI. Openai Five, 2019. Available at: https://openai.com/five/

work page 2019
[10]

Learning Dexterous In-Hand Manipulation

OpenAI, M. Andrychowicz, B. Baker, M. Chociej, R. Józefowicz, B. McGrew, J. W. Pachocki, J. Pachocki, A. Petron, M. Plappert, G. Powell, A. Ray, J. Schneider, S. Sidor, J. Tobin, P. Welinder, L. Weng, and W. Zaremba. Learning dexterous in-hand manipulation. CoRR, abs/1808.00177, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[11]

Sadeghi and S

F. Sadeghi and S. Levine. CAD2RL: real single-image ﬂight without a single real image. In Robotics: Science and Systems XIII, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA, July 12-16, 2017, 2017

work page 2017
[12]

S. Shah, D. Dey, C. Lovett, and A. Kapoor. Airsim: High-ﬁdelity visual and physical simulation for autonomous vehicles. CoRR, abs/1705.05065, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[13]

Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World

J. Tobin, R. Fong, A. Ray, J. Schneider, W. Zaremba, and P. Abbeel. Domain randomization for transferring deep neural networks from simulation to the real world. arXiv preprint arXiv:1703.06907, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[14]

Todorov, T

E. Todorov, T. Erez, and Y . Tassa. Mujoco: A physics engine for model-based control. In Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on , pages 5026–5033. IEEE, 2012

work page 2012
[15]

Simviz, 2018

Unity Technologies. Simviz, 2018. Available at: https://blogs.unity3d.com/tag/simviz/

work page 2018
[16]

Unity 3d game engine, 2019

Unity Technologies. Unity 3d game engine, 2019. Available at: https://unity.com

work page 2019
[17]

Vinyals, I

O. Vinyals, I. Babuschkin, J. Chung, M. Mathieu, M. Jaderberg, W. M. Czarnecki, A. Dudzik, A. Huang, P. Georgiev, R. Powell, T. Ewalds, D. Horgan, M. Kroiss, I. Danihelka, J. Agapiou, J. Oh, V . Dalibard, D. Choi, L. Sifre, Y . Sulsky, S. Vezhnevets, J. Molloy, T. Cai, D. Budden, T. Paine, C. Gulcehre, Z. Wang, T. Pfaff, T. Pohlen, Y . Wu, D. Yogatama, J....

work page 2019

[1] [1]

D. Abel, A. Agarwal, F. Diaz, A. Krishnamurthy, and R. E. Schapire. Exploratory gradient boosting for reinforcement learning in complex domains. CoRR, abs/1603.04119, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[2] [2]

Unreal game engine

Epic Games. Unreal game engine. Available at: https://www.unrealengine.com/

work page

[3] [3]

Google. Grpc. Available at: https://grpc.io/

work page

[4] [4]

Jaderberg, W

M. Jaderberg, W. M. Czarnecki, I. Dunning, L. Marris, G. Lever, A. G. Castañeda, C. Beattie, N. C. Rabinowitz, A. S. Morcos, A. Ruderman, N. Sonnerat, T. Green, L. Deason, J. Z. Leibo, D. Silver, D. Hassabis, K. Kavukcuoglu, and T. Graepel. Human-level performance in 3d multiplayer games with population-based reinforcement learning. Science, 364(6443):859...

work page 2019

[5] [5]

Juliani, V .-P

A. Juliani, V .-P. Berges, E. Vckay, Y . Gao, H. Henry, M. Mattar, and D. Lange. Unity: A general platform for intelligent agents. CoRR, abs/1809.02627, 2018

work page arXiv 2018

[6] [6]

Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning

A. Juliani, A. Khalifa, V . Berges, J. Harper, H. Henry, A. Crespi, J. Togelius, and D. Lange. Obstacle tower: A generalization challenge in vision, control, and planning. CoRR, abs/1902.01378, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1902

[7] [7]

ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning

M. Kempka, M. Wydmuch, G. Runc, J. Toczek, and W. Jaskowski. Vizdoom: A doom-based AI research platform for visual reinforcement learning. CoRR, abs/1605.02097, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[8] [8]

V . Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. A. Riedmiller. Playing atari with deep reinforcement learning. CoRR, abs/1312.5602, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013

[9] [9]

Openai Five, 2019

OpenAI. Openai Five, 2019. Available at: https://openai.com/five/

work page 2019

[10] [10]

Learning Dexterous In-Hand Manipulation

OpenAI, M. Andrychowicz, B. Baker, M. Chociej, R. Józefowicz, B. McGrew, J. W. Pachocki, J. Pachocki, A. Petron, M. Plappert, G. Powell, A. Ray, J. Schneider, S. Sidor, J. Tobin, P. Welinder, L. Weng, and W. Zaremba. Learning dexterous in-hand manipulation. CoRR, abs/1808.00177, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[11] [11]

Sadeghi and S

F. Sadeghi and S. Levine. CAD2RL: real single-image ﬂight without a single real image. In Robotics: Science and Systems XIII, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA, July 12-16, 2017, 2017

work page 2017

[12] [12]

S. Shah, D. Dey, C. Lovett, and A. Kapoor. Airsim: High-ﬁdelity visual and physical simulation for autonomous vehicles. CoRR, abs/1705.05065, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[13] [13]

Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World

J. Tobin, R. Fong, A. Ray, J. Schneider, W. Zaremba, and P. Abbeel. Domain randomization for transferring deep neural networks from simulation to the real world. arXiv preprint arXiv:1703.06907, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[14] [14]

Todorov, T

E. Todorov, T. Erez, and Y . Tassa. Mujoco: A physics engine for model-based control. In Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on , pages 5026–5033. IEEE, 2012

work page 2012

[15] [15]

Simviz, 2018

Unity Technologies. Simviz, 2018. Available at: https://blogs.unity3d.com/tag/simviz/

work page 2018

[16] [16]

Unity 3d game engine, 2019

Unity Technologies. Unity 3d game engine, 2019. Available at: https://unity.com

work page 2019

[17] [17]

Vinyals, I

O. Vinyals, I. Babuschkin, J. Chung, M. Mathieu, M. Jaderberg, W. M. Czarnecki, A. Dudzik, A. Huang, P. Georgiev, R. Powell, T. Ewalds, D. Horgan, M. Kroiss, I. Danihelka, J. Agapiou, J. Oh, V . Dalibard, D. Choi, L. Sifre, Y . Sulsky, S. Vezhnevets, J. Molloy, T. Cai, D. Budden, T. Paine, C. Gulcehre, Z. Wang, T. Pfaff, T. Pohlen, Y . Wu, D. Yogatama, J....

work page 2019