Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World

2017 · cs.RO · arXiv 1703.06907

Canonical reference. 100% of citing Pith papers cite this work as background.

21 Pith papers citing it

Background: 100% of classified citations

abstract

Bridging the 'reality gap' that separates simulated robotics from experiments on hardware could accelerate robotic research through improved data availability. This paper explores domain randomization, a simple technique for training models on simulated images that transfer to real images by randomizing rendering in the simulator. With enough variability in the simulator, the real world may appear to the model as just another variation. We focus on the task of object localization, which is a stepping stone to general robotic manipulation skills. We find that it is possible to train a real-world object detector that is accurate to 1.5 cm and robust to distractors and partial occlusions using only data from a simulator with non-realistic random textures. To demonstrate the capabilities of our detectors, we show they can be used to perform grasping in a cluttered environment. To our knowledge, this is the first successful transfer of a deep neural network trained only on simulated RGB images (without pre-training on real images) to the real world for the purpose of robotic control.
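As a rough illustration of the idea in the abstract, the sketch below samples randomized scene configurations of the kind a simulator could render into training images. It is not the authors' implementation: the `SceneParams` fields, the sampling ranges, and the helper names are all hypothetical. The abstract mentions only random textures and distractors; the lighting and camera jitter fields are assumptions added for illustration. Rendering and network training are omitted, so the "dataset" holds scene parameters paired with the ground-truth object position rather than images.

```python
import random
from dataclasses import dataclass


@dataclass
class SceneParams:
    """One randomized rendering configuration (all fields are illustrative)."""
    object_xy: tuple        # ground-truth object position on the table, metres
    texture_rgb: tuple      # flat random colour standing in for a random texture
    num_distractors: int    # extra objects the detector should learn to ignore
    light_intensity: float  # arbitrary units (assumed parameter)
    camera_jitter: tuple    # offset from a nominal camera pose, metres (assumed)


def sample_scene(rng):
    """Draw one scene. Ranges are placeholders, not values from the paper."""
    return SceneParams(
        object_xy=(rng.uniform(-0.3, 0.3), rng.uniform(-0.3, 0.3)),
        texture_rgb=tuple(rng.random() for _ in range(3)),
        num_distractors=rng.randint(0, 10),
        light_intensity=rng.uniform(0.2, 2.0),
        camera_jitter=tuple(rng.uniform(-0.05, 0.05) for _ in range(3)),
    )


def make_dataset(n, seed=0):
    """Return n (scene, label) pairs; a real pipeline would render each scene."""
    rng = random.Random(seed)
    scenes = [sample_scene(rng) for _ in range(n)]
    # The label is the object position the localization model should predict.
    return [(scene, scene.object_xy) for scene in scenes]


dataset = make_dataset(1000)
```

Every sample draws fresh appearance parameters while the label stays tied to the object position, so a model trained on such data cannot rely on any single texture or lighting condition. That is the sense in which the real world may appear as just another variation.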

citation-role summary

background 7

citation-polarity summary

background 7

representative citing papers

Learning Object Manipulation from Scratch via Contrastive Interaction

cs.RO · 2026-06-10 · unverdicted · novelty 7.0

IWR improves CRL sample efficiency and performance in interaction-rich manipulation by interaction-aware resampling that preserves mode boundaries, yielding 19.8% average gains and a real-world air-hockey agent.

MiraBench: Evaluating Action-Conditioned Reliability in Robotic World Models

cs.AI · 2026-05-28 · unverdicted · novelty 7.0

MiraBench defines action-conditioned reliability via three levels (physics adherence, action-following fidelity, optimism bias detection) and applies it to 12 model configurations using a 16,000-judgment human corpus, finding visual fidelity a poor proxy for action fidelity, no reliable scale benefi

Robots that learn to evaluate models of collective behavior

cs.RO · 2026-04-08 · unverdicted · novelty 7.0 · 2 refs

A robotic fish learns goal-directed policies in simulation and interacts with live fish to quantify how well different behavioral models match real responses using Wasserstein distances on performance metrics.

Solving Rubik's Cube with a Robot Hand

cs.LG · 2019-10-16 · accept · novelty 7.0

Reinforcement learning models trained only in simulation using automatic domain randomization solve Rubik's cube with a real robot hand.

Tune to Learn: How Controller Gains Shape Robot Policy Learning

cs.RO · 2026-04-02 · conditional · novelty 7.0

Controller gains affect learnability differently for behavior cloning, RL from scratch, and sim-to-real transfer, so optimal gains depend on the learning paradigm rather than desired task behavior.

A Simulation Platform for Flapping-Wing Vehicles

cs.RO · 2026-06-01 · unverdicted · novelty 6.0

FWAV-Sim is a high-fidelity Unity simulation framework for flapping-wing vehicles that integrates blade-element aerodynamics with bluff-body drag, spatiotemporally correlated fractal turbulence, and realistic IMU/LiDAR/RGB sensor models to support autonomy development.

Distributionally Robust Control via Stein Variational Inference for Contact-Rich Manipulation

cs.RO · 2026-05-18 · unverdicted · novelty 6.0

Introduces a Stein variational inference-based deterministic formulation for distributionally robust control in contact-rich robotic manipulation, reporting up to 3x improved robustness under parametric uncertainty.

Computer Use at the Edge of the Statistical Precipice

cs.SE · 2026-05-07 · unverdicted · novelty 6.0

A blind replay script matches frontier model performance on static CUA benchmarks due to non-principled environments and evaluation methods, prompting PRISM design principles and the DigiWorld benchmark with improved statistical aggregation.

Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting

cs.RO · 2026-04-14 · unverdicted · novelty 6.0

Habitat-GS integrates 3D Gaussian Splatting scene rendering and Gaussian avatars into Habitat-Sim, yielding agents with stronger cross-domain generalization and effective human-aware navigation.

EmbodiedGovBench: A Benchmark for Governance, Recovery, and Upgrade Safety in Embodied Agent Systems

cs.RO · 2026-04-13 · unverdicted · novelty 6.0

EmbodiedGovBench is a new benchmark framework that measures embodied agent systems on seven governance dimensions including policy adherence, recovery success, and upgrade safety.

Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning

cs.RO · 2021-08-24 · conditional · novelty 6.0

Isaac Gym achieves 2-3 orders of magnitude faster robot policy training by keeping physics simulation and PyTorch-based RL entirely on GPU with direct buffer sharing.

Bridging Performance and Generalization in Reinforcement Learning for Agile Flight

cs.RO · 2026-06-25 · unverdicted · novelty 5.0

RL framework for agile drone racing combines task-aware switching and physically informed procedural track generation to achieve 7.4x better zero-shot generalization to unseen tracks while maintaining competitive speeds.

Guided Action Flow: Q-Guided Inference for Flow-Matching Vision-Language-Action Policies

cs.RO · 2026-07-02 · unverdicted · novelty 4.0

Guided Action Flow applies a rollout-trained critic to steer frozen flow-matching VLA policies at inference time via action gradients, reporting success rate gains on LIBERO manipulation tasks.

ORRB -- OpenAI Remote Rendering Backend

cs.GR · 2019-06-26 · unverdicted · novelty 4.0

ORRB is an open-source remote rendering backend that pairs Unity3d with MuJoCo for high-throughput, customizable visual domain randomization in robotics environments.

SEVO: Semantic-Enhanced Virtual Observation for Robust VLA Manipulation via Active Illumination and Data-Centric Collection

cs.RO · 2026-05-11 · conditional · novelty 4.0

SEVO raises ACT and SmolVLA pick-and-place success from 30-35% to 75-85% in novel environments by using active illumination, semantic cues, and diversified teleoperation data.

Vision-Language-Action Models: Experimental Insights from a Real-World UR5 Platform

cs.RO · 2026-06-29 · unverdicted · novelty 3.0

Real-robot trials with OpenVLA on a UR5e arm show consistent offline-to-closed-loop gaps driven by action semantics, coordinate conventions, temporal alignment, image preprocessing, and dataset quality rather than model capacity.

Machine Learning Approaches for Improved Scalability of Metallic Magnetic Calorimeters

physics.ins-det · 2026-06-23 · unverdicted · novelty 3.0

Machine learning methods are explored for pulse classification, artifact rejection, and shape analysis in metallic magnetic calorimeters to improve scalability over traditional signal processing.

Zero-shot Transfer of Reinforcement Learning Control Policies for the Swing-Up and Stabilization of a Cart-Pole System

cs.RO · 2026-06-20 · unverdicted · novelty 3.0

Zero-shot sim-to-real transfer of independently trained RL policies for cart-pole swing-up and stabilization is achieved via sensitivity-guided domain randomization, linear curriculum learning, and first-order action smoothing with Simulink switching logic.

A Real-Calibrated Synthetic-First Data Engine

eess.IV · 2026-05-10 · unverdicted · novelty 3.0

A data curation pipeline using diffusion-generated synthetic images improves pose estimation when added to real data but underperforms when used without real anchors.

Efficiently Linking Real Scenes with Synthetic Data Generation for AI-based Cognitive Robotics and Computer Vision Applications

cs.RO · 2026-06-18 · unverdicted · novelty 2.0

The paper reviews limits in AI vision for robotics and describes work-in-progress on bridging sim-to-real domain gaps by linking real and synthetic training data.

GUI-Perturbed: Domain Randomization Reveals Systematic Brittleness in GUI Grounding Models

cs.LG · 2026-04-15

citing papers explorer

Learning Object Manipulation from Scratch via Contrastive Interaction · cs.RO · 2026-06-10 · unverdicted · none · ref 91 · internal anchor
MiraBench: Evaluating Action-Conditioned Reliability in Robotic World Models · cs.AI · 2026-05-28 · unverdicted · none · ref 40 · internal anchor
Robots that learn to evaluate models of collective behavior · cs.RO · 2026-04-08 · unverdicted · none · ref 17 · 2 links · internal anchor
A Simulation Platform for Flapping-Wing Vehicles · cs.RO · 2026-06-01 · unverdicted · none · ref 37 · internal anchor
Distributionally Robust Control via Stein Variational Inference for Contact-Rich Manipulation · cs.RO · 2026-05-18 · unverdicted · none · ref 46 · internal anchor
Computer Use at the Edge of the Statistical Precipice · cs.SE · 2026-05-07 · unverdicted · none · ref 15
Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting · cs.RO · 2026-04-14 · unverdicted · none · ref 29
EmbodiedGovBench: A Benchmark for Governance, Recovery, and Upgrade Safety in Embodied Agent Systems · cs.RO · 2026-04-13 · unverdicted · none · ref 22
Bridging Performance and Generalization in Reinforcement Learning for Agile Flight · cs.RO · 2026-06-25 · unverdicted · none · ref 14 · internal anchor
Guided Action Flow: Q-Guided Inference for Flow-Matching Vision-Language-Action Policies · cs.RO · 2026-07-02 · unverdicted · none · ref 27 · internal anchor
ORRB -- OpenAI Remote Rendering Backend · cs.GR · 2019-06-26 · unverdicted · none · ref 13 · internal anchor
Vision-Language-Action Models: Experimental Insights from a Real-World UR5 Platform · cs.RO · 2026-06-29 · unverdicted · none · ref 29 · internal anchor
Machine Learning Approaches for Improved Scalability of Metallic Magnetic Calorimeters · physics.ins-det · 2026-06-23 · unverdicted · none · ref 64 · internal anchor
Zero-shot Transfer of Reinforcement Learning Control Policies for the Swing-Up and Stabilization of a Cart-Pole System · cs.RO · 2026-06-20 · unverdicted · none · ref 23 · internal anchor
A Real-Calibrated Synthetic-First Data Engine · eess.IV · 2026-05-10 · unverdicted · none · ref 13
Efficiently Linking Real Scenes with Synthetic Data Generation for AI-based Cognitive Robotics and Computer Vision Applications · cs.RO · 2026-06-18 · unverdicted · none · ref 51 · internal anchor
