As shown in Table 2, we can set a very large environment batch size: up to 256 (2048/8) for colocated placement and 272 (3276/(6 ∗ 2)) for hybrid placement
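
The arithmetic in the quoted sentence can be checked directly. The sketch below is a minimal illustration, assuming the parenthesized expressions are plain divisions of a total by a divisor. The source does not say what each operand denotes, and the function name `per_unit_batch` is hypothetical. Note that 3276/(6 ∗ 2) evaluates to 273 rather than the quoted 272 (272 would correspond to a total of 3264), so one of the figures in the quote may be a typo. This page alone does not settle which.

```python
def per_unit_batch(total: int, divisor: int) -> tuple[int, int]:
    """Return (quotient, remainder) of total / divisor.

    Hypothetical helper: the operands are taken verbatim from the quoted
    sentence; what they denote is not stated in the source.
    """
    return divmod(total, divisor)


# Colocated placement, as quoted: 2048 / 8
colocated = per_unit_batch(2048, 8)

# Hybrid placement, as quoted: 3276 / (6 * 2)
hybrid = per_unit_batch(3276, 6 * 2)

# Total that would yield exactly the quoted value of 272 with the same divisor
total_for_272 = 272 * (6 * 2)

print("colocated:", colocated)
print("hybrid:", hybrid)
print("total giving 272:", total_for_272)
```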

combines very large vectorized environment counts with GPU-accelerated simulation · 2048

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

RL-VLA$^3$: A Flexible and Asynchronous Reinforcement Learning Framework for VLA Training

cs.AI · 2026-02-05 · unverdicted · novelty 7.0 · ref 26

RL-VLA³ is an asynchronous RL framework for VLA training that delivers up to 85.2% higher throughput than synchronous baselines while preserving identical sample efficiency and scaling to 256 GPUs.
