RoboCerebra: A large-scale benchmark for long-horizon robotic manipulation evalua- tion,

· 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

vla-eval: A Unified Evaluation Harness for Vision-Language-Action Models

cs.AI · 2026-03-14 · accept · novelty 5.0

vla-eval decouples VLA model inference from benchmark execution via WebSocket and Docker, supporting 14 benchmarks with up to 47x speedup and reproducing published scores across six codebases.

citing papers explorer

Showing 1 of 1 citing paper.

vla-eval: A Unified Evaluation Harness for Vision-Language-Action Models cs.AI · 2026-03-14 · accept · none · ref 12
vla-eval decouples VLA model inference from benchmark execution via WebSocket and Docker, supporting 14 benchmarks with up to 47x speedup and reproducing published scores across six codebases.

RoboCerebra: A large-scale benchmark for long-horizon robotic manipulation evalua- tion,

fields

years

verdicts

representative citing papers

citing papers explorer