Robo3R predicts accurate metric-scale 3D scene geometry from RGB images and robot states for improved robotic manipulation performance.
Continuous 3d perception model with persistent state
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
verdicts
UNVERDICTED 2representative citing papers
FLEG reconstructs language-embedded 3D Gaussians from arbitrary input views using a dual-branch distillation framework and a sparse set of semantic Gaussians that requires only 5% of prior embeddings.
citing papers explorer
-
Robo3R: Enhancing Robotic Manipulation with Accurate Feed-Forward 3D Reconstruction
Robo3R predicts accurate metric-scale 3D scene geometry from RGB images and robot states for improved robotic manipulation performance.
-
FLEG: Feed-Forward Language Embedded Gaussian Splatting from Any Views via Compact Semantic Representation
FLEG reconstructs language-embedded 3D Gaussians from arbitrary input views using a dual-branch distillation framework and a sparse set of semantic Gaussians that requires only 5% of prior embeddings.