URL https://arxiv.org/abs/ 2408.14368

Li, P · 2024 · arXiv 2408.14368

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

What Matters in Building Vision-Language-Action Models for Generalist Robots

cs.RO · 2024-12-18 · unverdicted · novelty 5.0

Systematic tests of VLM backbones, policy architectures, and cross-embodiment data yield RoboVLMs that set new SOTA on robot manipulation benchmarks while requiring few manual designs.

EL3DD: Extended Latent 3D Diffusion for Language Conditioned Multitask Manipulation

cs.RO · 2025-11-17 · unverdicted · novelty 4.0

EL3DD extends latent 3D diffusion with language inputs and reference demonstrations to improve success rates on sequential manipulation tasks in the CALVIN dataset.

citing papers explorer

Showing 2 of 2 citing papers.

What Matters in Building Vision-Language-Action Models for Generalist Robots cs.RO · 2024-12-18 · unverdicted · none · ref 21
Systematic tests of VLM backbones, policy architectures, and cross-embodiment data yield RoboVLMs that set new SOTA on robot manipulation benchmarks while requiring few manual designs.
EL3DD: Extended Latent 3D Diffusion for Language Conditioned Multitask Manipulation cs.RO · 2025-11-17 · unverdicted · none · ref 9
EL3DD extends latent 3D diffusion with language inputs and reference demonstrations to improve success rates on sequential manipulation tasks in the CALVIN dataset.

URL https://arxiv.org/abs/ 2408.14368

fields

years

verdicts

representative citing papers

citing papers explorer