Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks

2022

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation

cs.RO · 2024-09-19 · unverdicted · novelty 4.0

TinyVLA achieves faster inference and higher data efficiency than OpenVLA on robotic manipulation tasks by initializing from high-speed multimodal models and adding a diffusion policy decoder, without any pre-training phase.

citing papers explorer

Showing 1 of 1 citing paper.

TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation

cs.RO · 2024-09-19 · unverdicted · none · ref 31

TinyVLA achieves faster inference and higher data efficiency than OpenVLA on robotic manipulation tasks by initializing from high-speed multimodal models and adding a diffusion policy decoder, without any pre-training phase.
