Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications

2025 · arXiv 2510.07077

3 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

ASCII Art Turns LLMs into VLA Controllers

cs.RO · 2026-06-19 · unverdicted · novelty 6.0

ASCII rendering of visual states enables fine-tuned text-only LLMs to serve as VLA controllers that identify objects and generate feasible action sequences in 2D manipulation benchmarks in simulation and on hardware.

Comprehensive AI governance requires addressing non-model gains

cs.CY · 2026-05-01 · unverdicted · novelty 6.0

Non-model gains via inference, systems, and assets can drive AI capabilities independently of base models, requiring governance beyond model-level evaluation and mitigation.

Can Predicted Dynamics Exist in the Physical World?

cs.RO · 2026-05-23 · unverdicted · novelty 4.0

Physical admissibility is defined as a prediction-control interface using kinematic, dynamic, and composed-horizon conditions to reject invalid dynamics proposals, with AUC 0.957 on LeRobot PushT and 87-89% prevention of invalid actions in interventions.

citing papers explorer

Showing 3 of 3 citing papers after filters.

ASCII Art Turns LLMs into VLA Controllers · cs.RO · 2026-06-19 · unverdicted · none · ref 14
Comprehensive AI governance requires addressing non-model gains · cs.CY · 2026-05-01 · unverdicted · none · ref 55
Can Predicted Dynamics Exist in the Physical World? · cs.RO · 2026-05-23 · unverdicted · none · ref 16
