pith:KJDRCWTR
Large VLM-based Vision-Language-Action Models for Robotic Manipulation: A Survey
Large VLM-based VLA models for robotic manipulation can be systematically classified into monolithic and hierarchical architectures.
arxiv:2508.13073 v2 · 2025-08-18 · cs.RO · cs.CV
Record completeness
Claims
This survey provides the first systematic, taxonomy-oriented review of large VLM-based VLA models for robotic manipulation, resolving inconsistencies in existing taxonomies and filling a critical gap.
That the proposed split into monolithic (single/dual-system) and hierarchical models, along with the listed integration domains, comprehensively captures the field without significant omissions or overlaps that would undermine the taxonomy's utility.
This survey organizes large VLM-based VLA models for robotic manipulation into monolithic and hierarchical paradigms, reviews their integrations and datasets, and outlines future directions.
References
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:13.173723Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
5247115a713e0653fb1ebfbe91334c2b53750093a87c76fe467a5744a7e3bb46
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/KJDRCWTRHYDFH6Y6X67JCM2MFN \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 5247115a713e0653fb1ebfbe91334c2b53750093a87c76fe467a5744a7e3bb46
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "ca67b0f9cf1a480f3dad4c5e03c723c6e779cd4fad89a3bf2e5d3abbaf71aebb",
"cross_cats_sorted": [
"cs.CV"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.RO",
"submitted_at": "2025-08-18T16:45:48Z",
"title_canon_sha256": "5c704c940d76581b6834b4f713d011330cc8435255f2c993a4f040337734f580"
},
"schema_version": "1.0",
"source": {
"id": "2508.13073",
"kind": "arxiv",
"version": 2
}
}