pith:WUMYKSR7
DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Adding world modeling to predict future images lets vision-language-action models use large driving datasets more effectively and accelerate performance gains as data scales.
arxiv:2510.12796 v2 · 2025-10-14 · cs.CV · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{WUMYKSR74ZAYOLM2FTE775KAD2}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
we propose DriveVLA-W0, a training paradigm that employs world modeling to predict future images. ... Crucially, it amplifies the data scaling law, showing that performance gains accelerate as the training dataset size increases.
That the added world modeling task of predicting future images supplies a dense, unbiased self-supervised signal that meaningfully utilizes unused model capacity without requiring extra labels or introducing new failure modes in driving dynamics.
DriveVLA-W0 adds world modeling to predict future images in VLA models, overcoming sparse action supervision and amplifying data scaling laws on NAVSIM benchmarks and a large in-house dataset.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:14.789505Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
b519854a3fe641872d9a2cc9fff5401e9cd4a492cba5a529daca1605f6176e25
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/WUMYKSR74ZAYOLM2FTE775KAD2 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: b519854a3fe641872d9a2cc9fff5401e9cd4a492cba5a529daca1605f6176e25
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "b0c36fc3151591d8a7b9a6ce74f682f50eb4bcf76c0dc2eaedbfb740b51fb4c2",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CV",
"submitted_at": "2025-10-14T17:59:47Z",
"title_canon_sha256": "0d6fd76fc2dd6307b1b71f8456c10fcefcf50453f3ad221fbbd9a0ed2deee3a4"
},
"schema_version": "1.0",
"source": {
"id": "2510.12796",
"kind": "arxiv",
"version": 2
}
}